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Introduction 


Today's STEM (science, technology, engineering, and mathematics) student must mas- 
ter vast quantities of applied mathematics. This is why I wrote Advanced Engineering 
Mathematics with MATLAB. Three assumptions underlie its structure: (1) All students 
need a firm grasp of the traditional disciplines of ordinary and partial differential equa- 
tions, vector calculus, and linear algebra. (2) The digital revolution will continue. Thus 
the modern student must have a strong foundation in transform methods because they 
provide the mathematical basis for electrical and communication studies. (3) The biologi- 
cal revolution will become more mathematical and require an understanding of stochastic 
(random) processes. Already, stochastic processes play an important role in finance, the 
physical sciences, and engineering. These techniques will enjoy an explosive growth in the 
biological sciences. For these reasons, an alternative title for this book could be Advanced 
Engineering Mathematics for the Twenty-First Century. 

This is my fourth attempt at realizing these goals. It continues the tradition of including 
technology into the conventional topics of engineering mathematics. Of course, I took this 
opportunity to correct misprints and include new examples, problems, and projects. I now 
use the small rectangle O to separate the end of an example or theorem from the continuing 
text. The two major changes are a section on conformal mapping (Section 10.11) and a 
new chapter on stochastic calculus. 

A major change is the reorganization of the order of the chapters. In line with my 
goals I have subdivided the material into three groups: classic engineering mathematics, 
transform methods, and stochastic processes. In its broadest form, there are two general 
tracks: 


Differential Equations Course: Most courses on differential equations cover three gen- 
eral topics: fundamental techniques and concepts, Laplace transforms, and separation of 
variable solutions to partial differential equations. 

'The course begins with first- and higher-order ordinary differential equations, Chapters 
1 and 2, respectively. After some introductory remarks, Chapter 1 devotes itself to present- 
ing general methods for solving first-order ordinary differential equations. These methods 
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include separation of variables, employing the properties of homogeneous, linear, and exact 
differential equations, and finding and using integrating factors. 

The reason most students study ordinary differential equations is for their use in ele- 
mentary physics, chemistry, and engineering courses. Because these differential equations 
contain constant coefficients, we focus on how to solve them in Chapter 2, along with 
a detailed analysis of the simple, damped, and forced harmonic oscillator. Furthermore, 
we include the commonly employed techniques of undetermined coefficients and variation 
of parameters for finding particular solutions. Finally, the special equation of Euler and 
Cauchy is included because of its use in solving partial differential equations in spherical 
coordinates. 

Some courses include techniques for solving systems of linear differential equations. A 
chapter on linear algebra (Chapter 3) is included if that is a course objective. 

After these introductory chapters, the course would next turn to Laplace transforms. 
Laplace transforms are useful in solving nonhomogeneous differential equations where the 
initial conditions have been specified and the forcing function “turns on and off" The 
general properties are explored in Section 12.1 to Section 12.7; the actual solution technique 
is presented in Section 12.8. 

Most differential equations courses conclude with a taste of partial differential equa- 
tions via the method of separation of variables. This topic usually begins with a quick 
introduction to Fourier series, Sections 5.1 to 5.4, followed by separation of variables as 
it applies to the heat (Sections 8.1-8.3), wave (Sections 7.1-7.3), or Laplace's equations 
(Sections 9.1—9.3). The exact equation that is studied depends upon the future needs of the 
students. 


Engineering Mathematics Course: This book can be used in a wide variety of engi- 
neering mathematics classes. In all cases the student should have seen most of the material 
in Chapters 1 and 2. There are at least four possible combinations: 


eOption A: The course is a continuation of a calculus reform sequence where elementary 
differential equations have been taught. This course begins with Laplace transforms and 
separation of variables techniques for the heat, wave, and/or Laplace's equations, as outlined 
above. The course then concludes with either vector calculus or linear algebra. Vector 
calculus is presented in Chapter 4 and focuses on the gradient operator as it applies to 
line integrals, surface integrals, the divergence theorem, and Stokes’ theorem. Chapter 
3 presents linear algebra as a method for solving systems of linear equations and includes 
such topics as matrices, determinants, Cramer's rule, and the solution of systems of ordinary 
differential equations via the classic eigenvalue problem. 


e Option B: This is the traditional situation where the student has already studied differen- 
tial equations in another course before he takes engineering mathematics. Here separation of 
variables is retaught from the general viewpoint of eigenfunction expansions. Sections 9.1— 
9.3 explain how any piece-wise continuous function can be reexpressed in an eigenfunction 
expansion using eigenfunctions from the classic Sturm-Liouville problem. Furthermore, we 
include two sections that focus on Bessel functions (Section 6.5) and Legendre polynomials 
(Section 6.4). These eigenfunctions appear in the solution of partial differential equations 
in cylindrical and spherical coordinates, respectively. 
The course then covers linear algebra and vector calculus as given in Option A. 


eOption C: I originally wrote this book for an engineering mathematics course given to 
sophomore and junior communication, systems, and electrical engineering majors at the 
U.S. Naval Academy. In this case, you would teach all of Chapter 10 with the possible 
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exception of Section 10.10 on Cauchy principal-value integrals. This material was added to 
prepare the student for Hilbert transforms, Chapter 14. 

Because most students come to this course with a good knowledge of differential equa- 
tions, we begin with Fourier series, Chapter 5, and proceed through Chapter 14. Chapter 11 
generalizes the Fourier series to aperiodic functions and introduces the Fourier transform. 
'This leads naturally to Laplace transforms, Chapter 12. Throughout these chapters, I make 
use of complex variables in the treatment and inversion of the transforms. 

With the rise of digital technology and its associated difference equations, a version 
of the Laplace transform, the z-transform, was developed. Chapter 13 introduces the z- 
transform by first giving its definition and then developing some of its general properties. 
We also illustrate how to compute the inverse by long division, partial fractions, and con- 
tour integration. Finally, we use z-transforms to solve difference equations, especially with 
respect to the stability of the system. 

Finally, there is a chapter on the Hilbert transform. With the explosion of interest in 
communications, today's engineer must have a command of this transform. The Hilbert 
transform is introduced in Section 14.1 and its properties are explored in Section 14.2. 
'Two important applications of Hilbert transforms are introduced in Sections 14.3 and 14.4, 
namely the concept of analytic signals and the Kramers-Kronig relationship. 


eOption D: Many engineering majors now require a course in probability and statistics 
because of the increasing use of probabilistic concepts in engineering analysis. To incorpo- 
rate this development into an engineering mathematics course we adopt a curriculum that 
begins with Fourier transforms (minus inversion by complex variables) given in Chapter 
11. The remaining portion involves the fundamental concepts of probability presented in 
Chapter 16 and random processes in Chapter 17. Chapter 16 introduces the student to 
the concepts of probability distributions, mean, and variance because these topics appear 
so frequently in random processes. Chapter 17 explores common random processes such as 
Poisson processes and birth and death. Of course, this course assumes a prior knowledge 
of ordinary differential equations and Fourier series. 

A unique aspect of this book appears in Chapter 18, which is devoted to stochastic 
calculus. We start by exploring deterministic differential equations with a stochastic forcing. 
Next, the important stochastic process of Brownian motion is developed in depth. Using this 
Brownian motion, we introduce the concept of (Itó) stochastic integration, Itó's lemma, and 
stochastic differential equations. The chapter concludes with various numerical methods to 
integrate stochastic differential equations. 


In addition to the revisions of the text and topics covered in this new addition, MATLAB 
is still employed to reinforce the concepts that are taught. Of course, this book still continues 
my principle of including a wealth of examples from the scientific and engineering literature. 
The answers to the odd problems are given in the back of the book, while worked solutions 
to all of the problems are available from the publisher. Most of the MATLAB scripts may 
be found at http://www.crcpress.com/product /isbn/9781439816240. 
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Chapter 1 
First-Order Ordinary 


Differential Equations 


A differential equation is any equation that contains the derivatives or differentials of 
one or more dependent variables with respect to one or more independent variables. Because 
many of the known physical laws are expressed as differential equations, a sound knowledge 
of how to solve them is essential. In the next two chapters we present the fundamental 
methods for solving ordinary differential equations - a differential equation that contains 
only ordinary derivatives of one or more dependent variables. Later, in Sections 11.6 and 
12.8, we show how transform methods can be used to solve ordinary differential equations, 
while systems of linear ordinary differential equations are treated in Section 3.6. Solutions 
for partial differential equations—a differential equation involving partial derivatives of one 
or more dependent variables of two or more independent variables—are given in Chapters 
7, 8, and 9. 


1.1 CLASSIFICATION OF DIFFERENTIAL EQUATIONS 
Differential equations are classified three ways: by type, order, and linearity. There 


are two types: ordinary and partial differential equations, which have already been defined. 
Examples of ordinary differential equations include 


d 
-a y =r, (1.1.1) 
(x — y) dz + 4y dy = 0, (1.1.2) 
du d 
TC — 14 5z, (1.1.3) 
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and E . 
y y : 
da + egy 95m. (1.1.4) 
On the other hand, examples of partial differential equations include 
ðu | Qu 
SELLE esed 1.1.5 
RI (1.1.5) 
Ou Ou 
— a= 2 1.1.6 
RES "IE (1.1.6) 
and ga 5 E 
EA Ed (1.1.7) 


In the examples that we have just given, we have explicitly written out the differ- 
entiation operation. However, from calculus we know that dy/dx can also be written y’. 
Similarly the partial differentiation operator 0*u/Ox?0w? is sometimes written Usryy. We 
will also use this notation from time to time. 

The order of a differential equation is given by the highest-order derivative. For exam- 


dy d dyY 
2 4 355 + (+) y = sin(x) (1.1.8) 


is a third-order ordinary differential equation. Because we can rewrite 


ple, 


(x + y)dy -— zdz =0 (1.1.9) 

as d 
(ety) =a (1.1.10) 

dx 


by dividing Equation 1.1.9 by dx, we have a first-order ordinary differential equation here. 
Finally 
tu Oru 
Ox?20y? Ot: 
is an example of a fourth-order partial differential equation. In general, we can write an 
nth-order, ordinary differential equation as 


dy d"y 
PE A ar d 1.1.12 
res dx’ , zi 0 ( ) 


(1.1.11) 


The final classification is according to whether the differential equation is linear or 
nonlinear. A differential equation is linear if it can be written in the form: 


d"y d"-ly dy 
an(s) zn t 6n) ni Ee az) 


Fao(x)y = f(x). (1.1.13) 


Note that the linear differential equation, Equation 1.1.13, has two properties: (1) The 
dependent variable y and all of its derivatives are of first degree (the power of each term 
involving y is 1). (2) Each coefficient depends only on the independent variable x. Examples 
of linear first-, second-, and third-order ordinary differential equations are 


(x + 1) dy — y dz = 0, (1.1.14) 
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y" + 3y' + 2y = e”, (1.1.15) 

and E 4 
y — CER- +y = sin(x), (1.1.16) 


respectively. If the differential equation is not linear, then it is nonlinear. 
nonlinear first-, second-, and third-order ordinary differential equations are 


Examples of 


d 
Y domu! =r, 1.1.17) 
dx 
d? dy Y 
T = (+) + 2zy = sin(z), 1.1.18) 
and 
yy" + 2y = e€”, 1.1.19) 
respectively. 


At this point it is useful to highlight certain properties that all differential equations 
have in common regardless of their type, order, and whether they are linear or not. First, it 
is not obvious that just because we can write down a differential equation, a solution exists. 
The existence of a solution to a class of differential equations constitutes an important aspect 
of the theory of differential equations. Because we are interested in differential equations 
that arise from applications, their solution should exist. In Section 1.2 we address this 
question further. 

Quite often a differential equation has the solution y = 0, a trivial solution. For 
example, if f(x) = 0 in Equation 1.1.13, a quick check shows that y = 0 is a solution. 
Trivial solutions are generally of little value. 

Another important question is how many solutions does a differential equation have? 
In physical applications uniqueness is not important because, if we are lucky enough to 
actually find a solution, then its ties to a physical problem usually suggest uniqueness. 
Nevertheless, the question of uniqueness is of considerable importance in the theory of 
differential equations. Uniqueness should not be confused with the fact that many solutions 
to ordinary differential equations contain arbitrary constants, much as indefinite integrals 
in integral calculus. A solution to a differential equation that has no arbitrary constants is 
called a particular solution. 


e Example 1.1.1 
Consider the differential equation 
=2+1, y(1) = 2. (1.1.20) 


This condition y(1) — 2 is called an initial condition and the differential equation plus the 
initial condition constitute an initial-value problem. Straightforward integration yields 


yla) = f(e+tde+ C= bh +040. (1.1.21) 


Equation 1.1.21 is the general solution to the differential equation, Equation 1.1.20, because 
it is a solution to the differential equation for every choice of C. However, if we now satisfy 
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the initial condition y(1) = 2, we obtain a particular solution. This is done by substituting 
the corresponding values of x and y into Equation 1.1.21, or 


2=43(1)?+1+C=24+C, or C= (1.1.22) 


1 
2 . 
Therefore, the solution to the initial-value problem Equation 1.1.20 is the particular solution 


ylz) = (x + 1)?/2. (1.1.23) 


” 


Finally, it must be admitted that most differential equations encountered in the “real 
world cannot be written down either explicitly or implicitly. For example, the simple differ- 
ential equation y’ = f(x) does not have an analytic solution unless you can integrate f(z). 
This begs the question of why it is useful to learn analytic techniques for solving differential 
equations that often fail us. The answer lies in the fact that differential equations that we 
can solve share many of the same properties and characteristics of differential equations 
which we can only solve numerically. Therefore, by working with and examining the dif- 
ferential equations that we can solve exactly, we develop our intuition and understanding 
about those that we can only solve numerically. 


Problems 


Find the order and state whether the following ordinary differential equations are linear or 
nonlinear: 


1. y /y =£? +g 2. y =x2+3 

3. sin(y’) = 5y 4. y" =y 

5. y" = 3a? 6. (y) 21— 3y 

7. y" =y’ 8. y" — 4y' + 5y = sin(x) 

9. y” + zy = cos(y") 10. (2x + y) dz + (x — 3y) dy = 0 
11. (1+ z2)y/ 2 (1 +y)? 12. yy" = z(y? + 1) 

13. y +y +y? =r +e” 14. y" + cos(z)y' + y = 0 

15. z2y" +a? (y) +y = e 16. y" + zy" + e? = r? 


1.2 SEPARATION OF VARIABLES 


'The simplest method of solving a first-order ordinary differential equation, if it works, is 
separation of variables. It has the advantage of handling both linear and nonlinear problems, 
especially autonomous equations.! From integral calculus, we already met this technique 
when we solved the first-order differential equation 


dy 
dod (1.2.1) 


1 An autonomous equation is a differential equation where the independent variable does not explicitly 


appear in the equation, such as y’ = f(y). 
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By multiplying both sides of Equation 1.2.1 by dx, we obtain 
dy = f(x) da. (1.2.2) 


At this point we note that the left side of Equation 1.2.2 contains only y while the right 
side is purely a function of x. Hence, we can integrate directly and find that 


y= f #0) dx + C. (1.2.3) 


For this technique to work, we must be able to rewrite the differential equation so that all 
of the y dependence appears on one side of the equation while the z dependence is on the 
other. Finally we must be able to carry out the integration on both sides of the equation. 

One of the interesting aspects of our analysis is the appearance of the arbitrary constant 
C in Equation 1.2.3. To evaluate this constant we need more information. 'The most 
common method is to require that the dependent variable give a particular value for a 
particular value of x. Because the independent variable x often denotes time, this condition 
is usually called an initial condition, even in cases when the independent variable is not 
time. 


e Example 1.2.1 


Let us solve the ordinary differential equation 


d y 
x ME (1.2.4) 
dr xy 
Because we can separate variables by rewriting Equation 1.2.4 as 
d. 
ye ? dy = ig (1.2.5) 
x 
its solution is simply 
—ye "—e ? = ln|jr| +C (1.2.6) 
by direct integration. 
e Example 1.2.2 
Let us solve 
dy n 
— +y = re"y, (1.2.7) 
dx 
subject to the initial condition y(0) = 1. 
Multiplying Equation 1.2.7 by dx, we find that 
dy + y dx = xe*y dz, (1.2.8) 


Or 
79 — (ze? — 1) dz. (1.2.9) 
y 
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A quick check shows that the left side of Equation 1.2.9 contains only the dependent variable 
y while the right side depends solely on x and we have separated the variables onto one side 
or the other. Finally, integrating both sides of this equation, we have 


In(y) = re” — e — x + C. (1.2.10) 
Since y(0) = 1, C = 1 and 


y(x) = expl(z — 1)e* +1 — zx]. (1.2.11) 


In addition to the tried-and-true method of solving ordinary differential equations by 
hand, scientific computational packages such as MATLAB provide symbolic toolboxes that 
are designed to do the work for you. In the present case, typing 


dsolve(?Dy*y-x*exp(x)*y? ,?y(0)71? ,? x?) 
yields 

ans - 
1/exp(-1)*exp(-x*x*exp(x)-exp (x) ) 
which is equivalent to Equation 1.2.11. 

Our success here should not be overly generalized. Sometimes these toolboxes give 
the answer in a rather obscure form or they fail completely. For example, in the previous 
example, MATLAB gives the answer 
ans = 
-lambertw((log(x)+C1) *exp(-1))-1 


The MATLAB function lambertw is Lambert’s W function, where w = lambertw(x) is the 
solution to we" = x. Using this definition, we can construct the solution as expressed in 
Equation 1.2.6. 


e Example 1.2.3 
Consider the nonlinear differential equation 
ry +y — 0. (1.2.12) 
Separating variables, we find that 


1 x 
E — C. = . 
e Qo MEC Cr-—1 


1 
— or = (1.2.13) 
x y 


Equation 1.2.13 shows the wide variety of solutions possible for an ordinary differential 
equation. For example, if we require that y(0) = 0, then there are infinitely many different 
solutions satisfying this initial condition because C can take on any value. On the other 
hand, if we require that y(0) — 1, there is no solution because we cannot choose any constant 
C such that y(0) — 1. Finally, if we have the initial condition that y(1) — 2, then there is 
only one possible solution corresponding to C — 3. 

Consider now the trial solution y — 0. Does it satisfy Equation 1.2.127 Yes, it does. 
On the other hand, there is no choice of C that yields this solution. The solution y — 0 is 
called a singular solution to this equation. Singular solutions are solutions to a differential 
equation that cannot be obtained from a solution with arbitrary constants. 
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Figure 1.2.1: The solution to Equation 1.2.13 when C — —2,0,2,4. 


Finally, we illustrate Equation 1.2.13 using MATLAB. This is one of MATLAB’s strengths 
— the ability to convert an abstract equation into a concrete picture. Here the MATLAB 
script 
clear 
hold on 
x = -b:0,5:5; 
for c = -2:2:4 
y =x ./ (c*x-1); 
if (c== -2) subplot(2,2,1), plot(x,y,’*’) 
axis tight; title(’c = -2’); ylabel(’y’,’Fontsize’ ,20); end 
if (c== 0) subplot(2,2,2), plot(x,y,’~’) 
axis tight; title(’c = 0’); end 
if (c== 2) subplot(2,2,3), plot(x,y,’s’) 
axis tight; title(’c = 2’); xlabel(’x’,’Fontsize’ ,20); 
ylabel(’y’,’Fontsize’,20); end 
if (c== 4) subplot(2,2,4), plot(x,y,’h’) 
axis tight; title(’c = 4’); xlabel(’x’,’Fontsize’,20); end 
end 


yields Figure 1.2.1, which illustrates Equation 1.2.13 when C = —2,0,2, and 4. 


The previous example showed that first-order ordinary differential equations may have a 
unique solution, no solution, or many solutions. From a complete study? of these equations, 
we have the following theorem: 


? The proof of the existence and uniqueness of first-order ordinary differential equations is beyond the 
scope of this book. See Ince, E. L., 1956: Ordinary Differential Equations. Dover Publications, Inc., 
Chapter 3. 
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Theorem: Existence and Uniqueness 


Suppose some real-valued function f(x,y) is continuous on some rectangle in the xy- 
plane containing the point (a,b) in its interior. Then the initial-value problem 


dy 

T 
has at least one solution on the same open interval I containing the point x = a. Further- 
more, if the partial derivative Of /Oy is continuous on that rectangle, then the solution is 
unique on some (perhaps smaller) open interval Io containing the point x = a. 


e Example 1.2.4 


Consider the initial-value problem y’ = 3y!/3/2 with y(0) = 1. Here f(x, y) = 3y!/3/2 
and fy = y 2/3 /2. Because fy is continuous over a small rectangle containing the point 
(0,1), there is a unique solution around x = 0, namely y = (x + 1)3/?, which satisfies the 
differential equation and the initial condition. On the other hand, if the initial condition 
reads y(0) — 0, then f, is not continuous on any rectangle containing the point (0,0) and 
there is no unique solution. For example, two solutions to this initial-value problem, valid 
on any open interval that includes x = 0, are yı (x) = z?/? and 


— (z — 1)92, r21l, 1.2.1 
yi) { 0, go da) 


e Example 1.2.5: Hydrostatic equation 


Consider an atmosphere where its density varies only in the vertical direction. The 
pressure at the surface equals the weight per unit horizontal area of all of the air from sea 
level to outer space. As you move upward, the amount of air remaining above decreases 
and so does the pressure. This is why we experience pressure sensations in our ears when 
ascending or descending in an elevator or airplane. If we rise the small distance dz, there 
must be a corresponding small decrease in the pressure, dp. This pressure drop must equal 
the loss of weight in the column per unit area, —pg dz. Therefore, the pressure is governed 
by the differential equation 

dp — —pg dz, (1.2.16) 


commonly called the hydrostatic equation. 

To solve Equation 1.2.16, we must express p in terms of pressure. For example, in 
an isothermal atmosphere at constant temperature T;, the ideal gas law gives p = pRTs, 
where R is the gas constant. Substituting this relationship into our differential equation 
and separating variables yields 


c ec een (1.2.17) 


p(z) = p(0) exp(- a ) (1.2.18) 
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Thus, the pressure (and density) of an isothermal atmosphere decreases exponentially with 
height. In particular, it decreases by e^! over the distance RT;/g, the so-called “scale 
height." 


e Example 1.2.6: Terminal velocity 


As an object moves through a fluid, its viscosity resists the motion. Let us find the 
motion of a mass m as it falls toward the earth under the force of gravity when the drag 
varies as the square of the velocity. 

From Newton's second law, the equation of motion is 


m—- = mg — Cpv?, (1.2.19) 


where v denotes the velocity, g is the gravitational acceleration, and Cp is the drag coeffi- 
cient. We choose the coordinate system so that a downward velocity is positive. 

Equation 1.2.19 can be solved using the technique of separation of variables if we change 
from time t as the independent variable to the distance traveled x from the point of release. 
This modification yields the differential equation 


mof? = mg — Cpr’, (1.2.20) 


since v = dx/dt. Separating the variables leads to 


vdv 
Or 
k 2 
m(1 = =) = —2kr, (1.2.22) 
g 


where k = Cp/m and v = 0 for x = 0. Taking the inverse of the natural logarithm, we 
finally obtain 
viz) = 7 ise); (1.2.23) 
Thus, as the distance that the object falls increases, so does the velocity, and it eventually 
approaches a constant value 4/g/k, commonly known as the terminal velocity. 
Because the drag coefficient Cp varies with the superficial area of the object while 
the mass depends on the volume, k increases as an object becomes smaller, resulting in a 
smaller terminal velocity. Consequently, although a human being of normal size will acquire 
a terminal velocity of approximately 120 mph, a mouse, on the other hand, can fall any 
distance without injury. 


e Example 1.2.7: Interest rate 


Consider a bank account that has been set up to pay out a constant rate of P dollars 
per year for the purchase of a car. This account has the special feature that it pays an 
annual interest rate of r on the current balance. We would like to know the balance in the 
account at any time t. 
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Although financial transactions occur at regularly spaced intervals, an excellent ap- 
proximation can be obtained by treating the amount in the account z(t) as a continuous 
function of time governed by the equation 


a(t + A) e z(t) + rx (t) At — PAt, (1.2.24) 


where we have assumed that both the payment and interest are paid in time increments 
of At. As the time between payments tends to zero, we obtain the first-order ordinary 


differential equation 
dx 


dt 


If we denote the initial deposit into this account by x(0), then at any subsequent time 


=ra—P. (1.2.25) 


a(t) = 2(O)e™ — P (e"* — 1) /r. (1.2.26) 


Although we could compute x(t) as a function of P, r, and z(0), there are only three 
separate cases that merit our close attention. If P/r > x(0), then the account will eventually 
equal zero at rt = In{P/ [P — rz(0)]]$. On the other hand, if P/r < z(0), the amount 
of money in the account will grow without bound. Finally, the case z(0) = P/r is the 
equilibrium case where the amount of money paid out balances the growth of money due 
to interest so that the account always has the balance of P/r. 


e Example 1.2.8: Steady-state flow of heat 


When the inner and outer walls of a body, for example the inner and outer walls 
of a house, are maintained at different constant temperatures, heat will flow from the 
warmer wall to the colder one. When each surface parallel to a wall has attained a constant 
temperature, the flow of heat has reached a steady state. In a steady-state flow of heat, 
each surface parallel to a wall, because its temperature is now constant, is referred to as an 
isothermal surface. Isothermal surfaces at different distances from an interior wall will have 
different temperatures. In many cases the temperature of an isothermal surface is only a 
function of its distance x from the interior wall, and the rate of flow of heat Q in a unit 
time across such a surface is proportional both to the area A of the surface and to dT/dz, 
where T' is the temperature of the isothermal surface. Hence, 


Q=-KA—, (1.2.27) 


where & is called the thermal conductivity of the material between the walls. 
In place of a flat wall, let us consider a hollow cylinder whose inner and outer surfaces 
are located at r = rı and r = r2, respectively. At steady state, Equation 1.2.27 becomes 


dT 


— = AEQ (1.2.28) 
dr 


Qr = KA " 
assuming no heat generation within the cylindrical wall. 

We can find the temperature distribution inside the cylinder by solving Equation 1.2.28 
along with the appropriate conditions on T(r) at r = rı and r = r2 (the boundary con- 
ditions). To illustrate the wide choice of possible boundary conditions, let us require that 
inner surface is maintained at the temperature Tı. We assume that along the outer surface 
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heat is lost by convection to the environment, which has the temperature T,,. This heat 
loss is usually modeled by the equation 


dT 


ia ==A(T —T,,), (1.2.29) 
dr |... 


T2 


where h > 0 is the convective heat transfer coefficient. Upon integrating Equation 1.2.28, 


Qr. 
2n&L 


T(r) = In(r) + C, (1.2.30) 


where Qr is also an unknown. Substituting Equation 1.2.30 into the boundary conditions, 


we obtain 
Qr 


T(r)=T 
o 1+ 27K. 


L In(ri/r), (1.2.31) 


with 
2n&L(Ti — To.) 


K/T3 + hln(ra/ri)' 


Q= (1.2.32) 
As rg increases, the first term in the denominator of Equation 1.2.32 decreases while the 
second term increases. Therefore, Q, has its largest magnitude when the denominator is 
smallest, assuming a fixed numerator. This occurs at the critical radius rer = x/h, where 


2n&L(Ti — To) 


Ons E 1+ In(rep-/ri) ` 


(1.2.33) 


e Example 1.2.9: Population dynamics 


Consider a population P(t) that can change only by a birth or death but not by immi- 
gration or emigration. If B(t) and D(t) denote the number of births or deaths, respectively, 
as a function of time t, the birth rate and death rate (in births or deaths per unit time) is 


B(t+At)-B(t) 1dB 


= Ee P(OA ~ P dt’ QE) 
Bs D(t--At) - D(t) 1dD 
tAD- 
d EP cs P(OA Pd (1:235) 
Now, 
P'(t) = lim PUE xi PU) 1.2.36) 
— qs B+ At) - BY] - [D(t + At) - DO) eee) 
At—0 At 
— B'(t) — D'(t). 1.2.38) 
Therefore, 
P'(t) = [b(t) — d(t)] P(t). 1.2.39) 
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When the birth and death rates are constants, namely b and d, respectively, the pop- 
ulation evolves according to 


P(t) = P(0) exp [(b — d) t] . (1.2.40) 


e Example 1.2.10: Logistic equation 


The study of population dynamics yields an important class of first-order, nonlinear, 
ordinary differential equations: the logistic equation. This equation arose in Pierre Francois 
Verhulst’s (1804-1849) study of animal populations.? If x(t) denotes the number of species 
in the population and k is the (constant) environment capacity (the number of species that 
can simultaneously live in the geographical region), then the logistic or Verhulst’s equation 
is 

x’ — ax(k — x)/k, (1.2.41) 


where a is the population growth rate for a small number of species. 
To solve Equation 1.2.41, we rewrite it as 


B. eiu cM. pid (1.2.42) 
(l-a/k)e x 1-a/k 
Integration yields 
In |z| — In |1 — z/k| = rt + In(C), (1.2.43) 
Or 
—~_=¢e (1.2.44) 
1— ak S 
If z(0) = zo, 
k 
a(t) = 2 (1.2.45) 


zo + (k — zo)e-"t* 


As t > oo, x(t) > k, the asymptotically stable solution. 
e Example 1.2.11: Chemical reactions 


Chemical reactions are often governed by first-order ordinary differential equations. 


For example, first-order reactions, which describe reactions of the form A E3 B, yield the 


differential equation 


1d[A] _ 
Ec k[A], (1.2.46) 


where k is the rate at which the reaction is taking place. Because for every molecule of A 
that disappears one molecule of B is produced, a — 1 and Equation 1.2.46 becomes 


——— = k[A]. (1.2.47) 


3 Verhulst, P. F., 1838: Notice sur la loi que la population suit dans son accroissement. Correspond. 
Math. Phys., 10, 113-121. 
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Integration of Equation 1.2.47 leads to 


-E zal dt. (1.2.48) 


If we denote the initial value of [A] by [A]o, then integration yields 
— In [A] = kt — In [A];, (1.2.49) 


Or 

[A] = [A] e. (1.2.50) 
The exponential form of the solution suggests that there is a time constant T, which is called 
the decay time of the reaction. This quantity gives the time required for the concentration 
of decrease by 1/e of its initial value [A]y. It is given by 7 = 1/k. 

Turning to second-order reactions, there are two cases. The first is a reaction between 

two identical species: A + A = products. The rate expression here is 

1 d|A] 2 

—-—— —EK|AJ.. 1.2.51 

; = kA] (1.2.51) 

The second case is an overall second-order reaction between two unlike species, given by A 


+B 5 X. In this case, the reaction is first order in each of the reactants A and B and the 
rate expression is 


A 
EB — k[A][B]. (1.2.52) 
dt 
Turning to Equation 1.2.51 first, we have by separation of variables 
[A] t 
-f dA) - | dr, (1.2.53) 
[Alo [A] 0 
Or 
1 1 
A nk. (1.2.54) 
[A] [A] 


Therefore, a plot of the inverse of A versus time will yield a straight line with slope equal 
to 2k and intercept 1/[A]p. 
With regard to Equation 1.2.52, because an increase in X must be at the expense 
of A and B, it is useful to express the rate equation in terms of the concentration of X, 
[X] = [A], — [A] = [B]; — [B], where [A], and [B], are the initial concentrations. Then, this 
equation becomes 
d[X] 


24 = k (Alo — [XD (Blo - IX). (1.2.55) 
Separation of variables leads to 
[X] d£ 7 t 
he (rer eine * nee) 


To integrate the left side, we rewrite the integral 


dé = dé dé 
| ws = | [= BLS | TRISTE TUR (1.2.57) 
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Carrying out the integration, 


Lf (BIA L 
[A], — [Blo (ance = kt. (1.2.58) 


Again the reaction rate constant k can be found by plotting the data in the form of the left 
side of Equation 1.2.58 against t. 


Problems 


For Problems 1-10, solve the following ordinary differential equations by separation of 
variables. Then use MATLAB to plot your solution. Try and find the symbolic solution 
using MATLAB’s dsolve. 


i d 
1. E c ae 2. (12-3?) dz — (12-2?) dy =0 3. mo) = xy 
2 2 
4, 9.09 —1 gru eque o, WE 
x dx TOP dx ytay dr (xy) 
dy ty dy 3 2 
i uid Mi d F1 
ur 8. = (a? +5)(y? +1) 


where a and b are constants. 
10. Setting u = y — x, solve the first-order ordinary differential equation 


dy y-t 
dr x? 


+1. 


11. Using the hydrostatic equation, show that the pressure within an atmosphere where 
the temperature decreases uniformly with height, T(z) = To — Tz, varies as 


where po is the pressure at z = 0. 


12. Using the hydrostatic equation, show that the pressure within an atmosphere with the 
temperature distribution 


|. J To- F2, 0<z 
Te)={ org H< 
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is 


g/ (RT) 
(= T =) 0<2<H, 
0 
p) — P on pye g(z — H) 
2 exp|— ; H < z, 
To R(To — TH) 


where po is the pressure at z = 0. 


13. The voltage V as a function of time ¢ within an electrical circuit* consisting of a 
capacitor with capacitance C and a diode in series is governed by the first-order ordinary 
differential equation 


where R and S are positive constants. If the circuit initially has a voltage Vo at t = 0, find 
the voltage at subsequent times. 


14. A glow plug is an electrical element inside a reaction chamber, which either ignites the 
nearby fuel or warms the air in the chamber so that the ignition will occur more quickly. 
An accurate prediction of the wire’s temperature is important in the design of the chamber. 

Assuming that heat convection and conduction are not important,” the temperature T 
of the wire is governed by 


where A equals the specific heat of the wire times its mass, B equals the product of the 
emissivity of the surrounding fluid times the wire’s surface area times the Stefan-Boltzmann 
constant, T, is the temperature of the surrounding fluid, and P is the power input. The 
temperature increases due to electrical resistance and is reduced by radiation to the sur- 
rounding fluid. 

Show that the temperature is given by 


D afu () (2) rs 


where 44^ = P/B + T? and To is the initial temperature of the wire. 


15. Let us denote the number of tumor cells by N(t). Then a widely used deterministic 
tumor growth law? is 

dN 

— — bN In(K/N 

N = ON TI(KIN), 
where K is the largest tumor size and 1/b is the length of time required for the specific 
growth to decrease by 1/e. If the initial value of N(t) is N(0), find N(t) at any subsequent 
time t. 


^ See Aiken, C. B., 1938: Theory of the diode voltmeter. Proc. IRE, 26, 859-876. 
5 See Clark, S. K., 1956: Heat-up time of wire glow plugs. Jet Propulsion, 26, 278-279. 


6 See Hanson, F. B., and C. Tier, 1982: A stochastic model of tumor growth. Math. Biosci., 61, 73-100. 
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16. The drop in laser intensity in the direction of propagation x due to one- and two-photon 
absorption in photosensitive glass is governed’ by 


— =al - Br, 


where I is the laser intensity, a and 6 are the single-photon and two-photon coefficients, 
respectively. Show that the laser intensity distribution is 

al(0)e-?* 
a+ 8I(0) (1 — e727)’ 


(a) = 
where 7(0) is the laser intensity at the entry point of the media, x = 0. 


17. The third-order reaction A + B + C 5 X is governed by the kinetics equation 


as = k ([A]g — [X]) ([Blo — [X]) ([C]p — [X]) . 


where [A]o, [B]o, and [C]o denote the initial concentration of A, B, and C, respectively. Find 
how [X] varies with time t. 


ki 
18. The reversible reaction A = B is described by the kinetics equation? 
"2 


d[X] 

-z 7 (Alo — [XI — k2 ([B]o + [X], 
where [X] denotes the increase in the concentration of B while [A]o and [B]o are the initial 
concentrations of A and B, respectively. Find [X] as a function of time t. Hint: Show that 
this differential equation can be written 


kı [A]o m k2[B]o 
kı + k2 


= (kı — k2) (a + [X]), d 


1.3 HOMOGENEOUS EQUATIONS 
A homogeneous ordinary differential equation is a differential equation of the form 
M(a,y) dz + N(z,y) dy = 0, (1.3.1) 
where both M (xz, y) and N(z,y) are homogeneous functions of the same degree n. That 
means: M(tz,ty) = t” M(x,y) and N(tz,ty) = t*N(x,y). For example, the ordinary 


differential equation 
(x? + y?) dz + (x? — ay) dy =0 (1.3.2) 


7 See Weitzman, P. S., and U. Osterberg, 1996: Two-photon absorption and photoconductivity in 
photosensitive glasses. J. Appl. Phys., 79, 8648-8655. 


8 See Kiister, F. W., 1895: Ueber den Verlauf einer umkehrbaren Reaktion erster Ordnung in homogenem 
System. Z. Physik. Chem., 18, 171-179. 
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is a homogeneous equation because both coefficients are homogeneous functions of degree 
2: 

M (tz,ty) =a? + Py? = (2? - y?) 2 OM(z, y), (1.3.3) 
and 


N(ta, ty) = Px? — Pay = P (a? — zy) = N(a, y). (1.3.4) 


Why is it useful to recognize homogeneous ordinary differential equations? Let us set 
y = uz so that Equation 1.3.2 becomes 


(a? + u?z?) dz + (x? — uz?)(udxz + z du) = 0. (1.3.5) 
Then, 
z?(1-- u) dz + z?(1— u) du = 0, (1.3.6) 
1- d. 
“ du+ Z =0, (1.3.7) 
lcu x 
or ; P 
x 
14 du = 0. 1.3.8 
( qm -) u+- 0 (1.3.8) 
Integrating Equation 1.3.8, 
—u + 21n|1 + u| + In|z| = ln|c], (1.3.9) 
4 2In|1 2 + Infa| = Inlel, (1.3.10) 
x x 
2 
in| EH Eu (1.3.11) 
Cx x 
or 
(x+y)? = cze”, (1.3.12) 
Problems 


First show that the following differential equations are homogeneous and then find their 
solution. Then use MATLAB to plot your solution. Try and find the symbolic solution using 
MATLAB’s dsolve. 


dy 2 dy u dy x 2 2 
1. (tyg TY 2. (z +y) TITY 3. ry c (t Ty) 
d 
4. x(a e y) —-y(r-y) 5. £y =y+2y/Ty 6. zy =y- yr? +y? 
7. y' = sec(y/x) + y/x 8. y! = eV? +y/x. 


1.4 EXACT EQUATIONS 


Consider the multivariable function z = f(x,y). Then the total derivative is 


dz = OF aus. quss E dark Now) ay, (1.4.1) 
Ox Oy 
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If the solution to a first-order ordinary differential equation can be written as f(x,y) = c, 
then the corresponding differential equation is 


M (x, y) dz + N(z,y) dy = 0. (1.4.2) 


How do we know if we have an exact equation, Equation 1.4.2? From the definition of 
M(x,y) and N(z, y), 
OM _ 0? f E 0? f _ oN 
Oy OyOx OxdOy Ox’ 
if M(x,y) and N(z, y) and their first-order partial derivatives are continuous. Consequently, 
if we can show that our ordinary differential equation is exact, we can integrate 


(1.4.3) 


A = Mz, y) and " = N(z,y) (1.4.4) 
to find the solution f(x,y) = c. 
e Example 1.4.1 
Let us check and see if 
[y? cos(x) — 3z?y — 2a] dx + [2y sin(x) — z? + In(y)] dy = 0 (1.4.5) 


is exact. 
Since M (z, y) = y? cos(x) — 3z?y — 2x, and N(z, y) = 2ysin(x) — x3 + In(y), we find 


that 
OM 


je | mt 3 — 2 
"m 2y cos(x) — 3x^, (1.4.6) 
and A 
ON — 2 
= 2y cos(x) — 32°. (1.4.7) 


Because N, = M,, Equation 1.4.5 is an exact equation. 
e Example 1.4.2 


Because Equation 1.4.5 is an exact equation, let us find its solution. Starting with 


o 
A = M(z,y) = y? cos(x) — 3x?°y — 2x, (1.4.8) 
z 
direct integration gives 
f(x,y) = y’ sin(x) — z?y — a^ + g(y). (1.4.9) 


Substituting Equation 1.4.9 into the equation fy = N, we obtain 


Fe 2ysin(z) ~ 2° + gy) = 2ysin(z) - 2° + nly). (1.4.10) 


Thus, g'(y) = In(y), or gly) = yln(y)— y + C. Therefore, the solution to the ordinary 
differential equation, Equation 1.4.5, is 


y? sin(x) — 2?y — 2? + yln(y) - y = c. (1.4.11) 
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e Example 1.4.3 
Consider the differential equation 
(x + y) dz + zln(x)dy = 0 (1.4.12) 
on the interval (0,00). A quick check shows that Equation 1.4.12 is not exact since 
OM — 1 ON 


mcm and az 1+ In(z). (1.4.13) 


However, if we multiply Equation 1.4.12 by 1/a so that it becomes 
(1 + 3 dx + ln(x) dy = 0, (1.4.14) 
x 


then this modified differential equation is exact because 


OM 1 ON 1 
po M a E 1.4.15 
Oy xr’ and ðr zx ( ) 
Therefore, the solution to Equation 1.4.12 is 
z 4 yln(x) = C. (1.4.16) 


'This mysterious function that converts an inexact differential equation into an exact one 
is called an integrating factor. Unfortunately there is no general rule for finding one unless 
the equation is linear. 


Problems 


Show that the following equations are exact. Then solve them, using MATLAB to plot them. 
Finally, try and find the symbolic solution using MATLAB’s dsolve. 


1. Qryy! = x? — y? 2. (¢+y)y +y=2 
3. (y? — 1) dz + [2ry — sin(y)] dy = 0 4. [sin(y) — 2zy + z?] dx 

+[x cos(y) — z?] dy = 0 
5. —ydz/2z? + (1/z + 1/y) dy 5 0 6. (32? — 6ry) dx — (3x? + 2y) dy = 0 
7. y sin(zy) dz + xsin(xy) dy = 0 8. (2zy? + 32?) dx + 2a4?y dy = 0 
9. (2zy? + 5x y) dx 10. (z? + y/z) dx + [y? + In(x)] dy = 0 


+(3x7y? + a? + 1) dy =0 


11. [x + e^? + zln(y)] dy 12. cos(4y?) dz — 8ry sin(4y?) dy = 0 
+[y ln(y) + e*] dr = 0 


13. sin? (x + y) dx — cos? (x + y) dy = 0 
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14. Show that the integrating factor for (x — y)y' + oy(1 — y) = 0 is u(y) = y^/(1— y)^*?, 
a+1=1/a. Then show that the solution is 


yet! y £o E 
oq yy / Gp cc 


1.5 LINEAR EQUATIONS 


In the case of first-order ordinary differential equations, any differential equation of the 


form F 
ai) 5 + ao(x)y = f(x) (1.5.1) 
is said to be linear. 


Consider now the linear ordinary differential equation 


d 
a — dy = rêe? (1.5.2) 
dx 
or "m 
y 5,cr 
CT ym . 1.5. 
ee e (1.5.3) 


Let us now multiply Equation 1.5.3 by x74. (How we knew that it should be x~* and not 
something else will be addressed shortly.) This magical factor is called an integrating factor 
because Equation 1.5.3 can be rewritten 


ldy 4 , 
uu — "x — Te , (1.5.4) 
Or d 
VN c 
au) — (1.5.5) 


Thus, our introduction of the integrating factor z-^ allows us to use the differentiation 
product rule in reverse and collapse the right side of Equation 1.5.4 into a single x derivative 
of a function of x times y. If we had selected the incorrect integrating factor, the right side 
would not have collapsed into this useful form. 

With Equation 1.5.5, we may integrate both sides and find that 


T x n dz +C, (1.5.6) 
or 
D. = (r-1)e* 4 C, (1.5.7) 
z 
or 
y = f(x — De® + Crt. (1.5.8) 


From this example, it is clear that finding the integrating factor is crucial to solving 
first-order, linear, ordinary differential equations. To do this, let us first rewrite Equation 
1.5.1 by dividing through by aı(x) so that it becomes 


^! 4 P(x)y = Qo), (15.9) 
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or 
dy + [P(x)y — Q(x)] dz = 0. (1.5.10) 


If we denote the integrating factor by u(x), then 
u(z)dy + u(z)[P(z)y — Q(x)] dx = 0. (1.5.11) 


Clearly, we can solve Equation 1.5.11 by direct integration if it is an exact equation. If this 
is true, then 


oe = a; HEP- QUI. (1.5.12) 
a = u(z)P(x), and t = P(x) da. (1.5.13) 

Integrating Equation 1.5.13, 
ulz) = epl f P(E) as : (1.5.14) 


Note that we do not need a constant of integration in Equation 1.5.14 because Equation 
1.5.11 is unaffected by a constant multiple. It is also interesting that the integrating factor 
only depends on P(x) and not Q(z). 

We can summarize our findings in the following theorem. 


Theorem: Linear First-Order Equation 


If the functions P(x) and Q(x) are continuous on the open interval I containing the 
point zo, then the initial-value problem 


has a unique solution y(x) on I, given by 


GC 1 * 
T EP + J Q(O)ulé) dé 


with an appropriate value of C, and u(x) is defined by Equation 1.5.14. 


The procedure for implementing this theorem is as follows: 


e Step 1: If necessary, divide the differential equation by the coefficient of dy/dx. This 
gives an equation of the form Equation 1.5.9 and we can find P(x) by inspection. 


e Step 2: Find the integrating factor by Equation 1.5.14. 
e Step 3: Multiply the equation created in Step 1 by the integrating factor. 
e Step 4: Run the derivative product rule in reverse, collapsing the left side of the 


differential equation into the form d[u(x)y]/dx. If you are unable to do this, you have 
made a mistake. 
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e Step 5: Integrate both sides of the differential equation to find the solution. 
'The following examples illustrate the technique. 
e Example 1.5.1 
Let us solve the linear, first-order ordinary differential equation 
xy’ — y = 4r ln(x). (1.5.15) 


We begin by dividing through by x to convert Equation 1.5.15 into its canonical form. 
This yields 


1 
y — —y = Aln(z). (1.5.16) 
x 
From Equation 1.5.16, we see that P(x) = 1/x. Consequently, from Equation 1.5.14, we 
have that 
u(x) = exp P(£)d£| = exp| — n =-. (1.5.17) 
x 
Multiplying Equation 1.5.16 by the integrating factor, we find that 


y y _ 4ln(x) 


1.5.18 
r r zo! (eee) 
d AI 
(2) ae: (1.5.19) 
dx Nx x 
Integrating both sides of Equation 1.5.19, 
I 
a af 20 eld e t (1.5.20) 
x x 
Multiplying Equation 1.5.20 through by x yields the general solution 
y = 2x In? (x) + Cz. (1.5.21) 


Although it is nice to have a closed-form solution, considerable insight can be gained 
by graphing the solution for a wide variety of initial conditions. To illustrate this, consider 
the MATLAB script 


clear 
^ use symbolic toolbox to solve Equation 1.5.15 
y = dsolve(’x*Dy-y=4*x*log(x)’,’y(1) = c?,?x?); 
^4 take the symbolic version of the solution 
^ and convert it into executable code 
solution = inline(vectorize(y),’x’,’c’); 
close all; axes; hold on 
^ now plot the solution for a wide variety of initial conditions 
x = 0.1:0.1:2; 
for c = -2:4 

if (c==-2) plot(x,solution(x,c),’.’); end 

if (c==-1) plot(x,solution(x,c),’o’); end 

if (c== 0) plot(x,solution(x,c),’x’); end 
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Figure 1.5.1: The solution to Equation 1.5.15 when the initial condition is y(1) — c. 


if (c== 1) plot(x,solution(x,c),’+’); end 

if (c== 2) plot(x,solution(x,c),’*’); end 

if (c== 3) plot(x,solution(x,c),’s’); end 

if (c== 4) plot(x,solution(x,c),’d’); end 
end 
axis tight 
xlabel(’x’,’Fontsize’,20); ylabel(’y’,’Fontsize’ ,20) 
legend(’c = -2’?,’c = -1’,’c = 0’,’c = 1’,... 

|o = 2?,’c = 3’,’c = 4’); legend boxoff 

This script does two things. First, it uses MATLAB’s symbolic toolbox to solve Equa- 
tion 1.5.15. Alternatively, we could have used Equation 1.5.21 and introduced it as a 
function. The second portion of this script plots this solution for y(1) = C where C = 
—2,—1,0,1,2,3,4. Figure 1.5.1 shows the results. As r — 0, we note how all of the 
solutions behave like 2x In? (a). 


e Example 1.5.2 


Let us solve the first-order ordinary differential equation 


d 
PU a cH (1.5.22) 
dv y-2 
subject to the initial condition y(2) — 6. 
Beginning as before, we rewrite Equation 1.5.22 in the canonical form 
(y — z)y — y — 0. (1.5.23) 


Examining Equation 1.5.23 more closely, we see that it is a nonlinear equation in y. On the 
other hand, if we treat x as the dependent variable and y as the independent variable, we 
can write Equation 1.5.23 as the linear equation 


— 1. (1.5.24) 


24 Advanced Engineering Mathematics with MATLAB 


RO 


.- GS E — 


Figure 1.5.2: Schematic diagram for an electric circuit that contains a resistor of resistance R and an 
inductor of inductance L. 


Proceeding as before, we have that P(y) = 1/y and u(y) = y so that Equation 1.5.24 
can be rewritten 


d 
g 99-9 (1.5.25) 
y 
Or 
yx = iy? C. (1.5.26) 
Introducing the initial condition, we find that C — —6. Solving for y, we obtain 


y-—rcwvza?-12. (1.5.27) 


We must take the positive sign in order that y(2) = 6 and 


y—rdwya?-12. (1.5.28) 


e Example 1.5.3: Electric circuits 


A rich source of first-order differential equations is the analysis of simple electrical 
circuits. These electrical circuits are constructed from three fundamental components: the 
resistor, the inductor, and the capacitor. Each of these devices gives the following voltage 
drop: In the case of a resistor, the voltage drop equals the product of the resistance R 
times the current J. For the inductor, the voltage drop is L dI/dt, where L is called the 
inductance, while the voltage drop for a capacitor equals Q/C, where Q is the instantaneous 
charge and C is called the capacitance. 

How are these voltage drops applied to mathematically describe an electrical circuit? 
This question leads to one of the fundamental laws in physics, Kirchhoff's law: The alge- 
braic sum of all the voltage drops around an electric loop or circuit is zero. 

To illustrate Kirchhoff's law, consider the electrical circuit shown in Figure 1.5.2. By 
Kirchhoff's law, the electromotive force E, provided by a battery, for example, equals the 
sum of the voltage drops across the resistor RI and L dI/dt. Thus the (differential) equation 


that governs this circuit is 


dl 
Lot RI-E. (1.5.29) 


Assuming that E, J, and R are constant, we can rewrite Equation 1.5.29 as 


d [ mL _ = nu 
zle ro] = de (1.5.30) 
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Figure 1.5.3: The temporal evolution of current I(t) inside an electrical circuit shown in Figure 1.5.2 with 
a constant electromotive force E. 


Integrating both sides of Equation 1.5.30, 


E 
e™*/L T(t) = mi + Ci, (1.5.31) 


or 
E 
I() 5+ pug UP. (1.5.32) 


To determine C1, we apply the initial condition. Because the circuit is initially dead, 
I(0) = 0, and 


I(t) = 5 (1 E a) (1.5.33) 


Figure 1.5.3 illustrates Equation 1.5.33 as a function of time. Initially the current increases 
rapidly but the growth slows with time. Note that we could also have solved this problem 
by separation of variables. 

Quite often, the solution is separated into two parts: the steady-state solution and the 
transient solution. The steady-state solution is that portion of the solution which remains 
as t — oo. It can equal zero. Presently it equals the constant value, E/R. The transient 
solution is that portion of the solution which vanishes as time increases. Here it equals 
—Ee- PUT. / R, 

Although our analysis is a useful approximation to the real world, a more realistic one 
would include the nonlinear properties of the resistor.” To illustrate this, consider the case 
of an RL circuit without any electromotive source (E = 0) where the initial value for the 
current is Jo. Equation 1.5.29 now reads 


dI 


L +RI(1-aI)=0,  I(0-1. (1.5.34) 


Separating the variables, 


dl di dl R 
— = P 1. . 
maD Faia T z^ yan 


9 For the analysis of 
dI B 
Li + RI + KI” =0, 


see Fairweather, A., and J. Ingham, 1941: Subsidence transients in circuits containing a non-linear resistor, 
with reference to the problem of spark-quenching. J. IEE, Part 1, 88, 330-339. 


26 Advanced Engineering Mathematics with MATLAB 


f 
0.0 0.5 1.0 1.5 2.0 2.5 3.0 
RUL 


Figure 1.5.4: The variation of current I/Io as a function of time Rt/L with different values of alo. 


Upon integrating and applying the initial condition, we have that 
[gen RU 


H 1- alo + alge- RUL' 


(1.5.36) 


Figure 1.5.4 shows I(t) for various values of a. As the nonlinearity reduces resistance, 
the decay in the current is reduced. If alọ > 1, Equation 1.5.36 predicts that the current 
would grow with time. The point here is that nonlinearity can have a dramatic influence 
on a physical system. 

Consider now the electrical circuit shown in Figure 1.5.5, which contains a resistor with 
resistance R and a capacitor with capacitance C. Here the voltage drop across the resistor 
is still RI while the voltage drop across the capacitor is Q/C. Therefore, by Kirchhoff's 
law 

Q 


RI t = =E. 1.5.37 
+5 (1.5.37) 


, 


Equation 1.5.37 is not a differential equation. However, because current is the time rate of 
change in charge I = dQ/dt, our differential equation becomes 

dQ Q 

R= +% =E, 1.5.38 

dt C ( ) 

which is the differential equation for the instantaneous charge. 
Let us solve Equation 1.5.38 when the resistance and capacitance are constant but the 
electromotive force equals Eo cos(wt). The corresponding differential equation is now 
dQ Q 


RO + G^ Eo cos(wt). (1.5.39) 


The differential equation has the integrating factor e'/(*C) so that it can be rewritten 


d | t/RC) — Eo 4j(RC) 
E le Q()| = e CO cos(ut). (1.5.40) 
Integrating Equation 1.5.40, 
gH POG) CEo __t/(RC) [cos(wt) + RCw sin(wt)| + C; (1.5.41) 


> 1+ R2C2W2 
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Figure 1.5.5: Schematic diagram for an electric circuit that contains a resistor of resistance R and a 
capacitor of capacitance C. 


Or 
C Eo 


If we take the initial condition as Q(0) — 0, then the final solution is 


C'Eo -t/(RC) 
QW) = PA [cos (wt) — + RCwsin(wt)) . (1.5.43) 
Figure 1.5.6 illustrates Equation 1.5.43. Note how the circuit eventually supports a purely 
oscillatory solution (the steady-state solution) as the exponential term decays to zero (the 
transient solution). Indeed, the purpose of the transient solution is to allow the system to 
adjust from its initial condition to the final steady state. 


e Example 1.5.4: Terminal velocity 


When an object passes through a fluid, the viscosity of the fluid resists the motion by 
exerting a force on the object proportional to its velocity. Let us find the motion of a mass 
m that is initially thrown upward with the speed vo. 

If we choose the coordinate system so that it increases in the vertical direction, then 
the equation of motion is 


m— = —kv — mg (1.5.44) 


with v(0) — vo and k » 0. Rewriting Equation 1.5.44, we obtain the first-order linear 
differential equation 


— +v = g. (1.5.45) 
m 


Its solution in nondimensional form is 


w ja (: + c gm. (1.5.46) 


mg 


'The displacement from its initial position is 


2 2 


m?g m?g m 
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Figure 1.5.6: The temporal evolution of the nondimensional charge (1 + R?C?w?)Q(t) /(CEo) in the 
electric circuit shown in Figure 1.5.4 as a function of nondimensional time wt when the circuit is driven by 
the electromotive force Eo cos(wt) and RCw = 2. 


As t — oc, the velocity tends to a constant downward value, —mg/k, the so-called “terminal 
velocity," where the aerodynamic drag balances the gravitational acceleration. This is the 
steady-state solution. 

Why have we written Equation 1.5.46 and Equation 1.5.47 in this nondimensional 
form? There are two reasons. First, the solution reduces to three fundamental variables, 
a nondimensional displacement x. = K?z(t)/(m?g), velocity v, = kv(t)/(mg), and time 
t, = kt/m, rather than the six original parameters and variables: g, k, m, t, v, and z. 
Indeed, if we had substituted tą, vx, and x, into Equation 1.5.45, we would have obtained 
the following simplified initial-value problem: 

u kvo 


dvs da. k2 
” Sty u= s dud) P 
dt, mg m?g 


(1.5.48) 


right from the start. The second advantage of the nondimensional form is the compact 
manner in which the results can be displayed, as Figure 1.5.7 shows. 

From Equation 1.5.46 and Equation 1.5.47, the trajectory of the ball is as follows: If 
we define the coordinate system so that zo = 0, then the object will initially rise to the 
height H given by 


KH k k 
ie (1 + 2) (1.5.49) 
m?g mg mg 
at the time " : 
tman 
Semat. e (: + mx) , (1.5.50) 
m mg 


when v(tmax) = 0. It will then fall toward the earth. Given sufficient time kt/m > 1, it 
would achieve terminal velocity. 


e Example 1.5.5: The Bernoulli equation 


Bernoulli's equation, 


» + p(z)y=a(a)y”, n#0,1, (1.5.51) 
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Figure 1.5.7: The nondimensional displacement k?z(t)/(m?g) as a function of nondimensional time kt/m 
of an object of mass m thrown upward at the initial nondimensional speed v = kvo/(mg) in a fluid that 
retards its motion as —kv. 


is a first-order, nonlinear differential equation. This equation can be transformed into a 
first-order, linear differential equation by introducing the change of variable z = yl^". 
Because 


=(1 a 1.5.52 
os o (Lo my e, (1.5.52) 
the transformed Bernoulli equation becomes 
dz 
d T (1-mn)p(x)z = (1— n)q(x). (1.5.53) 


This is now a first-order linear differential equation for z and can be solved using the 
methods introduced in this section. Once z is known, the solution is found by transforming 
back from z to y. 

To illustrate this procedure, consider the nonlinear ordinary differential equation 


d 
ayo — gy? — 1, (1.5.54) 
dx 
Or j i 
Y vy vy 
— — mL. 1.5.55 
dr c x ( ) 
Equation 1.5.55 is a Bernoulli equation with p(x) = —1/z, q(x) = 1/a?, and n = —1. 
Introducing z = y?, it becomes 
dz 2z 2 (1.5.56) 
dx xz r? n 


This first-order linear differential equation has the integrating factor u(x) = 1/z? and 


d/z 2 
— {| — ] = —. 1.5.57 
dx (=) xt ( ) 
Integration gives 
2 
ET EN (1.5.58) 
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Therefore, the general solution is 


y ege ec (1.5.59) 


Problems 


Find the solution for the following differential equations. State the interval on which the 
general solution is valid. Then use MATLAB to examine their behavior for a wide class of 
initial conditions. 


ly +y=e 2. y -2xy — x 

3. z?y' - zy —-1 4. (2y + xz?) dz = x dy 

5. y! — 3y/x = 22? 6. y' + 2y = 2sin(x) 

T. y' + 2cos(2x)y = 0 8. cy’ +y = ln(x) 

9. y +3y=4, y(0)=5 10. y —y=e"/x, y(e) 20 

11. sin(z)y' + cos(z)y = 1 12. [1 — cos(z)]y' + 2sin(z)y = tan(x) 


13. y’ + [atan(x) + bsec(x)]y = csec(x) 14. (zy - y — 1) dz + zdy —0 


in(2 2k 1 
2 4w x3 


15. y' + 2ay = ) 629. y(1) — 0. 


17. Solve the following initial-value problem: 


dy 2 


keys — y —*, — y(1)—0. 
x 


Hint: Introduce the new dependent variable p — y?. 


18. If x(t) denotes the equity capital of a company, then under certain assumptions? z(t) 


is governed by 
BE i uN emos 
— = (1— N)rx 
dt , 
where N is the dividend payout ratio, r is the rate of return of equity, and S is the rate of 


net new stock financing. If the initial value of x(t) is x(0), find z(t). 


19. The assimilation!! of a drug into a body can be modeled by the chemical reaction A 5 
B5 C, which is governed by the chemical kinetics equations 
d|A] 
dt 


=-  fluuwoam 9S agg, 


10 See Lebowitz, J. L., C. O. Lee, and P. B. Linhart, 1976: Some effects of inflation on a firm with 
original cost depreciation. Bell J. Economics, T, 463-477. 


11 See Calder, G. V., 1974: The time evolution of drugs in the body: An application of the principle of 
chemical kinetics. J. Chem. Educ., 51, 19-22. 
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where [A] denotes the concentration of the drug in the gastrointestinal tract or in the site 
of injection, [B] is the concentration of the drug in the body, and [C] is either the amount 
of drug eliminated by various metabolic functions or the amount of the drug utilized by 
various action sites in the body. If [A]g denotes the initial concentration of A, find [A], [B], 
and [C] as a function of time t. 


20. Find the current in an RL circuit when the electromotive source equals Ey cos?(wt). 
Initially the circuit is dead. 


Find the general solution for the following Bernoulli equations: 


2 dy 


dy y E 2 E 2 = 

21. 4 —-y 22. 3^ 7^ — my by 28. 5 — — = ay 
d d g 

a4, HV ae 25.2zy  —y!&z —0 26, a 4 y= lay? 
dr x da: da 


1.6 GRAPHICAL SOLUTIONS 


In spite of the many techniques developed for their solution, many ordinary differen- 
tial equations cannot be solved analytically. In the next two sections, we highlight two 
alternative methods when analytical methods fail. Graphical methods seek to understand 
the nature of the solution by examining the differential equations at various points and 
infer the complete solution from these results. In the last section, we highlight the numeri- 
cal techniques that are now commonly used to solve ordinary differential equations on the 
computer. 


e Direction fields 


One of the simplest numerical methods for solving first-order ordinary differential equa- 
tions follows from the fundamental concept that the derivative gives the slope of a straight 
line that is tangent to a curve at a given point. 

Consider the first-order differential equation 


y = f(x,y), (1.6.1) 


which has the initial value y(xo) = yo. For any (x,y) it is possible to draw a short line 
segment whose slope equals f(x,y). This graphical representation is known as the direction 
field or slope field of Equation 1.6.1. Starting with the initial point (£o, yo), we can then 
construct the solution curve by extending the initial line segment in such a manner that 
the tangent of the solution curve parallels the direction field at each point through which 
the curve passes. 

Before the days of computers, it was common to first draw lines of constant slope 
(isoclines) or f(x,y) — c. Because along any isocline all of the line segments had the same 
slope, considerable computational savings were realized. Today, computer software exists 
that performs these graphical computations with great speed. 

To illustrate this technique, consider the ordinary differential equation 


dx 2 
== mgy--i* 1.6.2 
p7 (1.6.2) 
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Figure 1.6.1: The direction field for Equation 1.6.2. The solid lines are plots of the solution with various 
initial conditions. 


Its exact solution is 
z(t) = Cet -- t? + 2t +2, (1.6.3) 


where C is an arbitrary constant. Using the MATLAB script 


clear 
^ create grid points in t and x 
[t,x] = meshgrid(-2:0.2:3,-1:0.2:2); 
% load in the slope 
slope = x - t.*t; 
^ find the length of the vector (1,slope) 
length = sqrt(1 + slope .* slope); 
^ create and plot the vector arrows 
quiver(t,x,1./length,slope./length,0.5) 
axis equal tight 
hold on 
^4 plot the exact solution for various initial conditions 
tt = [-2:0.2:3]; 
for cval - -10:1:10 
x.exact = cval * exp(tt) + tt.*tt + 2*tt + 2; 
plot(tt,x exact) 
xlabel(’t’,’Fontsize’ ,20) 
ylabel(’x’,’Fontsize’ ,20) 
end 
we show in Figure 1.6.1 the directional field associated with Equation 1.6.2 along with 
some of the particular solutions. Clearly the vectors are parallel to the various particular 
solutions. Therefore, without knowing the solution, we could choose an arbitrary initial 
condition and sketch its behavior at subsequent times. The same holds true for nonlinear 
equations. 


e Rest points and autonomous equations 
In the case of autonomous differential equations (equations where the independent 


variable does not explicitly appear in the equation), considerable information can be gleaned 
from a graphical analysis of the equation. 
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Figure 1.6.2: The phase line diagram for the ordinary differential equation, Equation 1.6.4. 


Consider the nonlinear ordinary differential equation 


da: 
= = 2_ 14), 1.6.4 
df = ssi) (1.6.4) 
The time derivative x’ vanishes at x = —1,0,1. Consequently, if z(0) = 0, x(t) will remain 
zero forever. Similarly, if «(0) = 1 or z(0) = —1, then x(t) will equal 1 or —1 for all 


time. For this reason, values of x for which the derivative x’ is zero are called rest points, 
equilibrium points, or critical points of the differential equation. 

The behavior of solutions near rest points is often of considerable interest. For example, 
what happens to the solution when z is near one of the rest points x = —1,0,1? 

Consider the point x = 0. For z slightly greater than zero, x’ < 0. For x slightly less 
than 0, 2’ > 0. Therefore, for any initial value of x near x = 0, x will tend to zero. In 
this case, the point x = 0 is an asymptotically stable critical point because whenever x is 
perturbed away from the critical point, it tends to return there again. 

Turning to the point x = 1, for x slightly greater than 1, x’ > 0; for x slightly less 
than 1, x’ < 0. Because any x near x = 1, but not equal to 1, will move away from x = 1, 
the point x = 1 is called an unstable critical point. A similar analysis applies at the point 
x = —1. This procedure of determining the behavior of an ordinary differential equation 
near its critical points is called a graphical stability analysis. 


e Phase line 


A graphical representation of the results of our graphical stability analysis is the phase 
line. On a phase line, the equilibrium points are denoted by circles. See Figure 1.6.2. Also 
on the phase line we identify the sign of x’ for all values of z. From the sign of x’, we then 
indicate whether x is increasing or deceasing by an appropriate arrow. If the arrow points 
toward the right, x is increasing; toward the left z decreases. Then, by knowing the sign 
of the derivative for all values of x, together with the starting value of x, we can determine 
what happens as t — oo. Any solution that is approached asymptotically as t — oo is called 
a steady-state output. In our present example, x = 0 is a steady-state output. 


Problems 
In previous sections, you used various techniques to solve first-order ordinary differential 
equations. Now check your work by using MATLAB to draw the direction field and plot 
your analytic solution for the following problems taken from previous sections: 


1. Section 1.2, Problem 5 2. Section 1.3, Problem 1 


3. Section 1.4, Problem 5 4. Section 1.5, Problem 3 
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For the following autonomous ordinary differential equations, draw the phase line. Then 
classify each equilibrium solution as either stable or unstable. 


5. z' = oz(1— z)(z— 1i) 6. z' = (a? — 1)(z? — 4) 


3 3 


T. 4! = —4r— r 8. a =4r-—2 


1.7 NUMERICAL METHODS 


By now you have seen most of the exact methods for finding solutions to first-order 
ordinary differential equations. The methods have also given you a view of the general 
behavior and properties of solutions to differential equations. However, it must be admitted 
that in many instances exact solutions cannot be found and we must resort to numerical 
solutions. 

In this section we present the two most commonly used methods for solving differen- 
tial equations: Euler and Runge-Kutta methods. There are many more methods and the 
interested student is referred to one of countless numerical methods books. A straightfor- 
ward extension of these techniques can be applied to systems of first-order and higher-order 
differential equations. 


e Euler and modified Euler methods 
Consider the following first-order differential equation and initial condition: 


Do fley), vm) =o. (1.7.1) 


Euler's method is based on a Taylor series expansion of the solution about zo or 
y(zo +h) = y(zo) + hy' (x0) + 3y (Eh, — zo « £ « zo +h, (1.7.2) 


where h is the step size. Euler's method consists of taking a sufficiently small h so that 
only the first two terms of this Taylor expansion are significant. 
Let us now replace y'(xo) by f(xo, yo). Using subscript notation, we have that 


Vici = yi hf (vi yi) + O(h’). (1.7.3) 


Equation 1.7.3 states that if we know the values of y; and f (xj, yi) at the position z;, then 
the solution at z;,4 can be obtained with an error!? O(h?). 

'The trouble with Euler's method is its lack of accuracy, often requiring an extremely 
small time step. How might we improve this method with little additional effort? 

One possible method would retain the first three terms of the Taylor expansion rather 
than the first two. This scheme, known as the modified Euler method, is 


Yirr = Yi + hy (xi) + gh? yy’ + O(h’). (1.7.4) 


This is clearly more accurate than Equation 1.7.3. 


1? The symbol O is a mathematical notation indicating relative magnitude of terms, namely that f(e) = 
O(e") provided lime.,0|f(c)/e"| < oo. For example, as € — 0, sin(e) = O(c), sin(c?) = O(e?), and 
cos(e) = O(1). 
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An obvious question is how do we evaluate y/, because we do not have any information 
on its value. Using the forward derivative approximation, we find that 


/ / 
n Vii Wi 


y; = h (1.7.5) 
Substituting Equation 1.7.5 into Equation 1.7.4 and simplifying 
Wn = yi + 5 (ui + yl) + 009) (1.7.6) 
Using the differential equation, 
Vici = Yi + 2 [f (xi yi) + f (visi, Yi+1)] + O(h’). (1.7.7) 


2 


Although f(x;,y;) at (xi, yi) are easily calculated, how do we compute f(2j41, yj41) at 
(zi41,Ui41)? For this we compute a first guess via the Euler method, Equation 1.7.3; 
Equation 1.7.7 then provides a refinement on the value of yi+1. 

In summary then, the simple Euler scheme is 


Vii = Yi + kı +O(h?), kı -hf(xsy) (1.7.8) 
while the modified Euler method is 


Yin = yi + (kı + ka) + O(h’), kı = hf (zi yi), ka = hf (ei +h, yi + kı). (1.7.9) 


e Example 1.7.1 
Let us illustrate Euler's method by numerically solving 
z'—mr-t, z(0) — 1. (1.7.10) 


A quick check shows that Equation 1.7.10 has the exact solution 2exact(t) = 2e* — t — 1. 
Using the MATLAB script 


clear 

for i= 1:3 

^4 set up time step increment and number of time steps 
h = 1/10^i; n = 10/h; 

^ set up initial conditions 
t-zeros(n*1,1); t(1) = 0; 


x euler-zeros(n*ti,1); x euler(1) = 1; 
xmodified-zeros(n*i,1); x modified(1) = 1; 
x exact-zeros(n*i,1); x exact(1) = 1; 


^ set up difference arrays for plotting purposes 
diffi = zeros(n,1); diff2 = zeros(n,1); tplot = zeros(n,1); 
% define right side of differential equation, Equation 1.7.10 
f = inline(’xxttt’,’tt’,’xx’); 
for k = i:n 
t(k+1) = t(k) + h; 
^4 compute exact solution 
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x_exact(k+1) = 2*exp(t(kt1)) - t(k+1) - 1; 
^4 compute solution via Euler’s method 
ki = h * f(t(k),x_euler(k)); 
x_euler(k+1) = x_euler(k) + k1; 
tplot(k) = t(k+1); 
diff1(k) x_euler(k+1) - x exact(k*1); 
diffi(k) = abs(diff1(k) / x exact(k*1)); 
^4 compute solution via modified Euler method 
ki = h * £(t(k) ,x_modified(k)); 
k2 = h * f(t(k*1),x modified(k)-*k1); 
x_modified(k+1) = x modified(k) + 0.5*(k1+k2) ; 
diff2(k) = xmodified(k*1) - x_exact(k+1); 
diff2(k) = abs(diff2(k) / x_exact(k+1)); 
end 
% plot relative errors 
semilogy(tplot,diffi,’-’ ,tplot,diff2,’:’) 
hold on 
xlabel(’TIME’ ,’Fontsize’ ,20) 
ylabel(’ |RELATIVE ERROR|’,’Fontsize’ ,20) 
legend(’Euler method’, ’modified Euler method’) 
legend boxoff; 
numi = 0.2*n; num2 = 0.8*n; 
text(3,diffi(numi1),[?h = ’,num2str(h)],’Fontsize’,15,... 
>HorizontalAlignment’,’right’,... 
?VerticalAlignment’, ’bottom’) 
text(9,diff2(num2),[?h = ’,num2str(h)],’Fontsize’,15,... 
>HorizontalAlignment’,’right’,... 
?VerticalAlignment’, bottom’) 


end 


Both the Euler and modified Euler methods have been used to numerically integrate Equa- 
tion 1.7.10 and the absolute value of the relative error is plotted in Figure 1.7.1 as a function 
of time for various time steps. In general, the error grows with time. The decrease of error 
with smaller time steps, as predicted in our analysis, is quite apparent. Furthermore, the 
superiority of the modified Euler method over the original Euler method is clearly seen. 


e Runge-Kutta method 


As we have just shown, the accuracy of numerical solutions of ordinary differential 
equations can be improved by adding more terms to the Taylor expansion. The Runge- 
Kutta method! builds upon this idea, just as the modified Euler method did. 

Let us assume that the numerical solution can be approximated by 


Yi+1 = Ui + akı + bk», (1.7.11) 


13 Runge, C., 1895: Ueber die numerische Auflósung von Differentialgleichungen. Math. Ann., 46, 167— 
178; Kutta, W., 1901: Beitrag zur Nàherungsweisen Integration totaler Differentialgleichungen. Zeit. Math. 
Phys., A6, 435-453. For a historical review, see Butcher, J. C., 1996: A history of Runge-Kutta methods. 
Appl. Numer. Math., 20, 247-260 and Butcher, J. C., and G. Wanner, 1996: Runge-Kutta methods: Some 
historical notes. Appl. Numer. Math., 22, 113-151. 
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Figure 1.7.1: The relative error [x(t) — zexact(t)]/*exacc(t) of the numerical solution of Equation 1.7.10 
using Euler's method (the solid line) and modified Euler's method (the dotted line) with different time steps 
h. 


where 


Here a, b, A1, and B, are four unknowns. Equation 1.7.11 was suggested by the modified 
Euler method that we just presented. In that case, the truncated Taylor series had an error 
of O(h?). We anticipate such an error in the present case. 

Because the Taylor series expansion of f(x +h,y+k) about (a, y) is 


f(x +h, y+ k) — f(x,y) 25 (hfe oh k fy) s i (h? fus zs 2hk fry T k? fu) 


+ $ (hè feaa + 3h?k frzy + 3hk? fryy + k? fyyy) +, (173) 

ko can be rewritten 
= h[f(zi yi) + (Ath fe + Bihf fy)| (1.7.15) 
=hf + Aih’ fa + Bil’ f fy, (1.7.16) 


where we have retained only terms up to O(h?) and neglected all higher-order terms. Finally, 
substituting Equation 1.7.16 into Equation 1.7.11 gives 


Visi = yi + (a+ b)hf + (Arbfe + Bibf fy)h?. (1.7.17) 


This equation corresponds to the second-order Taylor expansion: 


Yi+1 = yid hy; | Sh? yl! (1.7.18) 
Therefore, if we wish to solve the differential equation y' = f(x,y), then 


y" = fa + tay = fa + F fy- (1.7.19) 


Substituting Equation 1.7.19 into Equation 1.7.18, we have that 


yii = yi hf III fs f fy). (1.7.20) 
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Although Carl David Tolmé Runge (1856-1927) began his studies in Munich, his friendship with 
Max Planck led him to Berlin and pure mathematics with Kronecker and Weierstrass. It was his 
professorship at Hanover beginning in 1886 and subsequent work in spectroscopy that led him to his 
celebrated paper on the numerical integration of ordinary differential equations. Runge's final years 
were spent in Góttingen as a professor in applied mathematics. (Portrait taken with permission from 
Reid, C., 1976: Courant in Gottingen and New York: The Story of an Improbable Mathematician. 
Springer-Verlag, 314 pp. ©1976, by Springer-Verlag New York Inc.) 


A direct comparison of Equation 1.7.17 and Equation 1.7.20 yields 


a+b=1, Aib=3, and Bjb- i. (1.7.21) 

These three equations have four unknowns. If we choose a = i, we immediately calculate 
b= i and A; = Bı = 1. Hence the second-order Runge-Kutta scheme is 

Vid = Yi + (ki + kə), (1.7.22) 


where kı = hf (zi, yi) and ko = hf(xi + h, yi + kı). Thus, the second-order Runge-Kutta 
scheme is identical to the modified Euler method. 

Although the derivation of the second-order Runge-Kutta scheme yields the modified 
Euler scheme, it does provide a framework for computing higher-order and more accurate 
schemes. A particularly popular one is the fourth-order Runge-Kutta scheme 


Yi+ı = Yi + $ (kı + 2ka + 2ks + a), 1.7.23) 
where 
ky = hf (zi yi), 1.7.24) 
ko = hf (xi + ih, yi + ki), 1.7.25) 
ka = hf (ai + ih, yi + ka), 1.7.26) 
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and 


e Example 1.7.2 


Let us illustrate the fourth-order Runge-Kutta by redoing the previous example using 
the MATLAB script 


clear 
^4 test out different time steps 
for i= 1:4 


^4 set up time step increment and number of time steps 
if i==1 h = 0.50; end; if i==2 h = 0.10; end; 
if i--3 h = 0.05; end; if i--4 h = 0.01; end; 
n = 10/h; 

^4 set up initial conditions 
t=zeros(nti,1); t(1) = 0; 
xrk-zeros(nti,1); xrk(1) = 1; 
x exact-zeros(n*i,1); x exact(1) = 1; 

^4 set up difference arrays for plotting purposes 
diff = zeros(n,1); tplot = zeros(n,1); 

^4 define right side of differential equation 
f = inline xxttt? ,? tt? ,? xx?) ; 
for k = 1:n 

x_local = xrk(k); t local = t(k); 


ki = h * f(t_local,x_local) ; 

k2 = h * f(t local + h/2,x local + k1/2); 
k3 = h * f(t local + h/2,x local + k2/2); 
k4 = h * f(t local + h,x local + k3); 


t(k+1) = t local + h; 
x_rk(k+1) = x local + (k142*k242*k3*k4) / 6; 
x_exact(k+1) = 2*exp(t(kt1)) - t(k+1) - 1; 
tplot(k) = t(k); 
diff(k) = xrk(k*1) - x exact(k*1); 
diff(k) = abs(diff(k) / x exact(k*1)); 
end 
% plot relative errors 
semilogy(tplot,diff,’-’) 
hold on 
xlabel (’TIME’ ,’Fontsize’ ,20) 
ylabel(’|RELATIVE ERROR|’,’Fontsize’ ,20) 
numi = 2*i; num2 = 0.2*n; 
text (numi,diff(num2),[’h = ’,num2str(h)],’Fontsize’,15,... 
>HorizontalAlignment’,’right’,... 
?VerticalAlignment’, ’bottom’ ) 
end 


The error growth with time is shown in Figure 1.7.2. Although this script could be used 
for any first-order ordinary differential equation, the people at MATLAB have an alternative 
called ode45, which combines a fourth-order and a fifth-order method that are similar to 
our fourth-order Runge-Kutta method. Their scheme is more efficient because it varies the 
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Figure 1.7.2: Same as Figure 1.7.1 except that we have used the fourth-order Runge-Kutta method. 


step size, choosing a new time step at each step in an attempt to achieve a given desired 
accuracy. 


e Adams-Bashforth method 


All of the methods presented so far (Euler, modified Euler, Runge-Kutta) are single 
point methods; the solution at i+ 1 depends solely on a single point i. A popular alternative 
to these schemes are multistep methods that compute y; 41 by reusing previously obtained 
values of y, where n « i. 

We begin our derivation of a multistep method by rewriting Equation 1.7.1 as 


dy = f(x,y) da. (1.7.28) 


Integrating both sides of Equation 1.7.28, we obtain 


vem) -v= fay = f fene (1.7.29) 


i i 


The Adams-Bashforth method! replaces the integrand in Equation 1.7.29 with an approx- 
imation derived from Newton's backward difference formula: 


fun) = fi + EV fi + SEECTDV fi + SEC DG - 2) V? f (1.7.30) 
where £ = (x — z;)/h or x = xi + h£, 
V fi = f (£i Yi) — Fag): (1.7.31) 


V? f; = f(zi, yi) — 2f (zi i yii) + f(zi-2, Yi-2), (1.7.32) 


14 Bashforth, F., and J. C. Adams, 1883: An Attempt to Test the Theories of Capillary Action by 
Comparing the Theoretical and Measured Forms of Drops of Fluid. With an Explanation of the Method 
of Integration Employed in Constructing the Tables Which Give the Theoretical Forms of Such Drops. 
Cambridge University Press, 139 pp. 
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and 
V? f; = f (ziyi) — Sf (vica Yi-1) + 3f (i52) — f(i-s Yi-3). (1.7.33) 


Substituting Equation 1.7.30 into Equation 1.7.29 and carrying out the integration, we find 
that 


y(xi+1) = v(zi) + x [55 (zi, yi) — 59f (zi-1, yi-1) + 3T f (zi-2, yi-2) — 9f (zi-3, Yi-3)]- 
(1.7.34) 
Thus, the Adams-Bashforth method is an explicit finite difference formula that has a global 
error of O(h*). Additional computational savings can be realized if the old values of the 
slope are stored and used later. A disadvantage is that some alternative scheme (usually 
Runge-Kutta) must provide the first three starting values. 


e Example 1.7.3 


The flight of projectiles provides a classic application of first-order differential equa- 
tions. If the projectile has a mass m and its motion is opposed by the drag mgkv?, where 
g is the acceleration due to gravity and k is the quadratic drag coefficients, Newton’s law 
of motion gives 

dv 
dt 
where 0 is the slope of the trajectory to the horizon. From kinematics, 


— —gsin(0) — gkv?, (1.7.35) 


dr _ dy — dð ^ gcos(0) 
— —vcos(0)  — =vsin(9), qo ae 


P (1.7.36) 


An interesting aspect of this problem is the presence of a system of ordinary differential 
equations. 

Although we can obtain an exact solution to this problem,!° let us illustrate the Adams- 
Bashforth method to compute the solution to Equation 1.7.35 and Equation 1.7.36. We 
begin by computing the first three time steps using the Runge-Kutta method. Note that 
we first compute the kı for all of the dependent variables before we start computing the 
values of ks. Similar considerations hold for kg and k4. 


clear 

a = 0; b = 7.85; N = 100; g = 9.81; c 
h = (b-a)/N; t = (a:h:b+h); 

% set initial conditions 

v(1) = 44.69; theta(1) = pi/3; x(1) = 0; y(1) = 0; 


0.000548; 


for i - 1:3 
angle - theta(i); vv - v(i); 
ki vel = -g*sin(angle) - g*c*vv*vv; 


ki angle = -g*cos(angle) / vv; 

ki x = vv * cos(angle); 

ki y = vv * sin(angle); 

angle = theta(i)*h*ki1 angle/2; vv = v(i)*h*kl vel/2; 
k2 vel - -g*sin(angle) - g*c*vv*vv; 


15 Tan, A., C. H. Frick, and O. Castillo, 1987: The fly ball trajectory: An older approach revisited. Am. 
J. Phys., 55, 37-40; Chudinov, P. S., 2001: The motion of a point mass in a medium with a square law of 
drag. J. Appl. Math. Mech., 65, 421—426. 
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k2 angle - -g*cos(angle) / vv; 

k2_x = vv * cos(angle); 

k2 y = vv * sin(angle); 

angle = theta(i)+h*k2_angle/2; vv = v(i)+h*k2_vel/2; 

k3 vel = -g*sin(angle) - g*c*vv*vv; 

k3 angle = -g*cos(angle) / vv; 

k3 x 7 vv * cos(angle); 

k3_y = vv * sin(angle); 

angle = theta(i)+h*k3_angle; vv = v(i)+h*k3_vel; 

k4_vel = -g*sin(angle) - g*c*vv*vv; 

k4 angle = -g*cos(angle) / vv; 

kA x = vv * cos(angle); 

k4 y = vv * sin(angle); 

v(i+1) = v(i) + h*(ki_vel+2*k2_vel+2*k3_vel+k4_vel)/6; 

x(i*1) = x(i) + h*(k1_x+2*k2_x+2*k3_x+k4_x)/6; 

y(itt) = y(i) + h*(k1.y*2*k2 y*2*k3. y*k4. y) /6; 

theta(i*1) = theta(i) + h*(ki angle42*k2 angle ... 
*2*k3 angletk4 angle)/6; 


end 


Having computed the first three values of each of the dependent variables, we turn to 
the Adams-Bashforth method to compute the remaining portion of the numerical solution: 


for i= 4:N 


^ 


angle = theta(i); vv = v(i); 

ki vel = -g*sin(angle) - g*c*vv*vv; 

ki angle = -g*cos(angle) / vv; 

ki x = vv * cos(angle); 

ki_y = vv * sin(angle); 

angle = theta(i-1); vv = v(i-1); 

k2 vel = -g*sin(angle) - g*c*vv*vv; 

k2 angle - -g*cos(angle) / vv; 

k2 x = vv * cos(angle); 

k2 y = vv * sin(angle); 

angle = theta(i-2); vv = v(i-2); 

k3 vel = -g*sin(angle) - g*c*vv*vv; 

k3 angle = -g*cos(angle) / vv; 

k3 x = vv * cos(angle); 

k3. y = vv * sin(angle); 

angle = theta(i-3); vv = v(i-3); 

k4 vel = -g*sin(angle) - g*c*vv*vv; 

k4 angle = -g*cos(angle) / vv; 

kA x = vv * cos(angle); 

k4 y = vv * sin(angle); 

Use Equation 1.7.35 and Equation 1.7.36 for v, x, y and 0 

v(i*1) = v(i) + h*(55*k1l vel-59*k2 vel*37*k3 vel-9*k4 vel)/24; 

x(i*1) x(i) + h*(55*k1_x-59*k2_x+37*k3_x-9*k4_x)/24; 

y(i*1) = y(i) + h*(55*k1_y-59*k2_y+37*k3_y-9*k4_y) /24; 

theta(i*1) = theta(i) + h*(55*k1 angle-59*k2 angle ... 
+37*k3_angle-9*k4_angle)/24; 


end 
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Figure 1.7.3: The trajectory of a projectile with and without air resistance when it is initial fired with a 
muzzle velocity of 44.69 m/s and an elevation of 0 — 60?. All units are in the MKS system. 


Figure 1.7.3 illustrates this numerical solution when k = 0 and k = 0.000548 s?/m? 
and the shell is fired with the initial velocity v(0) = 44.69 m/s and elevation 0(0) = 7/3 
with z(0) = y(0) = 0. 


Problems 


Using Euler's, Runge-Kutta, or the Adams-Bashforth method for various values of h — 
107”, find the numerical solution for the following initial-value problems. Check your 
answer by finding the exact solution. 


l.a’=a-t, zx(0)22 2. z/—tz, x(0)=1 
3.z'—a?/(t--1, 2«(0)=1 4.z'— reí, zx(1)20 


5. Consider the integro-differential equation 


where the signum function is defined by Equation 11.2.11. This equation describes the 
(nondimensional) current, x(t), within an electrical circuit that contains a capacitor, 
inductor, and nonlinear resistor. Assuming that the circuit is initially dead, x(0) — 0, write 
a MATLAB script that uses Euler's method to compute x(t). Use a simple Riemann sum to 
approximate the integral. See Figure 1.7.4. Examine the solution for various values of B 
and f) as well as time step At. 


16 Monahan, T. F., 1960: Calculation of the current in non-linear surge-current-generation circuits. Proc. 
IEE, Part C, 107, 288-291. 
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Figure 1.7.4: The numerical solution of the equation describing an electrical circuit with a nonlinear 
resistor. Here 8 = 0.2 and At = 0.01. 


Further Readings 


Boyce, W. E., and R. C. DiPrima, 2004: Elementary Differential Equations and Boundary 
Value Problems. Wiley, 800 pp. Classic textbook. 


Ince, E. L., 1956: Ordinary Differential Equations. Dover, 558 pp. The source book on 
ordinary differential equations. 


Zill, D. G., and M. R. Cullen, 2008: Differential Equations with Boundary- Value Problems. 
Brooks Cole, 640 pp. Nice undergraduate textbook. 
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Chapter 2 
Higher-Order Ordinary 
Differential Equations 


Although first-order ordinary differential equations exhibit most of the properties of 
differential equations, higher-order ordinary differential equations are more ubiquitous in 
the sciences and engineering. This chapter is devoted to the most commonly employed 


techniques for their solution. 
A linear nth-order ordinary differential equation is a differential equation of the form 


n n—1i 
anla) 5 tania) +--+ a (2) aoa fa) (201) 
If f(x) = 0, then Equation 2.0.1 is said to be homogeneous; otherwise, it is nonhomogeneous. 
A linear differential equation is normal on an interval I if its coefficients and f(x) are 
continuous, and the value of a4 (x) is never zero on I. 

Solutions to Equation 2.0.1 generally must satisfy not only the differential equations 
but also certain specified conditions at one or more points. JInitial-value problems are 
problems where all of the conditions are specified at a single point x = a and have the 
form: y(a) = bo, y/(a) = bi, y" (a) = ba, ...., YTH (a) = bs 1, where bo, bi, b2, ...., 01 
are arbitrary constants. A quick check shows that if Equation 2.0.1 is homogeneous and 
normal on an interval J and all of the initial conditions equal zero at the point x = a that 
lies in J, then y(x) = 0 on J. This follows because y = 0 is a solution of Equation 2.0.1 and 
satisfies the initial conditions. 
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At this point a natural question would be whether the solution exists for this initial- 
value problem and, if so, whether it is unique. From a detailed study of this question, we 
have the following useful theorem. 


Theorem: Existence and Uniqueness 


Suppose that the differential equation, Equation 2.0.1, is normal on the open interval 
I containing the point x =a. Then, given n numbers bo, bı, ..., b, 1, the nth-order linear 
equation, Equation 2.0.1, has a unique solution on the entire interval I that satisfies the n 
initial conditions y(a) = bo, y (a) = b,... y^P (a) = b, i. 


e Example 2.0.1 


The solution y(x) = $e*—2e ?" to the ordinary differential equation y" 4-2" —y' —2y = 
0 satisfies the initial conditions y(0) = 1, y'(0) = 2, and y"(0) = 0 at x = 0. Our theorem 
guarantees us that this is the only solution with these initial values. 


Another class of problems, commonly called (two-point) boundary-value problems, oc- 
curs when conditions are specified at two different points x = a and x = b with b > a. 
An important example, in the case of second-order ordinary differential equations, is the 
Sturm-Liouville problem where the boundary conditions are oy(a) + £1y'(a) — 0 at x =a 
and ooy(b) + B2y'(b) = 0 at x = b. The Sturm-Liouville problem is treated in Chapter 6. 

Having introduced some of the terms associated with higher-order ordinary linear dif- 
ferential equations, how do we solve them? One way is to recognize that these equations 
are really a set of linear, first-order ordinary differential equations. For example, the linear 
second-order linear differential equation 


y" — 3y' + 2y = 3x (2.0.2) 
can be rewritten as the following system of first-order ordinary differential equations: 
y —y =v, and v’ — w= 3x (2.0.3) 


because 
y" —y =v = w + 3x = 2y' — 2y + 3x, (2.0.4) 


which is the same as Equation 2.0.2. This suggests that Equation 2.0.2 can be solved by 
applying the techniques from the previous chapter. Proceeding along this line, we first find 
that 


v(x) = Cie” — 3r — 3. (2.0.5) 
Therefore, 
y — y= 0e” — 3r- 3. (2.0.6) 
Again, applying the techniques from the previous chapter, we have that 
y = Cie?” + Cze” + ic + 2. (2.0.7) 


Note that the solution to this second-order ordinary differential equation contains two ar- 
bitrary constants. 


1 The proof of the existence and uniqueness of solutions to Equation 2.0.1 is beyond the scope of this 
book. See Ince, E. L., 1956: Ordinary Differential Equations. Dover Publications, Inc., Section 3.32. 
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e Example 2.0.2 

In the case of linear, second-order ordinary differential equations, a similar technique, 
called reduction in order, provides a method for solving differential equations if we know 
one of its solutions. 

Consider the second-order ordinary differential equation 


z^y" — day’ + 9y = 0. (2.0.8) 


A quick check shows that yi(z) = z? ln(x) is a solution of Equation 2.0.8. Let us now 
assume that the general solution can be written y(x) = u(z)z? ln(x). Then 


y' = v (z)2? n(x) + u(x) [3x In(x) + 27] , (2.0.9) 
and 
y" = u” (x)? In(z) + 2u' (x) [327 ln(x) + x°] + u(x) [62 n(x) + 52]. (2.0.10) 
Substitution of y(x), y'(z), and y"(x) into (2.0.8) yields 
a? In(z)u" + [z^ In(z) + 224] w' = 0. (2.0.11) 
Setting u’ = w, separation of variables leads to 
w’ 1 2 


w cx mc In(z)' 2042) 


Note how our replacement of u'(z) with w(x) has reduced the second-order ordinary differ- 
ential equation to a first-order one. Solving Equation 2.0.12, we find that 


w(z) =u (2) = -— S, (2.0.13) 
x In*(z) 
and 
ula) = +C (2.0.14) 
uc In(x) 2° .U. 
Because y(x) = u(z)z? ln(x), the complete solution is 
ylz) = Cia? + Coa? ln(x). (2.0.15) 


Substitution of Equation 2.0.15 into Equation 2.0.8 confirms that we have the correct solu- 
tion. 

We can verify our answer by using the symbolic toolbox in MATLAB. Typing the com- 
mand: 
dsolve(?x*x*D2y-5*x*Dy49*y-0? ,? x?) 
yields 
ans - 

C1*x^34C2*x^3*1log(x) 
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In summary, we can reduce (in principle) any higher-order, linear ordinary differential 
equations into a system of first-order ordinary differential equations. T'his system of differ- 
ential equations can then be solved using techniques from the previous chapter. In Chapter 
3 we will pursue this idea further. Right now, however, we will introduce methods that 
allow us to find the solution in a more direct manner. 


e Example 2.0.3 


An autonomous differential equation is one where the independent variable does not 
appear explicitly. In certain cases we can reduce the order of the differential equation and 
then solve it. 

Consider the autonomous ordinary differential equation 


y” = 2y*. (2.0.16) 
The trick here is to note that 
dv dv 
"= =y— = 2y’ 2.0.17 
a E ( ) 


where v = dy/dx. Integrating both sides of Equation 2.0.17, we find that 
v? — * 4 C4. (2.0.18) 


Solving for v, 

d — 

“Y og maf Fyi. (2.0.19) 
dx 


Integrating once more, we have the final result that 


dy 
r+C - | -= (2.0.20) 
? / C14 yt 
Problems 


For the following differential equations, use reduction of order to find a second solution. 
Can you obtain the general solution using dsolve in MATLAB? 


1. zy" -2y/ 20, w(x) =1 2. y" +y —2y 0, yi(x) =e” 
3. z^y" + day'—4y=0, w(x) =a 4. zy" — (z +1)y' *y 20, y(r) =e* 
5. (Qa — z?)y" + 2(z — 1)y' — 2y = 0, 6. y" + tan(z)y' — 6cot?(z)y = 0, 
y(x)—-z-—1 yi (x) = sin? (x 
7. Ax? y" + day’ + (Az? — 1)y =0, 8. y" + ay! + b(1-- ax — bz?)y = 0, 
in (z) = cos(z)/ Vz yi(e) = e? 


Solve the folowing autonomous ordinary differential equations: 


9. yy" — y^ 10. y” = 2yy’, y(0)=7/(0) =1 
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11. yy" E y' +4 y^? 12. 2yy" Sais y"? 
13. y” =e™, y(0) 20,y(0) —1 14. y" 23yy, y(0) = y'(0) = 1,y"(0) = 5 
15. Solve the nonlinear second-order ordinary differential equation 


dy 1dy 1/fdy\ 
dx? xdr 2 


by (1) reducing it to the Bernoulli equation 


(2) solving for v(x), and finally (3) integrating u’ = v to find u(x). 
16. Consider the differential equation 
az(x)y"  ai(z)y' + ao(z)y 2 0, — aa(x) #0. 


Show that this ordinary differential equation can be rewritten 


u” + f(z)u — 0, 


using the substitution 


utes deb l-: ?, í D a . 


2.1 HOMOGENEOUS LINEAR EQUATIONS WITH CONSTANT COEFFICIENTS 


In our drive for more efficient methods to solve higher-order, linear, ordinary differential 
equations, let us examine the simplest possible case of a homogeneous differential equation 
with constant coefficients: 

d"y daty 


an + an-ı 
dz" d 


d 
| ce ag" + ars + ay = 0. (2.1.1) 
Xx 


qn-1 


Although we could explore Equation 2.1.1 in its most general form, we will begin by studying 
the second-order version, namely 


ay" + by’ + cy — 0, (2.1.2) 


since it is the next step up the ladder in complexity from first-order ordinary differential 
equations. 

Motivated by the fact that the solution to the first-order ordinary differential equation 
y' + ay = 0 is y(x) = Cie ?*, we make the educated guess that the solution to Equation 
2.1.2 is y(x) = Ae"?. Direct substitution into Equation 2.1.2 yields 


(am? + bm + c) Ae"* = 0. (2.1.3) 
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The constant A cannot equal 0 because that would give y(x) = 0 and we would have a 
trivial solution. Furthermore, since e"'* Æ 0 for arbitrary x, Equation 2.1.3 simplifies to 


am? 4- bm 4 c — 0. (2.1.4) 


Equation 2.1.4 is called the auxiliary or characteristic equation. At this point we must 
consider three separate cases. 


e Distinct real roots 


In this case the roots to Equation 2.1.4 are real and unequal. Let us denote these roots 
by m = mı, and m = m». Thus, we have the two solutions: 


yi(r)- Cie", and yo(x) = Cpe”. (2.1.5) 
We will now show that the most general solution to Equation 2.1.2 is 
y(x) = Cye™® + Cero. (2.1.6) 
This result follows from the principle of (linear) superposition. 


Theorem: Let yi, y2,..., yx be solutions of the homogeneous equation, Equation 2.1.1, on 
an interval I. Then the linear combination 


y(z) = Ciy (x) + Coy2(2) +--+ + Ceyn(2), (2.1.7) 
where Ci, i = 1,2,...,k, are arbitrary constants, is also a solution on the interval I. 
Proof: We will prove this theorem for second-order ordinary differential equations; it is 
easily extended to higher orders. By the superposition principle, y(x) = Ciyi1(x) 4- Coya (x). 
Upon substitution into Equation 2.1.2, we have that 


a (Ciyt + Coy2) + b (Ciyi + Coy5) + c(Ciyi + C2y2) = 0. (2.1.8) 


Recombining the terms, we obtain 


C; (ay + byi + cy1) + C» (ayz + bys + cy») = 0, (2.1.9) 
or 
0C -- 0C5 = 0. (2.1.10) 


e Example 2.1.1 


A quick check shows that y1(x) = e” and y2(a) = e~* are two solutions of y" — y = 0. 
Our theorem tells us that any linear combination of these solutions, such as y(x) = 5e* — 
3e" *, is also a solution. 

How about the converse? Is every solution to y" — y = 0 a linear combination of yj (x) 
and y2(x)? We will address this question shortly. 
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e Example 2.1.2 
Let us find the general solution to 
y + 2y' — 15y = 0. (2.1.11) 
Assuming a solution of the form y(x) = Ae"?, we have that 
(m? + 2m — 15) Ae"? = Q. (2.1.12) 
Because A Æ 0 and e"'* generally do not equal zero, we obtain the auxiliary or characteristic 


equation 
m? + 2m — 15 = (m + 5)(m — 3) = 0. (2.1.13) 


Therefore, the general solution is 


y(x) = Cie?” + Coe”. (2.1.14) 


e Repeated real roots 
When m = mı = ms», we have only the single exponential solution yi(r) = Cie""*, 


To find the second solution we apply the reduction of order technique shown in Example 
2.0.2. Performing the calculation, we find 


e x/a 


[2 1 


Since mı = —b/(2a), the integral simplifies to f dx and 


y(x) = Cie"? + Core™”, (2.1.16) 


e Example 2.1.3 
Let us find the general solution to 
y" + Ay! + 4y = 0. (2.1.17) 
Here the auxiliary or characteristic equation is 
m? -- 4m 4- 4 — (m -- 2? — 0. (2.1.18) 
Therefore, the general solution is 


y(x) = (C1 + Cox)e 7". (2.1.19) 
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e Complez conjugate roots 


When b? — 4ac < 0, the roots become the complex pair m, = a+i6 and mz = a — fii, 
where o and f are real and i? = —1. Therefore, the general solution is 


ylz) = Ceres. Cee". (2.1.20) 
Although Equation 2.1.20 is quite correct, most engineers prefer to work with real 


functions rather than complex exponentials. To this end, we apply Euler’s formula? to 
eliminate etf” and e~*®* since 


eP? = cos( Bx) + isin(Bz), (2.1.21) 
and 
e P® — cos(Bx) — isin( 8x). (2.1.22) 
Therefore, 
y(x) = Cie [cos(Bx) + isin(8x)] + Coe” [cos(8x) — isin(8x)] (2.1.23) 
= Cse** cos(x) + C4e?* sin(fix), (2.1.24) 


where C3 = C, + Co, and C4 = iC, — 1C5. 
e Example 2.1.4 
Let us find the general solution to 
y" + Ay + 5y — 0. (2.1.25) 


Here the auxiliary or characteristic equation is 


m? +4m+5 = (m -2)? -1— 0, (2.1.26) 
or m = —2 + i. Therefore, the general solution is 
y(x) = e ?*[C1 cos(x) + Co sin(z)]. (2.1.27) 


So far we have only dealt with second-order differential equations. When we turn 
to higher-order ordinary differential equations, similar considerations hold. In place of 
Equation 2.1.4, we now have the nth-degree polynomial equation 


anm” + ay um? +--+ E agm? 4c aqm +a — 0 (2.1.28) 


for its auxiliary equation. 

When we treated second-order ordinary differential equations, we were able to classify 
the roots to the auxiliary equation as distinct real roots, repeated roots, and complex 
roots. In the case of higher-order differential equations, such classifications are again useful 


? If you are unfamiliar with Euler’s formula, see Section 10.1. 
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although all three types may occur with the same equation. For example, the auxiliary 
equation 
mê — m? + 2m* — 2m? +m? - m 2 0 (2.1.29) 


has the distinct roots m = 0 and m = 1 with the twice repeated, complex roots m = +i. 

Although the possible combinations increase with higher-order differential equations, 
the solution technique remains the same. For each distinct real root m — m4, we have a 
corresponding homogeneous solution e™!”. For each complex pair m = a+ ffi, we have the 
corresponding pair of homogeneous solutions e?* cos(Gx) and e?* sin(Bx). For a repeated 
root m = m; of multiplicity k, regardless of whether it is real or complex, we have either 
emi? ge™® gem ykemu? in the case of real m4 or 


e** cos(Bx), e** sin(Gx), xe?" cos(Gx), xe” sin(fx), 


1? e?* cos(Bx), z2e"* sin(fx), .. . , x*e?* cos(Bx), x"e** sin(8z) 


in the case of complex roots œ + fi. For example, the general solution for the roots to 
Equation 2.1.29 is 


y(x) = Cy + Coe” + Cs cos(x) + Cy sin(x) + Csz cos(x) + Cex sin(x). (2.1.30) 


e Example 2.1.5 
Let us find the general solution to 
y" +y' — 10y — 0. (2.1.31) 


Here the auxiliary or characteristic equation is 


m? +m- 10 = (m — 2) (m? + 2m +5) = (m — 2)[(m + 1)? +4] — 0, (2.1.32) 
or m = —2 and m = —1+ 2i. Therefore, the general solution is 
y(x) = Cie ?* + e * [Cs cos(2) + C3 sin(2z)]. (2.1.33) 


Having presented the technique for solving constant coefficient, linear, ordinary differ- 
ential equations, an obvious question is: How do we know that we have captured all of the 
solutions? Before we can answer this question, we must introduce the concept of linear 
dependence. 


A set of functions fi(x), f2(z),..., fa (x) is said to be linearly dependent on an interval 
I if there exist constants C1, C5,..., C4, not all zero, such that 
C; f(x) + Cofo(x) + Cs fa(x) +--+ + Cufu(x) 20 (2.1.34) 


for each x in the interval; otherwise, the set of functions is said to be linearly independent. 
This concept is easily understood when we have only two functions fi(x) and fa(x). If the 
functions are linearly dependent on an interval, then there exist constants C4 and C5 that 
are not both zero, where 

Cif (x) + C^» falx) =0 (2.1.35) 
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for every x in the interval. If C1 Æ 0, then 


filx) = -2 fale) (2.1.36) 


In other words, if two functions are linearly dependent, then one is a constant multiple of 
the other. Conversely, two functions are linearly independent when neither is a constant 
multiple of the other on an interval. 
e Example 2.1.6 

Let us show that f(x) = 2x, g(x) = 3x7, and h(x) = 5x — 82? are linearly dependent 


on the real line. 
To show this, we must choose three constants, C1, C5, and C3, such that 


C3 f (x) + Cog(x) + C3h(x) = 0, (2.1.37) 
where not all of these constants are nonzero. A quick check shows that 


15 f(x) — 16g(x) — 6h(x) = 0. (2.1.38) 


Clearly, f(x), g(x), and h(x) are linearly dependent. 


e Example 2.1.7 


This example shows the importance of defining the interval on which a function is 
linearly dependent or independent. Consider the two functions f(a) = x and g(x) = |z]. 
They are linearly dependent on the interval (0,00) since Cia + Co|z| = Ciz + Cox = 0 
is satisfied for any nonzero choice of Cı and C5 where C4, = —C 2. What happens on the 
interval (—co,0)? They are still linearly dependent but now C1 = C5. 


Although we could use the fundamental concept of linear independence to check and see 
whether a set of functions is linearly independent or not, the following theorem introduces 
a procedure that is very straightforward. 


Theorem: Wronskian Test of Linear Independence 


Suppose fi(x), fa(x),..., fu(x) possess at least n — 1 derivatives. If the determinant? 


fi fa S fn 
fi Je xe d 
jn» fn» n (n-1) 
is not zero for at least one point in the interval I, then the functions fi(x), fale), ..., fa (x) 


are linearly independent on the interval. The determinant in this theorem is denoted by 
WIlUfi(z), fa(z),..., fa(x)] and is called the Wronskian of the functions. 


3 If you are unfamiliar with determinants, see Section 3.2. 
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Proof: We prove this theorem by contradiction when n = 2. Let us assume that W [fi (xo); 
fo(xo)| # 0 for some fixed xo in the interval J and that fi(x) and fə(x) are linearly 
dependent on the interval. Since the functions are linearly dependent, there exists C4 and 
C5, both not zero, for which 

Cif (x) + C^ falx) =0 (2.1.39) 


for every x in J. Differentiating Equation 2.1.39 gives 
C; fi (x) + C2 f2(z) = 0. (2.1.40) 


We may view Equation 2.1.39 and Equation 2.1.40 as a system of equations with C4 and C2 
as the unknowns. Because the linear dependence of fı and f? implies that Cı 4 0 and/or 
C2 Æ 0 for each x in the interval, 


_|fi fe 


Wifi(2), fa(x)] = E (2.1.41) 


for every x in I. This contradicts the assumption that W[fi(xo), fa(xo)] 4 0 and fı and fə 
are linearly independent. 


e Example 2.1.8 


Are the functions f(x) = x, g(x) = xe”, and h(x) = xe” linearly dependent on the 
real line? To find out, we compute the Wronskian or 


i ce? xe” 1a zz? 


e 
Wilf(z),g(x),h(x) — |e* (a+ 1)e* (£? +2r)e® |=e%|0 1 2g|= 2e? 90. 
e” (a+2)e” (x? +4ax + 2)e* 0 0 2 


(2.1.42) 


Therefore, x, xe”, and z?e* are linearly independent. 


Having introduced this concept of linear independence, we are now ready to address 
the question of how many linearly independent solutions a homogeneous linear equation 
has. 


Theorem: 


On any interval I over which an n-th order homogeneous linear differential equation is 
normal, the equation has n linearly independent solutions y1(x), ya(x),..., ua (x) and any 
particular solution of the equation on I can be expressed as a linear combination of these 
linearly independent solutions. 


Proof: Again for convenience and clarity we prove this theorem for the special case of n = 2. 
Let yi(a) and yo(x) denote solutions on I of Equation 2.1.2. We know that these solutions 
exist by the existence theorem and have the following values: 


yi(a)=1, yo(a)=0, yi(a)=0, yb(a) =1 (2.1.43) 


at some point a on I. To establish the linear independence of yı and y2 we note that, 
if Ciyi(z) + Coy2(a) = 0 holds identically on I, then Ciy1(z) + CayS(x) = 0 there too. 
Because x = a lies in I, we have that 


Ciyi(a) + Coys(a) = 0, (2.1.44) 


56 Advanced Engineering Mathematics with MATLAB 
and 
Cry; (a) + Coy»(a) = 0, (2.1.45) 


which yields C, = C2 = 0 after substituting Equation 2.1.43. Hence, the solutions yı and 
y» are linearly independent. 

To complete the proof we must now show that any particular solution of Equation 
2.1.2 can be expressed as a linear combination of yı and yo. Because y, yi, and y» are all 
solutions of Equation 2.1.2 on J, so is the function 


Y (a) = y(x) — y(a)yı (x) — v (a)us (2), (2.1.46) 


where y(a) and y'(a) are the values of the solution y and its derivative at x = a. Evaluating 
Y and Y' at x =a, we have that 


Y (a) = y(a) — y(a)yı (a) — y'(a)y2(a) = y(a) — y(a) = 0, (2.1.47) 
and 
Y'(a) = y'(a) — v(a)yi(a) — y'(a)us(a) = y'(@) — y'(a) = 0. (2.1.48) 
Thus, Y is the trivial solution to Equation 2.1.2. Hence, for every x in J, 
y(x) — y(a)yr (x) — y'(a)yo(x) = 0. (2.1.49) 
Solving Equation 2.1.49 for y(x), we see that y is expressible as the linear combination 
y(x) = y(a)yr(x) + v'(a)us (x) (2.1.50) 
of y; and y2, and the proof is complete for n = 2. 


Problems 


Find the general solution to the following differential equations. Check your general solution 
by using dsolve in MATLAB. 


1. y" + 6y' +5y =0 2. y" — 6y' + 10y =0 3. y" —2y t y =0 
4. y" — 3y' + 2y =0 5. y" — 4y' + 8y =0 6. y" + by’ + 9y = 0 
7. y" + 6y' — 40y = 0 8. y" + 4y! + 5y =0 9. y" + 8y' + 25y =0 
10. 4y” — 12y' + 9y =0 11. y" + 8y! + 16y =0 12. y” + Ay" = 0 

13. y" + 4y” =0 14. y" + Qy +y" =0 15. y" — 8y =0 


16. y" = 3y” + 3y" = y! =j 


17. The simplest differential equation with “memory” — its past behavior affects the present 
— is 


A 


"OS A 


/ 


y= e- 79/7 y(a) dz. 


Solve this integro-differential equation by differentiating it with respect to t to eliminate 
the integral. 
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(a) (b ^ (c) 


Figure 2.2.1: Various configurations of a mass/spring system. The spring alone has a length L, which 
increases to L-4- s when the mass is attached. During simple harmonic motion, the length of the mass/spring 
system varies as L + s + x. 


2.2 SIMPLE HARMONIC MOTION 


Second-order, linear, ordinary differential equations often arise in mechanical or elec- 
trical problems. The purpose of this section is to illustrate how the techniques that we just 
derived may be applied to these problems. 

We begin by considering the mass-spring system illustrated in Figure 2.2.1 where a 
mass m is attached to a flexible spring suspended from a rigid support. If there were 
no spring, then the mass would simply fall downward due to the gravitational force mg. 
Because there is no motion, the gravitational force must be balanced by an upward force 
due to the presence of the spring. This upward force is usually assumed to obey Hooke's 
law, which states that the restoring force is opposite to the direction of elongation and 
proportional to the amount of elongation. Mathematically the equilibrium condition can 
be expressed mg — ks. 

Consider now what happens when we disturb this equilibrium. This may occur in 
one of two ways: We could move the mass either upward or downward and then release 
it. Another method would be to impart an initial velocity to the mass. In either case, 
the motion of the mass/spring system would be governed by Newton's second law, which 
states that the acceleration of the mass equals the imbalance of the forces. If we denote the 


downward displacement of the mass from its equilibrium position by positive x, then 
d?r 
mg” —k(s + z) + mg = —kz, (2.2.1) 


since ks — mg. After dividing Equation 2.2.1 by the mass, we obtain the second-order 
differential equation 


dr k 

aa tt =0, (2.2.2) 
or 2 

ug t u?z = 0, (2.9.3) 


where w? = k/m and w is the circular frequency. Equation 2.2.3 describes simple harmonic 
motion or free undamped motion. The two initial conditions associated with this differential 
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Summary of Simple Harmonic Motion 
Displacement,x(t) 
A 
Velocity = 0 . 
/ Amplitude, A 
. Initial JÀ ax 
displacement A po 
4 - 
Time, wr 
Phase = 
E E — 
Period 
v2 1/2 
a(t) = | z2 + A) sin(wt + p) 
WwW 
ES X CA ; 
y = tan”! (=) ; vo = initial velocity 
Uo 


equation are 


x(0) =a, x'(0) = £. (2.2.4) 


The first condition gives the initial amount of displacement while the second condition 
specifies the initial velocity. If a > 0 while 8 < 0, then the mass starts from a point below 
the equilibrium position with an initial upward velocity. On the other hand, if a < 0 with 
B = 0 the mass is at rest when it is released |a| units above the equilibrium position. Similar 
considerations hold for other values of a and f. 

To solve Equation 2.2.3, we note that the solutions of the auxiliary equation m?+w? = 0 
are the complex numbers mı = wi, and mz = —wi. Therefore, the general solution is 


x(t) = Acos(wt) + B sin(wt). (2.2.5) 
The (natural) period of free vibrations is T = 27/w while the (natural) frequency is f = 
1/T = w/(2n). 
e Example 2.2.1 


Let us solve the initial-value problem 


dx 


The physical interpretation is that we have pulled the mass on a spring down 10 units 
below the equilibrium position and then release it from rest at t = 0. Here, w = 2 so that 


x(t) = Acos(2t) + B sin(2t) (2.2.7) 


from Equation 2.2.5. 
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Because x(0) = 10, we find that 
z(0) =10=A-1+B-0 (2.2.8) 
so that A = 10. Next, we note that 


d 
= = —20sin(2t) + 2B cos(2t). (2.2.9) 
Therefore, at t = 0, 
x'(0) =0=-20-0+2B-1 (2.2.10) 


and B = 0. Thus, the equation of motion is z(t) = 10 cos(2t). 

What is the physical interpretation of our equation of motion? Once the system is set 
into motion, it stays in motion with the mass oscillating back and forth 10 units above and 
below the equilibrium position z = 0. The period of oscillation is 27/2 = 7 units of time. 


e Example 2.2.2 


A weight of 45 N stretches a spring 5 cm. At time t = 0, the weight is released from 
its equilibrium position with an upward velocity of 28 cm s^!. Determine the displacement 
a(t) that describes the subsequent free motion. 

From Hooke's law, 

F=mg=45N=kx5cm (2.2.11) 


so that k = 9 N cm7!. Therefore, the differential equation is 


dx = 
The initial displacement and initial velocity are z(0) = 0 cm and z'(0) = —28 cm s^!. The 


negative sign in the initial velocity reflects the fact that the weight has an initial velocity 
in the negative or upward direction. 
Because w? = 196 s7? or w = 14 s^!, the general solution to the differential equation 
is 
a(t) = Acos(14s 1t) + Bsin(14s !t). (2.2.13) 


Substituting for the initial displacement z(0) in Equation 2.2.13, we find that 


z(0)— 0cm 2 A-14- B- 0, (2.2.14) 
and A — 0 cm. Therefore, 
a(t) = Bsin(14s tt) (2.2.15) 
and 
a'(t) = 14s ! Bcos(14s  !t). (2.2.16) 


Substituting for the initial velocity, 
z'(0)— —28cms ! = 14s ! B, (2.2.17) 
and B — —2 cm. Thus the equation of motion is 


z(t) = —2cm sin(14s !t). (2.2.18) 
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Figure 2.2.2: Schematic of a floating body partially submerged in pure water. 


e Example 2.2.3: Vibration of floating bodies 


Consider a solid cylinder of radius a that is partially submerged in a bath of pure water 
as shown in Figure 2.2.2. Let us find the motion of this cylinder in the vertical direction 
assuming that it remains in an upright position. 

If the displacement of the cylinder from its static equilibrium position is z, the weight of 
water displaced equals Agp,z, where pw is the density of the water and g is the gravitational 
acceleration. This is the restoring force according to the Archimedes principle. The mass 
of the cylinder is Ahp, where p is the density of cylinder. From second Newton's law, the 
equation of motion is 


pAhz" + Agpyx = 0, (2.2.19) 

B Pwg 
r= 2.2.20 
" E x=0. ( ) 


From Equation 2.2.20 we see that the cylinder will oscillate about its static equilibrium 
position z = 0 with a frequency of 


1/2 
w= (=) i (2.2.21) 
ph 


When both A and B are both nonzero, it is often useful to rewrite the homogeneous 
solution, Equation 2.2.5, as 
x(t) = Csin(wt + p) (2.2.22) 


to highlight the amplitude and phase of the oscillation. Upon employing the trigonometric 
angle-sum formula, Equation 2.2.22 can be rewritten 


x(t) = C sin(wt) cos(y) + C cos(wt) sin(y) = Acos(wt) + B sin(wt). (2.2.23) 
From Equation 2.2.23, we see that A = C'sin(y) and B = C cos(q). Therefore, 
A? + B? = C? sin? (y) + C? cos? (p) = C?, (2.2.24) 


and C = y A? + B?. Similarly, tan(y) = A/B. Because the tangent is positive in both the 
first and third quadrants and negative in both the second and fourth quadrants, there are 
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two possible choices for y. The proper value of o satisfies the equations A = C'sin(y) and 
B = C cos(). 
If we prefer the amplitude/phase solution 


x(t) = C cos(wt — p), (2.2.25) 
we now have 
x(t) = C cos(wt) cos(y) + C sin(wt) sin(y) = Acos(wt) + B sin(wt). (2.2.26) 


Consequently, A = C cos(y) and B = Csin(y). Once again, we obtain C = VA? + B2. On 
the other hand, tan(y) = B/A. 


Problems 


Solve the following initial-value problems and write their solutions in terms of amplitude 
and phase: 


la” +25¢=0, «(0)=10,  a'(0)— —10 
2. Az" + 9x = 0, x(0) = 2m, x! (0) = 37 
3. 2" +n? — 0, x(0) — 1, z'(0) = v3 


4. A 4-kg mass is suspended from a 100 N/m spring. The mass is set in motion by giving it 
an initial downward velocity of 5 m/s from its equilibrium position. Find the displacement 
as a function of time. 


5. A spring hangs vertically. A weight of mass M kg stretches it L m. This weight is 
removed. A body weighing m kg is then attached and allowed to come to rest. It is then 
pulled down sg m and released with a velocity vo. Find the displacement of the body from 
its point of rest and its velocity at any time t. 


6. A particle of mass m moving in a straight line is repelled from the origin by a force 
F. (a) If the force is proportional to the distance from the origin, find the position of the 
particle as a function of time. (b) If the initial velocity of the particle is av/k, where k is 
the proportionality constant and a is the distance from the origin, find the position of the 
particle as a function of time. What happens if m « 1 and m — 1? 


2.3 DAMPED HARMONIC MOTION 


Free harmonic motion is unrealistic because there are always frictional forces that act to 
retard motion. In mechanics, the drag is often modeled as a resistance that is proportional 
to the instantaneous velocity. Adopting this resistance law, it follows from Newton's second 
law that the harmonic oscillator is governed by 


ax dx 
= —k 
m r—B de 


(2.3.1) 


where f is a positive damping constant. The negative sign is necessary since this resistance 
acts in a direction opposite to the motion. 
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Dividing Equation 2.3.1 by the mass m, we obtain the differential equation of free 
damped motion, 


= 2.3.2 
d ' mdt m. 0, ( ) 
Or E ? 
i IE ape aues 
dg 2A gte 0. (2.3.3) 


We have written 2A rather than just A because it simplifies future computations. The 
auxiliary equation is m? + 2\m + w? = 0, which has the roots 


my =-A4+ VA? -w?, and Mz =—-A- V X — w2. (2.3.4) 


From Equation 2.3.4 we see that there are three possible cases which depend on the 
algebraic sign of A? — w?. Because all of the solutions contain the damping factor e7%, 


A > 0, z(t) vanishes as t > oo. 
e Case I: A’ >w 


Here the system is overdamped because the damping coefficient 5 is large compared to 
the spring constant k. The corresponding solution is 


r(t) = Ae™ + Be”, (2.3.5) 
or 
a(t) = e-* (Ae Vo BET) i (2.3.6) 


In this case the motion is smooth and nonoscillatory. 
e Case I: À 2 w 


'The system is critically damped because any slight decrease in the damping force would 
result in oscillatory motion. The general solution is 


a(t) = Ae™* + Bte™®*, (2.3.7) 


B x(t) = e (A Bt). (2.3.8) 


'The motion is quite similar to that of an overdamped system. 
e Case III: X< w 


In this case the system is underdamped because the damping coefficient is small com- 
pared to the spring constant. The roots mı and m» are complex: 


my, = —A-c iyw? — X, and Mz = —A-—iwvw? — M. (2.3.9) 


The general solution now becomes 


r(t) = e7% [Acos(t w? — x) + Bsin(t w? — A? J : (2.3.10) 
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Figure 2.3.1: The displacement z(t) of a damped harmonic oscillator as a function of time and ¢ = A/w. 


Equation 2.3.10 describes oscillatory motion that decays as e~**. Equations 2.3.6, 2.3.8, and 
2.3.10 are illustrated in Figure 2.3.1 when the initial conditions are z(0) = 1 and z'(0) = 0. 

Just as we could write the solution for the simple harmonic motion in the ampli- 
tude/phase format, we can write any damped solution Equation 2.3.10 in the alternative 
form 


alt) se sin(tv/a? — à? + e) : (2.3.11) 


where C = V A? + B? and the phase angle v is given by tan(y) = A/B such that A = 
Csin(y) and B = C cos(i). The coefficient Ce~™” is sometimes called the damped coefficient 
of vibrations. Because Equation 2.3.11 is not a periodic function, the quantity 2r / vw? — A? 
is called the quasi period and Vw? — A? is the quasi frequency. The quasi period is the time 
interval between two successive maxima of x(t). 


e Example 2.3.1 


A body with mass m = 4 kg is attached to the end of a spring that is stretched 2 m 
by a force of 100 N. Furthermore, there is also attached a dashpot* that provides 6 N of 
resistance for each m/s of velocity. If the mass is set in motion by further stretching the 
spring 4 m and giving it an upward velocity of 10 m/s, let us find the subsequent motion. 

We begin by first computing the constants. The spring constant is k = (100 N)/(2 m) 


= 50 N/m. Therefore, the differential equation is 


sx" + 6a’ + 50r = 0 (2.3.12) 
with z(0) = $ m and z'(0) = —10 m/s. Here the units of x(t) are meters. The characteristic 
or auxiliary equation is 

m? + 12m + 100 = (m + 6)? + 64 = 0, (2.3.13) 


^ A mechanical device — usually a piston that slides within a liquid-filled cylinder — used to damp the 
vibration or control the motion of a mechanism to which is attached. 
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Review of the Solution of the 
Underdamped Homogeneous Oscillator Problem 


mz" + Bx! + kv = 0 subject to z(0) = zo, z'/(0) = vo has the solution 
a(t) = Ae sin(wat + q), 
where 


w = y k/m is the undamped natural frequency, 
à = B/(2m) is the damping factor, 


wq = Vw? — M is the damped natural frequency, 


and the constants A and q are determined by 


2 
We 


and 


or m = —6+ 8i. Therefore, we have an underdamped harmonic oscillator and the general 
solution is 
a(t) = e 9 [Acos(8t) + Bsin(8t)] . (2.3.14) 


Consequently, each cycle takes 27/8 = 0.79 second. This is longer than the 0.63 second 
that would occur if the system were undamped. 
From the initial conditions, 


z(0)— A— 2$, and z'(0)— —10— —6A- 8B. (2.3.15) 
Therefore, A = i and B — -i Consequently, 


x(t) =e [5 cos(8t) — £ sin(8t)] = VS os cos(8t + 2.62244). (2.3.16) 


e Example 2.3.2: Design of a wind vane 


In its simplest form a wind vane is a flat plate or airfoil that can rotate about a vertical 
shaft. See Figure 2.3.2. In static equilibrium it points into the wind. There is usually a 
counterweight to balance the vane about the vertical shaft. 

A vane uses a combination of the lift and drag forces on the vane to align itself with 
the wind. As the wind shifts direction from 05 to the new direction 0;, the direction 0 in 
which the vane currently points is governed by the equation of motion? 


2 
EP ae E E (2.3.17) 


qoe 
dp V dt 


5 For a derivation of Equation 2.3.12 and Equation 2.3.13, see subsection 2 of Section 3 in Barthelt, 
H. P., and G. H. Ruppersberg, 1957: Die mechanische Windfahne, eine theoretische und experimentelle 
Untersuchung. Beitr. Phys. Atmos., 29, 154-185. 
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Wind vector 


Vane 


Vertical axis of rotation A 


Figure 2.3.2: Schematic of a wind vane. The counterbalance is not shown. 


where I is the vane's moment of inertia, N is the aerodynamic torque per unit angle, and 
R is the distance from the axis of rotation to the effective center of the aerodynamic force 
on the vane. The aerodynamic torque is given by 
N = iCLpAV?R, (2.3.18) 
where Cz is the lift coefficient, p is the air density, A is the vane area, and V is the wind 
speed. 
Dividing Equation 2.3.17 by I, we obtain the second-order ordinary differential equation 
d^(0—0;)  NRd(0—0) N 


| 0 — 0j) — 0. 2.3.19 
dt? IV dt do d eo) 


The solution to Equation 2.3.19 is 


NRt 
where EA 
N N*R 
o N 2.3.21 
I 4PV?’ ( ) 


and A and y are the two arbitrary constants that would be determined by presently unspec- 
ified initial conditions. Consequently an ideal wind vane is a damped harmonic oscillator 
where the wind torque should be large and its moment of inertia should be small. 


Problems 


For the following values of m, 8, and k, find the position x(t) of a damped oscillator for the 
given initial conditions: 


3.m=4, 8220, k=169, 2(0)=4, z'(0)- 16 


4. For a fixed value of A/w, what is the minimum number of cycles required to produce a 
reduction of at least 5096 in the maxima of an underdamped oscillator? 
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5. For what values of c does z” + cx’ + 4x = 0 have critically damped solutions? 
y 


6. For what values of c are the motions governed by 4x" + ca’ + 9x = 0 (a) overdamped, 
(b) underdamped, and (c) critically damped? 


7. For an overdamped mass-spring system, prove that the mass can pass through its equi- 
librium position x = 0 at most once. 


2.4 METHOD OF UNDETERMINED COEFFICIENTS 


Homogeneous ordinary differential equations become nonhomogeneous when the right 
side of Equation 2.0.1 is nonzero. How does this case differ from the homogeneous one that 
we have treated so far? 

To answer this question, let us begin by introducing a function yp(x) — called a par- 
ticular solution — whose only requirement is that it satisfies the differential equation 


Tee a, (x) ee + ao(z)yy = f(x). (2.4.1) 


Then, by direct substitution, it can be seen that the general solution to any nonhomoge- 
neous, linear, ordinary differential equation is 


ylz) = ya (z) + yp(), (2.4.2) 


where yg (x) — the homogeneous or complementary solution — satisfies 


IH be aic) UR + ao(z)yg = 0. (2.4.3) 


Why have we introduced this complementary solution - because the particular solution 
already satisfies the ordinary differential equation. The purpose of the complementary 
solution is to introduce the arbitrary constants that any general solution of an ordinary 
differential equation must have. Thus, because we already know how to find yx(a), we 
must only invent a method for finding the particular solution to have our general solution. 


e Example 2.4.1 
Let us illustrate this technique with the second-order, linear, nonhomogeneous ordinary 


differential equation 
y" — Ay + 4y = 2e?* + Ax — 12. (2.4.4) 


Taking y(x) = yg (£) + yp(x), direction substitution yields 
Vg V, — Avg + Yp) + An vy) = 2e?* + Ag — 12. (2.4.5) 
If we now require that the particular solution y,(a) satisfies the differential equation 
y!! — 4y! + Ay, = 2€? + Ax — 12, (2.4.6) 
Equation 2.4.5 simplifies to the homogeneous ordinary differential equation 


Yn — Ayg + 4yH = 0. (2.4.7) 
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A quick check? shows that the particular solution to Equation 2.4.6 is y,(x) = zx?e?* 4- — 2. 


Using techniques from the previous section, the complementary solution is yg (z) = Ce?" + 
Core”. 


In general, finding y,(a) is a formidable task. In the case of constant coefficients, several 
techniques have been developed. The most commonly employed technique is called the 
method of undetermined coefficients, which is used with linear, constant coefficient, ordinary 
differential equations when f(x) is a constant, a polynomial, an exponential function e°”, 
sin(Gx), cos(fx), or finite sum and products of these functions. Thus, this technique applies 
when the function f(x) equals e sin(x) — (3x — 2)e~?* but not when it equals ln(x). 

Why does this technique work? The reason lies in the set of functions that we have 
allowed to be included in f(x). They enjoy the remarkable property that derivatives of 
their sums and products yield sums and products that are also constants, polynomials, 
exponentials, sines, and cosines. Because a linear combination of derivatives such as ayp + 
by, + cyp must equal f(x), it seems reasonable to assume that yp(x) has the same form as 
f(x). The following examples show that our conjecture is correct. 


e Example 2.4.2 


Let us illustrate the method of undetermined coefficients by finding the particular 
solution to 
y" —2y' +y = x + sin(z) (2.4.8) 


by the method of undetermined coefficients. 
From the form of the right side of Equation 2.4.8, we guess the particular solution 


Yp(z) = Az + B + C sin(x) + D cos(x). (2.4.9) 
Therefore, 
y,(x) = A + C cos(x) — D sin(x), (2.4.10) 
and 
y, (x) = —C sin(x) — D cos(x). (2.4.11) 


Substituting into Equation 2.4.8, we find that 
Yp — 2Yp + Yp = Ax + B — 2A — 2C cos(x) + 2D sin(x) = x + sin(x). (2.4.12) 
Since Equation 2.4.12 must be true for all x, the constant terms must sum to zero or 
B — 2A = 0. Similarly, all of the terms involving the polynomial x must balance, yielding 
A = 1 and B = 2A = 2. Turning to the trigonometric terms, the coefficients of sin(x) and 
cos(r) give 2D = 1 and —2C = 0, respectively. Therefore, the particular solution is 
yp(z) = x +2 4- 1 cos(x), (2.4.13) 


and the general solution is 


ylz) = yg (£) + yp(a) = Cre? + Cone” + x -+2 + $ cos(a). (2.4.14) 


6 We will show how yp(x) was obtained momentarily. 
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We can verify our result by using the symbolic toolbox in MATLAB. Typing the com- 
mand: 


dsolve(’D2y-2*Dyt+y=x+sin(x)’,’x’) 
yields 


ans = 
x+2+1/2*cos (x) +C1*exp (x) +C2*exp (x) *x 


e Example 2.4.3 
Let us find the particular solution to 
y" +y' — 2y = re” (2.4.15) 


by the method of undetermined coefficients. 
From the form of the right side of Equation 2.4.15, we guess the particular solution 


Yp(z) = Axe” + Be”. 2.4.16) 
Therefore, 
Yp(x) = Axe” + Ae” + Be”, 2.4.17) 
and 
Yp (z) = Awe” + 2Ae” + Be”. 2.4.18) 


Substituting into Equation 2.4.15, we find that 
3Ae* = xe”. (2.4.19) 


Clearly we cannot choose a constant A such that Equation 2.4.19 is satisfied. What went 
wrong? 
To understand why, let us find the homogeneous or complementary solution to Equation 
2.4.15; it is 
yu (x) = Cie ?* + Coe”. (2.4.20) 


Therefore, one of the assumed particular solutions, Be”, is also a homogeneous solution 
and cannot possibly give a nonzero left side when substituted into the differential equation. 
Consequently, it would appear that the method of undetermined coefficients does not work 
when one of the terms on the right side is also a homogeneous solution. 

Before we give up, let us recall that we had a similar situation in the case of linear 
homogeneous second-order ordinary differential equations when the roots from the auxiliary 
equation were equal. There we found one of the homogeneous solutions was e"'!?, We 
eventually found that the second solution was ze™!”. Could such a solution work here? Let 
us try. 

We begin by modifying Equation 2.4.16 by multiplying it by x. Thus, our new guess 
for the particular solution reads 


yp(z) = Az?e* + Bre”. (2.4.21) 


Then, 
yp = Ax^e* + 2Axe* + Bae” + Be”, (2.4.22) 
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and 
yp = Axe” + AAxe* + 24e? + Bre” + 2Be*. (2.4.23) 


Substituting Equation 2.4.21 into Equation 2.4.15 gives 
Yp Vp — 2yp = GAxe® + 2Ae” + 3Be” = re”. (2.4.24) 


Grouping together terms that vary as xe”, we find that 6A = 1. Similarly, terms that vary 
as e” yield 2A+ 3B = 0. Therefore, 


yp() = 1a?e* — tre”, (2.4.25) 
so that the general solution is 


(x) = yn (£) + yp(z) = Cie 7" + Coe? + 1a?e* — ire”. (2.4.26) 


In summary, the method of finding particular solutions to higher-order ordinary differ- 
ential equations by the method of undetermined coefficients is as follows: 


e Step 1: Find the homogeneous solution to the differential equation. 

e Step 2: Make an initial guess at the particular solution. The form of y,(z) is a lin- 
ear combination of all linearly independent functions that are generated by repeated 
differentiations of f(x). 

e Step 3: If any of the terms in yj (x) given in Step 2 duplicate any of the homogeneous 
solutions, then that particular term in y,(x) must be multiplied by x”, where n is the 
smallest positive integer that eliminates the duplication. 

e Example 2.4.4 
Let us apply the method of undetermined coefficients to solve 


y" +y = sin(x) — e” cos(5z). (2.4.27) 


We begin by first finding the solution to the homogeneous version of Equation 2.4.27: 


yg t yu =0. (2.4.28) 
Its solution is 
yn(x) = Acos(z) + B sin(x). (2.4.29) 


To find the particular solution we examine the right side of Equation 2.4.27 or 
f(x) = sin(x) — e?” cos(5z). (2.4.30) 
Taking a few derivatives of f(x), we find that 
f'(x) = cos(x) — 3e?” cos(5z) + be?* sin(5z), (2.4.31) 


f" (x) = — sin(x) — 9e?* cos(5z) + 30e?* sin(5x) + 25e?* cos(5z), (2.4.32) 
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and so forth. Therefore, our guess at the particular solution is 
yy (x) = Cz sin(x) + Dx cos(x) + Ee?” cos(5z) + Fe?” sin(5z). (2.4.33) 


Why have we chosen rsin(r) and xcos(x) rather than sin(x) and cos(x)? Because sin(x) 
and cos(r) are homogeneous solutions to Equation 2.4.27, we must multiply them by a 
power of x. 

Since 


y, (x) = 2C cos(x) — Cx sin(x) — 2D sin(x) — Dz cos(x) 
+ (30F — 16E)e?* cos(5x) — (30E + 16F)e?" sin(52), (2.4.34) 


Yp + yp = 2C cos(x) — 2D sin(x) 
+ (30F — 15E)e?* cos(5x) — (30E + 15F)e?* sin(52) (2.4.35) 
= sin(x) — e?” cos(5z). (2.4.36) 


Therefore, 2C = 0, —2D = 1, 30F — 15E = —1, and 30E + 15F = 0. Solving this system 


of equations yields C — 0, D -i E ze and F = -&. Thus, the general solution is 
y(x) = Acos(x) + Bsin(x) — $x cos(x) + 4e?” [cos(5x) — 2 sin(5z)]. (2.4.37) 
Problems 


Use the method of undetermined coefficients to find the general solution of the following 
differential equations. Verify your solution by using dsolve in MATLAB. 


1. y" - 4y' -3y— xz 1 2. y" — y = e? — 2e ?7 

3. y" +2y' + 2y = Qn? + 2x +4 4. y! +y =? +r 

5. y" +2y' = 2g + 5 — e7?" 6. y" — 4y' + 4y = (x + 1)e?* 
T. y” + Ay! + Ay = ze” 8. y" — 4y = 4sinh(2z) 

9. y" + 9y = z cos(3x) 10. y” +y = sin(x) + z cos(x) 
11. Solve 


y" +2ay' = sin? (uz), y(0) = y'(0) = 0, 


by (a) the method of undetermined coefficients and (b) integrating the ordinary differential 
equation so that it reduces to 


x  sin(2ar) 
2 4a ' 


y + 2ay = 


and then using the techniques from the previous chapter to solve this first-order ordinary 
differential equation. 
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2.5 FORCED HARMONIC MOTION 


Let us now consider the situation when an external force f(t) acts on a vibrating mass 
on a spring. For example, f(t) could represent a driving force that periodically raises and 
lowers the support of the spring. The inclusion of f (t) in the formulation of Newton's second 
law yields the differential equation 


d?z dx 
mg ka bT + f(t), (2.5.1) 
ax Bde k _ fit) 
d? "mdi m m’ (252) 
or 2 
x ax whe 82 = 
TE + 2A r F(t), (2.5.3) 


where F(t) = f(t)/m, 24 = B/m, and w? = k/m. To solve this nonhomogeneous equation 
we will use the method of undetermined coefficients. 


e Example 2.5.1 
Let us find the solution to the nonhomogeneous differential equation 
y" + 2y' +y = 2sin(t), (2.5.4) 


subject to the initial conditions y(0) — 2 and y'(0) — 1. 
'The homogeneous solution is easily found and equals 


yu(t) = Ae"! + Bte™. (2.5.5) 


From the method of undetermined coefficients, we guess that the particular solution is 


Yp(t) = C cos(t) + D sin(t), (2.5.6) 
so that 
y, (t) = —C sin(t) + D cos(t), (2.5.7) 
and 
y» (t) = —C cos(t) — Dsin(t). (2.5.8) 


Substituting yp(t), y(t), and y; (t) into Equation 2.5.4 and simplifying, we find that 
—2C sin(t) + 2D cos(t) = 2 sin(t) (2.5.9) 


or D —0 and C= -1. 
To find A and B, we now apply the initial conditions on the general solution 


y(t) = Ae™ + Bte™ — cos(t). (2.5.10) 
The initial condition y(0) — 2 yields 


y(0) =A+0-1=2, (2.5.11) 
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or A = 3. The initial condition y'(0) = 1 gives 
y'(0)=-A+B=1, (2.5.12) 


or B = 4, since 


y (t) = — Ae! + Be! — Bte™ +sin(t). (2.5.13) 


Therefore, the solution that satisfies the differential equation and initial conditions is 


y(t) = 3e^* + 4te™ — cos(t). (2.5.14) 


e Example 2.5.2 


Let us solve the differential equation for a weakly damped harmonic oscillator when 
the constant forcing Fo “turns on” at t = to. The initial conditions are that x(0) = rg and 
x'(0) = vo. Mathematically, the problem is 


0, O0<t< to, 


ee (2.5.15) 


x" -2Az! + wr = { 


with z(0) = zo and z'(0) = vo. 
To solve Equation 2.5.15, we first divide the time domain into two regions: 0 « t < to 
and to < t. For 0 < t < to, 
a(t) = Ae cos(wat) + Be sin(wat), (2.5.16) 


where w2 = w? — A?. Upon applying the initial conditions, 


Ug + ALO X 
— e 
Wd 


a(t) = zoe cos(wat) + sin(wat), (2.5.17) 


as before. 
For the region tọ < t, we write the general solution as 


n 
z(t) = Ae^* cos(wat) + Be~™ sin(wgt) + a 
+ Ce?) coslwa(t — to)] + De 7*9 sinfwa(t — to]. DIS) 


Why have we written our solution in this particular form rather than the simpler 
—At —At a: Fo 
x(t) = Ce ^" cos(wat) + De ^" sin(wat) + 7 (2.5.19) 
w 


Both solutions satisfy the differential equation, as direct substitution verifies. However, the 
algebra is greatly simplified when Equation 2.5.18 rather than Equation 2.5.19 is used in 
matching the solution from each region at t = tọ. There both the solution and its first 
derivative must be continuous or 


tts Jm ws, and - a'(tg) ew), (2.5.20) 
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Review of the Solution of the 
Forced Harmonic Oscillator Problem 


The undamped system mz" +ka = Fo cos(wot) subject to the initial conditions z(0) = 
zo and z'(0) = vo has the solution 


a(t) = sin(wt) + (s = x) cos(wt) + 


fo 
w? — we cos(wot), 


where fo = Fo/m and w = \/k/m. The underdamped system mz" + Ba’! + kx = 
Fo cos(wot) has the steady-state solution 


IL fo E —1 2Awo ) 
ud V (w? — wg)? + Quo)? = Z = E -w/l 


where 2\ = b/m. 


where tọ and t are points just below and above to, respectively. When Equation 2.5.17 
and Equation 2.5.18 are substituted, we find that C = —Fo/w?, and wq D = AC. Thus, the 
solution for the region to < t is 


A vo + Azo "E 


Wd 


Fi 
= eto) cos[wa(t — to)] — 5 
W WqW 


F, 
sin(wat) + = (2.5.21) 


t) = zoe ^* cos(wat 
x(t) = zoe ™ cos(wat) + -2 


AFo e Mt-to) 


sin[wa(t — to)]. 


As we will see in Chapter 12, the technique of Laplace transforms is particularly well suited 
for this type of problem when the forcing function changes abruptly at one or more times. 


As noted earlier, nonhomogeneous solutions consist of the homogeneous solution plus 
a particular solution. In the case of a damped harmonic oscillator, another, more physical, 
way of describing the solution involves its behavior at large time. That portion of the 
solution which eventually becomes negligible as t — oo is often referred to as the transient 
term, or transient solution. In Equation 2.5.14 the transient solution equals 3e~' + 4te~*. 
On the other hand, the portion of the solution that remains as t > oo is called the steady- 
state solution. In Equation 2.5.14 the steady-state solution equals — cos(t). 

One of the most interesting forced oscillator problems occurs when 6 = 0 and the 
forcing function equals Fo sin(wot), where Fo is a constant. Then the initial-value problem 
becomes 

dz 5 . 
—— wx = Fosin(wot). (2.5.22) 
dt? 
Let us solve this problem when x(0) = z'(0) = 0. 
'The homogeneous solution to Equation 2.5.22 is 


xj (t) = Acos(wt) + Bsin(wt). (2.5.23) 
To obtain the particular solution, we assume that 


xp(t) = Ccos(wot) + D sin(wot). (2.5.24) 
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Figure 2.5.1: The solution, Equation 2.5.31, as a function of time when w = 1 and wo equals (a) 1.02, (b) 
1.2, and (c) 2 


This leads to 


a(t) = —Cwo sin(wot) + Dwo cos(wot), (2.5.25) 
x(t) = —Cug cos(wot) + Dwg sin(wot), (2.5.26) 

and 
zy + wtp = C(w? — we) cos(wot) + D(w? — wg) sin(wot) = Fo sin(wot). (2.5.27) 


We immediately conclude that C(w? — wê) = 0, and D(w? — wg) = Fo. Therefore, 


F 
C=0, and D2-———, (2.5.28) 
w? — uj 
provided that w Æ wọ. Thus, 
fo. 
Lp(t) = we — we sin(wot). (2.5.29) 


To finish the problem, we must apply the initial conditions to the general solution 
: Po X 
z(t) = Acos(wt) + B sin(wt) + — ——3 sin(wot). (2.5.30) 


w? — tj 


From z(0) = 0, we find that A = 0. On the other hand, z'(0) = 0 yields B = —woFo/|w(w?— 
w@)]. Thus, the final result is 


x(t) = ea [w sin(wot) — wo sin(wt)] . (2.5.31) 


Equation 2.5.31 is illustrated in Figure 2.5.1 as a function of time. 
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The most arresting feature in Figure 2.5.1 is the evolution of the uniform amplitude 
of the oscillation shown in frame (c) into the one shown in frame (a) where the amplitude 
exhibits a sinusoidal variation as wo — w. In acoustics these fluctuations in the amplitude 
are called beats, the loud sounds corresponding to the larger amplitudes. 

As our analysis indicates, Equation 2.5.31 does not apply when w = wọ. As we shall 
shortly see, this is probably the most interesting configuration. We can use Equation 2.5.31 
to examine this case by applying L'Hópital's rule in the limiting case of wo — w. This 
limiting process is analogous to “tuning in” the frequency of the driving frequency [wo/ (27)] 
to the frequency of free vibrations [w/(27)]. From experience, we expect that given enough 
time we should be able to substantially increase the amplitudes of vibrations. Mathematical 
confirmation of our physical intuition is as follows: 


w sin(wot) — wo sin(wt) 


— lim F 2.5.32 
x(t) ae o w(w? ni wg) 
i u : t 
— RF, lim d|w sin(wot) d )]/dwo 2.5.33) 
wow d[w(w? — w3 )]/dwo 
t t) — sin(wt 
- tim * cos(wot) — sin(wt) 2.5.34) 
Wow —2Qwow 
m wt cos(wt) — sin(wt) 2.5.35) 
—2w2 
Fo é Fot 
ER P aah 2.5. 
San sin(wt) 7m cos(wt) SB) 


As we suspected, as t — oo, the displacement grows without bounds. This phenomenon is 
known as pure resonance. We could also have obtained Equation 2.5.36 directly using the 
method of undetermined coefficients involving the initial value problem 


2 
a + war = Fosin(wt), x(0) =a (0) = 0. (2.5.37) 


Because there is almost always some friction, pure resonance rarely occurs and the 
more realistic differential equation is 


d d 
= + 277 + wa = Fy sin(wot). (2.5.38) 


Its solution is 


ENE HK i 
x(t) = Ce ^t sin (vos — wê + e) + TE = "3 TX sin(wot — 0), (2.5.39) 


where 


2AÀwo0 wu? — we 


sin(0) = ——— Tz cos(0) = 5 T3 
(w? — we)? + AM ug (w? — we)? + AM wg 


(2.5.40) 


and C and o are determined by the initial conditions. To illustrate Equation 2.5.39 we 
rewrite the amplitude and phase of the particular solution as 


Fo u Fo 
J (w? — we)? + 4d2w2 — w?/(1 — r?) + AB?r? 


28r 
1— r2?’ 


and tan(0) = (2.5.41) 
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Figure 2.5.2: The amplitude of the particular solution Equation 2.5.39 for a forced, damped simple 
harmonic oscillator (normalized with Fo/w?) as a function of r = wo/w. 


where r = wo/w and 8 = A/w. Figures 2.5.2 and 2.5.3 graph Equation 2.5.41 as functions 
of r for various values of 5. 


e Example 2.5.3: Electrical circuits 


In the previous chapter, we saw how the mathematical analysis of electrical circuits 
yields first-order linear differential equations. In those cases we only had a resistor and 
capacitor or a resistor and inductor. One of the fundamental problems of electrical circuits 
is a circuit where a resistor, capacitor, and inductor are connected in series, as shown in 
Figure 2.5.4. 

In this RCL circuit, an instantaneous current flows when the key or switch K is closed. 
If Q(t) denotes the instantaneous charge on the capacitor, Kirchhoff's law yields the differ- 
ential equation 


dl Q 
L— I4 -—-E(t 2.5.42 
— +RI+ 5 = E(t), (2.5.42) 


where E(t), the electromotive force, may depend on time, but where L, R, and C are 
constant. Because I = dQ/dt, Equation 2.5.42 becomes 


PQ dQ Q 
mr x- 


L E(t). (2.5.43) 

Consider now the case when resistance is negligibly small. Equation 2.5.43 will become 
identical to the differential equation for the forced simple harmonic oscillator, Equation 
2.5.3, with A = 0. Similarly, the general case yields various analogs to the damped harmonic 
oscillator: 


Case 1 Overdamped R? > AL/C 
Case 2 Critically damped R? — AL/C 
Case 3 Underdamped R? < 4L/C 


In each of these three cases, Q(t) > 0 as t + oo. (See Problem 6.) Therefore, an RLC 
electrical circuit behaves like a damped mass-spring mechanical system, where inductance 
acts like mass, resistance is the damping coefficient, and 1/C is the spring constant. 
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Figure 2.5.3: The phase of the particular solution, Equation 2.5.39, for a forced, damped simple harmonic 
oscillator as a function of r = wo/w. 


Problems 
1. Find the values of y so that x” + 6x’ + 18 = cos(7t) is in resonance. 


2. The differential equation 
x" +2a' -- 2x = 10sin(2t) 


describes a damped, forced oscillator. If the initial conditions are z(0) = zo and z'(0) = 0, 
find its solution by hand and by using MATLAB. Plot the solution when xg = —10, —9,...,9 
, 10. Give a physical interpretation to what you observe. 


3. At time t = 0, a mass m is suddenly attached to the end of a hanging spring with a 
spring constant k. Neglecting friction, find the subsequent motion if the coordinate system 


is chosen so that z(0) — 0. 


Step 1: Show that the differential equation is 


with the initial conditions z(0) = z'(0) = 0. 
Step 2: Show that the solution to Step 1 is 

x(t) = mg [1 — cos(wt)] /k, w? = kfm. 
4. Consider the electrical circuit shown in Figure 2.5.4, which now possesses negligible 
resistance and has an applied voltage E(t) = Eo[1— cos(wt)]. Find the current if the circuit 
is initially dead. 
5. Find the general solution to the differential equation governing a forced, damped har- 


monic equation 
mz" + cx! + kr = Fosin(wt), 
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Figure 2.5.4: A simple electrical circuit containing a resistor of constant resistance R, capacitor of constant 
capacitance C, and inductor of constant inductance L driven by a time-dependent electromotive force E(t). 


where m, c, k, Fo, and w are constants. Write the particular solution in amplitude/phase 
format. 


6. Prove that the transient solution to Equation 2.5.45 tends to zero as t — oo if R, C, and 
L are greater than zero. 


2.6 VARIATION OF PARAMETERS 


As the previous section has shown, the method of undetermined coefficients can be 
used when the right side of the differential equation contains constants, polynomials, expo- 
nentials, sines, and cosines. On the other hand, when the right side contains terms other 
than these, variation of parameters provides a method for finding the particular solution. 

To understand this technique, let us return to our solution of the first-order ordinary 
differential equation 


d 
= + Ploy = f). (2.6.1) 
z 
Its solution is 
y(x) = C,e- J PO de + e J P@) ae / el P(2) 12 f(x) da. (2.6.2) 


The solution, Equation 2.6.2, consists of two parts: The first term is the homogeneous 
solution and can be written yg (x) = Ciyi(x), where yı (£) = e f P949. The second term 
is the particular solution and equals the product of some function of x, say u(x), times 
yı (z): 

ues ren / ef PO) ff) de = ui (à) (). (2.6.3) 


This particular solution bears a striking resemblance to the homogeneous solution if we 
replace uı(x) with C4. 

Variation of parameters builds upon this observation by using the homogeneous solution 
y1(z) to construct a guess for the particular solution yy (zx) = ui(x)yi(z). Upon substituting 
this guessed y,(x) into Equation 2.6.1, we have that 


d 
doi (u131) + P(z)uiyi = f(x), (2.6.4) 
uy T na + P(xuy = f(z), (2.6.5) 
or , 
nm = f). (2.6.6) 
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since yj + P(z)yi = 0. 
Using the technique of separating the variables, we have that 


f(z) f(z) 
du, = dz, and = da. 2.6.7 
m IY z n uj (a) TICY x ( ) 
Consequently, the particular solution equals 
2 = (x) 
Yp(x) = ur(x)yi (x) = yr (x) TOS dx. (2.6.8) 


Upon substituting for yı (x), we obtain Equation 2.6.3. 
How do we apply this method to the linear second-order differential equation 
ag(x)y" + aiy' (x) + ao(x)y = g(a), (2.6.9) 
or 
y" + P(z)y + Q(z)y = f(z), (2.6.10) 
where P(x), Q(x), and f(x) are continuous on some interval I? 
Let yi(a) and yo(a) denote the homogeneous solutions of Equation 2.6.10. That is, 
yı(x) and yo(x) satisfy 


yr + P(x)yi + Q(z)yı = 0, (2.6.11) 
and 
y2 + P(z)y5 + Q(x)y2 = 0. (2.6.12) 
Following our previous example, we now seek two functions u(x) and w2(x) such that 
yp(z) = ui(x)yi(z) + u2(x£)y2(x) (2.6.13) 


is a particular solution of Equation 2.6.10. Once again, we replaced our arbitrary constants 
C, and C2 by the “variable parameters” uj(r) and uo(r). Because we have two unknown 
functions, we require two equations to solve for uj(x) and us(x). One of them follows from 
substituting yp(x) = ui(z)yi(x) + u2(x)y2(x) into Equation 2.6.10. The other equation is 
yi(x)u} (x) + ys(z)uS (x) = 0. (2.6.14) 
This equation is an assumption that is made to simplify the first and second derivative, 
which is clearly seen by computing 
Yp = uii + yuh + uy» + Yous = U1Y1 + uy», (2.6.15) 
after applying Equation 2.6.14. Continuing to the second derivative, 
yy = uir + yu, + uyy + yaus. (2.6.16) 
Substituting these results into Equation 2.6.10, we obtain 


yp + P(x)y, + Q(x) yp = vay + yiii + uzy3 + your 


+ Pury, + Pusy, + Quiyi + Quays, (2.6.17) 
= u [yr + P(x)yi + Q(a) yi] + u2 [y + P(£)ys + Q(x) ye] 
+ yiu + yaus = f(x). (2.6.18) 
Hence, u1(x) and u2(r) must be functions that also satisfy the condition 
yiu + yous = f(x). (2.6.19) 


It is important to note that the differential equation must be written so that it conforms to 
Equation 2.6.10. This may require the division of the differential equation by a2(a) so that 
you have the correct f(x). 

Equations 2.6.14 and 2.6.19 constitute a linear system of equations for determining the 
unknown derivatives u, and uh. By Cramer’s rule,’ the solutions of Equation 2.6.14 and 


7 Tf you are unfamiliar with Cramer's rule, see Section 3.3. 
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Equation 2.6.19 equal 


W. W. 
uj(z)- ae and u»(r) = m (2.6.20) 
where 
ree wi - [ye " Ws =|" 0 (2.6.21) 
Yi v f(z) w yi f(x) 


The determinant W is the Wronskian of yı and yo. Because yı and y» are linearly indepen- 
dent on J, the Wronskian will never equal to zero for every x in the interval. 
These results can be generalized to any nonhomogeneous, nth-order, linear equation of 
the form 
y P, i (z)y 79 + Py(a)y! + Po(x) = f(a). (2.6.22) 


If yg (x) = Cryi(x) + Coys(z) +--+ Coys (x) is the complementary function for Equation 
2.6.22, then a particular solution is 


yp(z) = ui(z)ui(z) + us(x)yo(z) + +++ + Un(Z)Yn(x), (2.6.23) 
where the u}, k = 1,2,...,n, are determined by the n equations: 
yiuj + yos: d yu, = 0, (2.6.24) 
yiu t yaus + yu us = 0, (2.6.25) 
yu, + yah pene py Da = f(z). (2.6.26) 


The first n — 1 equations in this system, like Equation 2.6.14, are assumptions made to 
simplify the first n — 1 derivatives of yp(x). The last equation of the system results from 
substituting the n derivative of yp(x) and the simplified lower derivatives into Equation 
2.6.22. Then, by Cramer's rule, we find that 


> 


u, = k-—1,2,;. m, (2.6.27) 


WwW’ 
where W is the Wronskian of y1, ys, ...., Yn, and Wy is the determinant obtained by replacing 
the kth column of the Wronskian by the column vector [0,0,0, ---, f(z)]*. 


e Example 2.6.1 
Let us apply variation of parameters to find the general solution to 
y" +y' — 2y = ae". (2.6.28) 
We begin by first finding the homogeneous solution that satisfies the differential equa- 


tion 
V yg — 2yn = 0. (2.6.29) 


Applying the techniques from Section 2.1, the homogeneous solution is 


yn (x) = Ae” + Be ?*, (2.6.30) 


Higher-Order Ordinary Differential Equations 81 


yielding the two independent solutions yı (x) = e”, and yo(x) = e ?*. Thus, the method of 
variation of parameters yields the particular solution 


yp(z) = eu, (x) +e 7 ua(z). (2.6.31) 
From Equation 2.6.14, we have that 
e*u, (x) + e ?* ug (x) = 0, (2.6.32) 
while 
e*u (£) — 2e ?*u (x) = re”. (2.6.33) 
Solving for u(x) and u(x), we find that 
uj (x) = ic, 2.6.34) 
Or 
u(x) = 4a”, 2.6.35) 
and 
uj(r)- —ize?", 2.6.36) 
or 
u2(x) = (1 — 3a)e?v. 2.6.37) 
Therefore, the general solution is 
y(x) = Ae” + Be ?* + e*ui (x) + e ?*us(x) (2.6.38) 
= Ae” + Be 7" + tge” + L(1— 3a)e” (2.6.39) 
= Ce? + Be?" + (4a? — Ix) e. (2.6.40) 
e Example 2.6.2 
Let us find the general solution to 
y" -2y +y =e ln(x) (2.6.41) 


by variation of parameters on the interval (0, oo). 
We start by finding the homogeneous solution that satisfies the differential equation 


yn + 2y + yn = 0. (2.6.42) 

Applying the techniques from Section 2.1, the homogeneous solution is 
yu(z)— Ae” + Bre”, (2.6.43) 
yielding the two independent solutions y;(x) = e~* and y2(x) = xe~*. Thus, the particular 


solution equals 
Yp(x) = e “ur (x) + xe * u(x). (2.6.44) 
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From Equation 2.6.14, we have that 
e "u(x) + ze *uh(x) = 0, (2.6.45) 


while 
—e "w(x) + (1— z)e *us(r) = e? ln(x). (2.6.46) 


Solving for uj (x) and u4(x), we find that 


uj (x) = —xIn(z), 2.6.47) 
or 
u(x) = 4x? — x? ln(x), 2.6.48) 
and 
u(x) = ln(x), 2.6.49) 
or 
uz(x) = z ln(z) — z. 2.6.50) 
Therefore, the general solution is 
y(x) = Ae * + Bxe * +e "uy(x) + ze "us(ax) (2.6.51) 
= Ae * + Bze * + $x? In(x)e~® — 22e 77. (2.6.52) 


We can verify our result by using the symbolic toolbox in MATLAB. Typing the com- 
mand: 


dsolve(?D2y*2*Dy*y-exp(-x)*log(x)? ,?x?) 
yields 


ans = 
1/2*exp (-x) *x*2*log (x) -3/4*exp (-x) *x*2+C1*exp (-x) +C2*exp (-x) *x 


e Example 2.6.3 


So far, all of our examples have yielded closed-form solutions. To show that this is not 
necessarily so, let us solve 
y” — 4y = e?* /x (2.6.53) 


by variation of parameters. 
Again we begin by solving the homogeneous differential equation 


Vg — 4yn = 0, (2.6.54) 


which has the solution 
yo (x) = Ae?* + Be”. (2.6.55) 


2x 2x 


Thus, our two independent solutions are yı(x) = e°” and yo(%) = e **. Therefore, the 


particular solution equals 


—2x 


yp(x) = eur (x) + e?" u(x). (2.6.56) 


From Equation 2.6.14, we have that 


eu (x) +e 7" ut (x) = 0, (2.6.57) 
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while 
2e7* ui (x) — 2e ?*u,(z) = e?* /g. 


Solving for uw, (x) and u5(x), we find that 


1 
Or 
u(x) = į n]a], 
and i 
€ X 
u(x) = zd 
Or e qat 
£o 


Therefore, the general solution is 
= Ae?* + Be?* + e?*w(z) + e? us (a) 


—2x 1 2x 1,—2x 
Ben dme — ze i! 
T 


ylz) 


= Ae” + 
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(2.6.58) 


2.6.59) 


2.6.60) 


2.6.61) 


2.6.62) 


(2.6.63) 


a dt. (2.6.64) 


Use variation of parameters to find the general solution for the following differential equa- 
tions. Then see if you can obtain your solution by using dsolve in MATLAB. 


1. y” — 4y' + 3y =e* 2. y^ — y —2y — x 


4. y" + 9y = 2sec(z) 5. y" + Ay + Ay = ve ?* 
7. y" — Ay Ay — (zc 1)?" — 8. y" — 4y = sin? (x) 
10. y" +y = tan(x) 


2.7 EULER-CAUCHY EQUATION 


3. y" — 4y = ze” 
6. y" + 2ay! = sin? (wa) 


9. y" — 2y' +y =e /ax 


'The Euler-Cauchy or equidimensional equation is a linear differential equation of the 


form 
d"y $45 9 dy 


qe i n— L. vox) 
Ant + An—12 dini a£ 


dx” 


where an, An—1, « 
power to which x is raised equals the order of differentiation. 


d; 9 = f(z), 


(2.7.1) 


., @g are constants. The important point here is that in each term the 


To illustrate this equation, we will focus on the homogeneous, second-order, ordinary 


differential equation 
dy dy 
2 
+b. +cy=0. 
dd EP A 


(2.7.2) 
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The solution of higher-order ordinary differential equations follows by analog. If we wish to 
solve the nonhomogeneous equation 
d? d 
ax? 9 ybr” + cy = f(x), (2.7.3) 
da? dx 
we can do so by applying variation of parameters using the complementary solutions that 
satisfy Equation 2.7.2. 
Our analysis starts by trying a solution of the form y = xz", where m is presently 
undetermined. The first and second derivatives are 


dy PE m—1 dy 


a: me , and da^ m(m — 1)a™-?, (2.7.4) 
respectively. Consequently, substitution yields the differential equation 
d? d 
ax + be + cy = az? -m(m—1)a™-? + bx - ma^! + eg" (2.7.5) 
= am(m — 1)z"" + bmx" + ez" (2.7.6) 
= [am(m — 1) + bm + c] x". (2.7.7) 


Thus, y = x” is a solution of the differential equation whenever m is a solution of the 
auxiliary equation 


am(m —1)+bm+c=0, or am? +(b—a)m+c=0. (2.7.8) 


At this point we must consider three different cases that depend upon the values of a, b, 
and c. 


e Distinct real roots 


Let mı and m» denote the real roots of Equation 2.7.8 such that mı 4 m». Then, 
y(x) = x" and ya(x) = x"? (2.7.9) 
are homogeneous solutions to Equation 2.7.2. Therefore, the general solution is 


y(x) = Ciz" + Coa", (2.7.10) 


e Repeated real roots 


If the roots of Equation 2.7.8 are repeated [mı = m2 = —(b — a)/2], then we presently 
have only one solution, y = x". To construct the second solution ys, we use reduction in 
order. We begin by first rewriting the Euler-Cauchy equation as 


d^y b dy c 
H t = 0. 2.7.11 
dz? axda ^ ax?” : ( ) 
Letting P(x) = b/(ax), we have 

e7 [I0/(22)) dx e- (b/a) ln(z) 
yalx) = x" / — (my dz = y" J a dx (2.7.12) 
Sg | ae ale dx = rU Justum dx (2.7.13) 
En a =2"™ n(z). (2.7.14) 


x 
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'The general solution is then 
y(x) = Ciz" + Cox" ln(x). (2.7.15) 
For higher-order equations, if mı is a root of multiplicity k, then it can be shown that 
zz" In(z), 2 In(z)]?,..., 2 [In(z)]^-! 


are the k linearly independent solutions. Therefore, the general solution of the differential 
equation equals a linear combination of these k solutions. 


e Conjugate complex roots 


If the roots of Equation 2.7.8 are the complex conjugate pair m4 = «a + if, and mg = 
a — ip, where a and f are real and 6 > 0, then a solution is 


y(z) = Cirt? 4 Coret. (2.7.16) 
However, because x’? = [el^ (9| = ei? ale) we have by Euler's formula 
a’? = cos [0 In(z)] + isin [0 In(z)] , (2.7.17) 


and 
z ? = cos [0 In(x)] — isin [0 In(a)]. (2.7.18) 


Substitution into Equation 2.7.16 leads to 
y(x) = Caz? cos [8 In(z)] + Cax? sin [8 In(x)) , (2.7.19) 
where C3 = C, + C5, and C4 = iC, — iC. 
e Example 2.7.1 
Let us find the general solution to 
z?^y" + day’ — 12y = ln(x) (2.7.20) 


by the method of undetermined coefficients and variation of parameters. 
In the case of undetermined coefficients, we begin by letting t = ln(x) and y(x) = Y(t). 
Substituting these variables into Equation 2.7.20, we find that 


Y" + AY' — 12Y =t. (2.7.21) 
'The homogeneous solution to Equation 2.7.21 is 
Yu(t) = A'e + B'e”, (2.7.22) 


while the particular solution is 
Y,(t) 2 Ct+ D (2.7.23) 


from the method of undetermined coefficients. Substituting Equation 2.7.23 into Equation 
2.7.21 yields C = -5 and D = -4. Therefore, 


Y (t) = A’e™™ + B'e” — bt- h, (2.7.24) 
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Or 


A 
y(z) = -5 + Ba — ig n(x) — gg. (2.7.25) 


To find the particular solution via variation of parameters, we use the homogeneous 
solution 


A 
ya(x) == + Bz? (2.7.26) 
£ 
to obtain y1(z) = x° and ya(x) = z?. Therefore, 


Yp(x) = x Pus (a) + a^ ua (a). (2.7.27) 


Substitution of Equation 2.7.27 in Equation 2.7.20 yields the system of equations: 


z uj (x) + z^u)(z) = 0, (2.7.28) 
and 
—6z^"u, (x) + 2zu,(z) = In(z)/z?. (2.7.29) 
Solving for wu, (a) and u% (x), 
z? ln(x ln(x 
ui (£) = — T ). and Usa) = E (2.7.30) 


'The solutions of these equations are 


z95In(r) zê ln(x) 1 
u(x) = — 18 t 288' and U2(x) = — l2 3a (2.7.31) 
The general solution then equals 
= A B 2 —6 2 — A LB 2 1 l 1 
y(x) = ,í* Bete uy(x) + z^ux(x) = z 1 Bt i nia) (2.7.32) 


We can verify this result by using the symbolic toolbox in MATLAB. Typing the com- 
mand: 


dsolve(?x^2*D2y45*x*Dy-12*y-log(x)?,?x?) 
yields 


ans = 
-1/12*1og(x)-1/364C1*x^24C2/x^6 


Problems 


Find the general solution for the following Euler-Cauchy equations valid over the domain 
(—oo, 00). Then check your answer by using dsolve in MATLAB. 


Lay tay —y=0 — 2 ay” + 2ay’ 29-0 3-99 Ayo 
4. xy" — zy +y —0 5. ay" + 3zy' +y =0 6. a?y" — 3ay' + 4y — 0 
7. z2y" — y c 5y =0 8. Az?" + 8ry' + 5y = 0 9. z?y" zy +y=0 


10. z2y" — 3zy' + 13y 2 0. 11. z3y"" — 2z?y" — 2xy' + 8y=0 12. z?y" — 2zy/ — 4y — x 
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2.8 PHASE DIAGRAMS 


In Section 1.6 we showed how solutions to first-order ordinary differential equations 
could be qualitatively solved through the use of the phase line. This concept of qualita- 
tively studying differential equations showed promise as a method for deducing many of 
the characteristics of the solution to a differential equation without actually solving it. In 
this section we extend these concepts to second-order ordinary differential equations by 
introducing the phase plane. 

Consider the differential equation 


x” + sgn(x) = 0, (2.8.1) 


where the signum function is defined by Equation 11.2.11. Equation 2.8.1 describes, for 
example, the motion of an infinitesimal ball rolling in a *V?-shaped trough in a constant 
gravitational field.® 

Our analysis begins by introducing the new dependent variable v = x’ so that Equation 


2.8.1 can be written d 
vL + sgn(x) — 0, (2.8.2) 
dx 


since , 
dr dv  dxdv dv 
E = = = 2.8.3 
7 dB dt dtdx ‘dx eeu 
Equation 2.8.2 relates v to z and t has disappeared explicitly from the problem. Integrating 
Equation 2.8.2 with respect to z, we obtain 


nz + f sent) dx — C, (2.8.4) 


Or 
iv? + |x| = C. (2.8.5) 


Equation 2.8.5 expresses conservation of energy because the first term on the left side of 
this equation expresses the kinetic energy while the second term gives the potential energy. 
The value of C depends upon the initial condition z(0) and v(0). Thus, for a specific initial 
condition, our equation gives the relationship between x and v for the motion corresponding 
to the initial condition. 

Although there is a closed-form solution for Equation 2.8.1, let us imagine that there 
is none. What could we learn from Equation 2.8.5? 

Equation 2.8.5 can be represented in a diagram, called a phase plane, where x and v 
are its axes. A given pair of (z, v) is called a state of the system. A given state determines 
all subsequent states because it serves as initial conditions for any subsequent motion. 

For each different value of C, we will obtain a curve, commonly known as phase paths, 
trajectories, or integral curves, on the phase plane. In Figure 2.8.1, we used the MATLAB 
script 
clear 
^ set up grid points in the (x,v) plane 
[x,v] = meshgrid(-5:0.5:5,-5:0.5:5); 
^ compute slopes 


8 See Lipscomb, T., and R. E. Mickens, 1994: Exact solution to the axisymmetric, constant force 
oscillator equation. J. Sound Vib., 169, 138-140. 
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Figure 2.8.1: Phase diagram for the differential equation, Equation 2.8.1. 


dxdt = v; dvdt = -sign(x); 

^ find magnitude of vector [dxdt,dydt] 

L = sqrt (dxdt.*dxdt + dvdt.*dvdt) ; 

^4 plot scaled vectors 
quiver(x,v,dxdt./L,dvdt./L,0.5); axis equal tight 
hold 

^ contour trajectories 

contour(x,v,v.*v/2 + abs(x),8) 

h = findobj(’Type’,’patch’); set(h, ’Linewidth’ ,2); 
xlabel(’x’,’Fontsize’,20); ylabel(’v’,’Fontsize’ ,20) 


to graph the phase plane for Equation 2.8.1. Here the phase paths are simply closed, oval- 
shaped curves that are symmetric with respect to both the x and v phase space ares. Each 
phase path corresponds to a particular possible motion of the system. Associated with each 
path is a direction, indicated by an arrow, showing how the state of the system changes as 
time increases. 

An interesting feature on Figure 2.8.1 is the point (0,0). What is happening there? 
In our discussion of phase line, we sought to determine whether there were any equilibrium 
or critical points. Recall that at an equilibrium or critical point the solution is constant 
and was given by x’ = 0. In the case of second-order differential equations, we again have 
the condition x’ = v = 0. For this reason, equilibrium points are always situated on the 
abscissa of the phase diagram. 

The condition x’ = 0 is insufficient for determining critical points. For example, when 
a ball is thrown upward, its velocity equals zero at the peak height. However, this is clearly 
not a point of equilibrium. Consequently, we must impose the additional constraint that 
xz” = v' = 0. In the present example, equilibrium points occur where x’ = v = 0 and 
v' = —sgn(x) = 0 or x = 0. Therefore, the point (0,0) is the critical point for Equation 
2.8.1. 

The closed curves immediately surrounding the origin in Figure 2.8.1 show that we 
have periodic solutions there because on completing a circuit, the original state returns and 
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Figure 2.8.2: Phase diagram for a simple pendulum. 


the motion simply repeats itself indefinitely. 

Once we have found an equilibrium point, an obvious question is whether it is stable 
or not. To determine this, consider what happens if the initial state is displaced slightly 
from the origin. It lands on one of the nearby closed curves and the particle oscillates with 
small amplitude about the origin. Thus, this critical point is stable. 

In the following examples, we further illustrate the details that may be gleaned from a 
phase diagram. 


e Example 2.8.1 
'The equation describing a simple pendulum is 
ma?0" + mgasin(0) = 0, (2.8.6) 


where m denotes the mass of the bob, a is the length of the rod or light string, and g is the 
acceleration due to gravity. Here the conservation of energy equation is 
ima?" — mga cos(0) = C. (2.8.7) 
Figure 2.8.2 is the phase diagram for the simple pendulum. Some of the critical points 
are located at 0 = +2nz, n = 0,1,2,..., and 0' = 0. Near these critical points, we have 
closed patterns surrounding these critical points, just as we did in the earlier case of an 
infinitesimal ball rolling in a *V?-shaped trough. Once again, these critical points are 
stable and the region around these equilibrium points corresponds to a pendulum swinging 
to and fro about the vertical. On the other hand, there is a new type of critical point at 
= +(2n—1)7,n = 0,1,2,... and 0' = 0. Here the trajectories form hyperbolas near these 
equilibrium points. Thus, for any initial state that is near these critical points, we have 
solutions that move away from the equilibrium point. This is an example of an unstable 
critical point. Physically these critical points correspond to a pendulum that is balanced 
on end. Any displacement from the equilibrium results in the bob falling from the inverted 
position. 
Finally, we have a wavy line as 0' — +00. This corresponds to whirling motions of the 
pendulum where 60" has the same sign and 0 continuously increases or decreases. 


e Example 2.8.2: Damped harmonic oscillator 
Consider the ordinary differential equation 


x" -- 2a! + 5a = 0. (2.8.8) 
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Figure 2.8.3: Phase diagram for the damped harmonic oscillator, Equation 2.8.8. 
'The exact solution to this differential equation is 
x(t) = e™* [Acos(2t) + Bsin(2t)], (2.8.9) 


and 
z'(t) = 2e^* [B cos(2t) — Asin(2t)] — e™* [Acos(2t) + B sin(2t)]. (2.8.10) 


To construct its phase diagram, we again define v = x’ and replace Equation 2.8.8 with 
v’ = —2v — 5x. The MATLAB script 


clear 
^ set up grid points in the x,x? plane 
[x,v] = meshgrid(-3:0.5:3,-3:0.5:3); 
^ compute slopes 
dxdt = v; dvdt = -2*v - 5*x; 
% find length of vector 
L = sqrt (dxdt.*dxdt + dvdt.*dvdt) ; 
^4 plot direction field 
quiver (x,v,dxdt./L,dvdt./L,0.5); axis equal tight 
hold 
^4 compute x(t) and v(t) at various times and a’s and b’s 
for b = -3:2:3; for a = -3:2:3; 
t = [-5:0.1:5]; 
xx = exp(-t) .* (a*cos(2*t) + b*sin(2*t)); 
vv = 2 * exp(-t) .* (b*cos(2*t) - a*sin(2*t)) - xx; 
^4 plot these values 
plot(xx,vv) 
end; end; 
xlabel(’x’,’Fontsize’,20); ylabel(’v’,’Fontsize’ ,20) 
was used to construct the phase diagram for Equation 2.8.8 and is shown in Figure 2.8.3. 
Here the equilibrium point is at x = v = 0. This is a new type of critical point. It is called 
a stable node because all slight displacements from this critical point eventually return to 
this equilibrium point. 
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Problems 


1. Using MATLAB, construct the phase diagram for x” — 32’ + 2x = 0. What happens 
around the point z — v — 0? 


2. Consider the nonlinear differential equation z” = x? — x. This equation arises in the 


study of simple pendulums with swings of moderate amplitude. 


(a) Show that the conservation law is 


What is special about C = 0 and C 7 


(b) Show that there are three critical points: « = 0 and « = +1 with v = 0. 
(c) Using MATLAB, graph the phase diagram with axes x and v. 


For the following ordinary differential equations, find the equilibrium points and then clas- 
sify them. Use MATLAB to draw the phase diagrams. 


1 |x| > 2 
fie: 7 " p I = , ? 
3. a" = 2x 4. a^ + sgn(z)r = 0 deg s |z| < 2. 


2.9 NUMERICAL METHODS 


When differential equations cannot be integrated in closed form, numerical methods 
must be employed. In the finite difference method, the discrete variable x; or t; replaces the 
continuous variable x or t and the differential equation is solved progressively in increments 
h starting from known initial conditions. The solution is approximate, but with a sufficiently 
small increment, you can obtain a solution of acceptable accuracy. 

Although there are many different finite difference schemes available, we consider here 
only two methods that are chosen for their simplicity. The interested student may read any 
number of texts on numerical analysis if he or she wishes a wider view of other possible 
schemes. 

Let us focus on second-order differential equations; the solution of higher-order differen- 
tial equations follows by analog. In the case of second-order ordinary differential equations, 
the differential equation can be rewritten as 


g” = f(z,z',t), zo =2(0), 2) = x'(0), (2.9.1) 


where the initial conditions xo and xj are assumed to be known. 
For the present moment, let us treat the second-order ordinary differential equation 


x" = f(x,t), zo = x(0), ames. (2.9.2) 


The following scheme, known as the central difference method, computes the solution from 
Taylor expansions at zi+ı and rji: 


Tjqi = Ti + hz, + iW^x] + Enea’ + O(h*) (2.9.3) 
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and 


a1 = ti — hz, + $h)a] — thi al!” + O(n‘), (2.9.4) 


where h denotes the time interval At. Subtracting and ignoring higher-order terms, we 
obtain 
/ Tipi — Vi-1 
v Em IRE Uode 2.9.5 
af, = TH (2.9.5) 


Adding Equation 2.9.3 and Equation 2.9.4 yields 


cu a Ti 2x d dia 


= 72 


(2.9.6) 


In both Equation 2.9.5 and Equation 2.9.6 we ignored terms of O(h?). After substituting 
into the differential equation, Equation 2.9.2, Equation 2.9.6 can be rearranged to 


C41 = 22; See Pe Ee h? f (xi, ti), i = 1, (2.9.7) 


which is known as the recurrence formula. 

Consider now the situation when 7 = 0. We note that although we have xo we do 
not have x_;. Thus, to start the computation, we need another equation for xı. This is 
supplied by Equation 2.9.3, which gives 


X1 = Xo + ha + sh? xg = Xo + ha + sh? f (xo, to). (2.9.8) 


Once we have computed 2, then we can switch to Equation 2.9.6 for all subsequent calcu- 
lations. 

In this development we have ignored higher-order terms that introduce what is known 
as truncation errors. Other errors, such as round-off errors, are introduced due to loss 
of significant figures. These errors are all related to the time increment h in a rather 
complicated manner that is investigated in numerical analysis books. In general, better 
accuracy is obtained by choosing a smaller h, but the number of computations will then 
increase together with errors. 


e Example 2.9.1 


Let us solve z” — 4x = 2t subject to z(0) = z'(0) = 1. The exact solution is 


a(t) = £e + 4e” — it. (2.9.9) 
The MATLAB script 
clear 
% test out different time steps 
for i= 1:3 


^4 set up time step increment and number of time steps 
h = 1/10^i; n = 10/h; 
^ set up initial conditions 
t-zeros(nti,1); t(1) = 0; x(1) = 1; x_exact(1) = 1; 
^ define right side of differential equation 
f = inline(^4*xx42*tt? ,^ tt? ,? xx?) ; 
^ set up difference arrays for plotting purposes 
diff = zeros(n,1); t plot = zeros(n,1); 
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Figure 2.9.1: The numerical solution of z^ — 4x = 2t when z(0) = z'(0) = 1 using a simple finite difference 
approach. 


^4 compute first time step 
t(2) = t(1) + h; x(2 = x(1) + h + 0.5*h*h*f(t(1),x(1)); 
x_exact(2) = (7/8) *exp(2*t (2) )+(1/8) *exp(-2*t (2) )-t (2) /2; 
t_plot(1) = t(2); 
diff (1)=x(2)-x_exact(2); diff(1)=abs (diff (1) /x_exact(2)); 
^ compute the remaining time steps 
for k = 2:n 
t(k+1) = t(k) + h; t_plot(k) = t(k*1); 
x(k+1) = 2*x(k) - x(k-1) + h*h*f(t(k),x(k)); 
x_exact(k+1) = (7/8) *exp(2*t (k+1))+(1/8) *exp(-2*t (k+1)) 
- t(k*1)/2; 


diff(k) = x(k*1) - x exact(k*1); 
diff(k) = abs(diff(k) / x exact(k*1)); 
end 


% plot the relative error 
semilogy(t plot,diff,?-?) 
hold on 
num = 0.2*n; 
text (3*i,diff(num),[’h = ’,num2str(h)],’Fontsize’,15,... 
?HorizontalAlignment’,’right’,’VerticalAlignment’, ’bottom’) 
xlabel (’TIME’ ,’Fontsize’ ,20); 
ylabel(’ |RELATIVE ERROR|’,’Fontsize’ , 20); 
end 
implements our simple finite difference method of solving a second-order ordinary differential 


equation. In Figure 2.9.1 we have plotted results for three different values of the time step. 
As our analysis suggests, the relative error is related to h?. 


An alternative method for integrating higher-order ordinary differential equations is 
Runge-Kutta. It is popular because it is self-starting and the results are very accurate. 

For second-order ordinary differential equations, this method first reduces the differen- 
tial equation into two first-order equations. For example, the differential equation 


— = / 
gtd RS V P (2.9.10) 
m 
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Figure 2.9.2: The numerical solution of x” — 4x = 2t when z(0) = z'(0) = 1 using the Runge-Kutta 
method. 


becomes the first-order differential equations 
v=y, y =F(z,y,t). (2.9.11) 


The Runge-Kutta procedure can then be applied to each of these equations. Using a fourth- 
order scheme, the procedure is as follows: 


Tp] = Ti + ch(k + 2ko + 2ka + ka), 2.9.12) 
and 
Vidi = Yi + h(Kı + 2K + 2K3 + K4), 2.9.13) 
where 
ky = Ui; Ki = F(x;, yi, ti), 2.9.14) 
ko = yi + 2K, Ko = F(a; + 3h, ko, t; + 3), 2.9.15) 
k3 = yi T 2K, K3 = F(a; Te Eko, kg, ti ae A), 2.9.16) 
and 
k4 = Yi + Kzh, K4 = F(x; + hka, ka, ti + h). 2.9.17) 
e Example 2.9.2 
The MATLAB script 
clear 
% test out different time steps 
for i = 1:4 


% set up time step increment and number of time steps 
if i==1 h = 0.50; end; if i==2 h = 0.10; end; 
if i==3 h = 0.05; end; if i--4 h = 0.01; end; 
nn = 10/h; 

% set up initial conditions 
t-zeros(nti,1); t(1) = 0; 
x_rk=zeros(nt+1,1); xrk(1) = 1; 
y-rkezeros(n*1,1); y_rk(1) = 1; 

x exact-zeros(n*i,1); x exact(1) = 1; 

^ set up difference arrays for plotting purposes 
t plot = zeros(n,1); diff = zeros(n,1); 

^ define right side of differential equation 
f = inline(?4*xx*2*tt? ,?^ tt? ,^ xx? ,?yy?); 
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for k = 1:n 
t_local = t(k); x local = x rk(k); y local = y rk(k); 
k1 = y_local; K1 = f(t_local,x_local,y_local) ; 
k2 = y_local + h*K1/2; 
K2 = f(t.local + h/2,x local + h*k1/2,k2); 
k3 = y_local + h*K2/2; 
K3 = f(t.local + h/2,x local + h*k2/2,k3); 
k4 = y local + h*K3; K4 = f(t_local + h,x local + h*k3,k4); 
t(k+1) = t_local + h; 
x_rk(k+1) = x_local + (h/6) * (k1+2*k2+2*k3+k4) ; 
y-rk(k*1) = y_local + (h/6) * (K1+2*K2+2*K3+KA) ; 
x_exact(k+1) = (7/8) *exp(2*t (k+1))+(1/8) *exp(-2*t (k*1)) 
- t(k*1)/2; 
t plot(k) = t(k); 
diff(k) = xrk(k*1) - x exact(k*1); 
diff(k) = abs(diff(k) / x exact(k*1)); 
end 


% plot the relative errors 

semilogy(t plot,diff,?-?) 

hold on 

xlabel(C TIME! ,’Fontsize’ ,20); 

ylabel(’ |RELATIVE ERROR|',?Fontsize?,20); 

text (2*i,diff(0.2*n),[’h = ’,num2str(h)],’Fontsize’,15,... 

^"HorizontalAlignment? ,?'right? ,'VerticalAlignment? ,? bottom?) 

end 


was used to resolve Example 2.9.1 using the Runge-Kutta approach. Figure 2.9.2 illustrates 
the results for time steps of various sizes. 


Problems 
In previous sections, you found exact solutions to second-order ordinary differential equa- 


tions. Confirm these earlier results by using MATLAB and the Runge-Kutta scheme to find 
the numerical solution to the following problems drawn from previous sections. 


1. Section 2.1, Problem 1 2. Section 2.1, Problem 5 
3. Section 2.4, Problem 1 4. Section 2.4, Problem 5 
5. Section 2.6, Problem 1 6. Section 2.6, Problem 5 


Project: Pendulum Clock 


In his exposition on pendulum clocks, M. Denny? modeled the system by the second- 
order differential equation in time t: 


0" +00" - w20 = kf (0,0), (1) 


9 Denny, M., 2002: The pendulum clock: A venerable dynamical system. Eur. J. Phys., 23, 449—458. 


96 Advanced Engineering Mathematics with MATLAB 


"0.2 EY 0 01 0.2 

0 
Figure 2.9.3: The phase diagram of the dynamical system given by Equations (1) and (2) by modified 
Euler method. Here the parameters are g — 9.8 m/sec?, L — 1 m, b — 0.22/sec, k — 0.02/sec, and 
At = 0.005. The initial conditions are 6(0) = 7/18 and 0'(0) = 0. 


where 
1/(At), [0| < At/2, 0' >0; 


J00) = { 0, otherwise; (2) 


and w? = g/L — b?/4. Here At denotes some arbitrarily small nondimensional time. In 
Chapter 12 we identify this forcing as the Dirac delta function. 

Using the numerical scheme of your choice, develop a MATLAB code to numerically 
integrate this differential equation. Plot the results as a phase diagram with @ as the 
abscissa and 6’ as the ordinate. What happens with time? What happens as k varies? 
Figure 2.9.3 illustrates the solution. 


Further Readings 


Boyce, W. E., and R. C. DiPrima, 2004: Elementary Differential Equations and Boundary 
Value Problems. Wiley, 800 pp. Classic textbook. 


Ince, E. L., 1956: Ordinary Differential Equations. Dover, 558 pp. The source book on 
ordinary differential equations. 


Zill, D. G., and M. R. Cullen, 2008: Differential Equations with Boundary- Value Problems. 
Brooks Cole, 640 pp. Nice undergraduate textbook. 
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Chapter 3 
Linear Algebra 


Linear algebra involves the systematic solving of linear algebraic or differential equa- 
tions that arise during the mathematical modeling of an electrical, mechanical, or even 
human system where two or more components are interacting with each other. In this 
chapter we present efficient techniques for expressing these systems and their solutions. 


3.1 FUNDAMENTALS OF LINEAR ALGEBRA 


Consider the following system of m simultaneous linear equations in n unknowns 
L1, T2, T3,- Tn: 


01121 + d1222 ++i + ainn = b1, 


0212, + a22£2 d ::: + dant = b2, 


(3.1.1) 
Am1L1 + Am2%2 +` `` + AmnIn = bm, 


where the coefficients a;; and constants b; denote known real or complex numbers. The 
purpose of this chapter is to show how matrix algebra can be used to solve these systems by 
first introducing succinct notation so that we can replace Equation 3.1.1 with rather simple 
expressions, and then by employing a set of rules to manipulate these expressions. In this 
section we focus on developing these simple expressions. 

The fundamental quantity in linear algebra is the matrix. A matrix is an ordered 
rectangular array of numbers or mathematical expressions. We shall use upper case letters 


1 


1 This term was first used by J. J. Sylvester, 1850: Additions to the articles, “On a new class of 
theorems,” and “On Pascal’s theorem." Philos. Mag., Ser. 4, 37, 363-370. 
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to denote them. The m x n matrix 


Q11 012 a13 `: p © Aln 
Q21 422 Q23 - ; ~ 02n 
A= (3.1.2) 
. . . . Qij . . 
Am1 Am2 Am3 " 1 j Amn 


has m rows and n columns. The order (or size) of a matrix is determined by the number 
of rows and columns; Equation 3.1.2 is of order m by n. If m = n, the matrix is a square 
matrix; otherwise, A is rectangular. The numbers or expressions in the array a;j are the 
elements of A and can be either real or complex. When all of the elements are real, A is 
a real matrix. If some or all of the elements are complex, then A is a complex matrix. For 
a square matrix, the diagonal from the top left corner to the bottom right corner is the 
principal diagonal. 

From the limitless number of possible matrices, certain ones appear with sufficient 
regularity that they are given special names. A zero matrix (sometimes called a null matrix) 
has all of its elements equal to zero. It fulfills the role in matrix algebra that is analogous 
to that of zero in scalar algebra. The unit or identity matrix is an n x n matrix having 1’s 
along its principal diagonal and zero everywhere else. The unit matrix serves essentially 
the same purpose in matrix algebra as does the number one in scalar algebra. A symmetric 
matrix is one where aj; = aji for all i and j. 


e Example 3.1.1 


Examples of zero, identity, and symmetric matrices are 


0.0 0 10 3 2 4 
0—-[000], r=(j y and A—-[2 1 0], (3.1.3) 
0.0 0 4 0 5 
respectively. 
A special class of matrices are column vectors and row vectors: 
Tı 
T2 
x=| . |> y=(y yo ` Yna). (3.1.4) 
Tm 


We denote row and column vectors by lower case, boldfaced letters. The length or norm of 
the vector x of n elements is i 
\|x|| = be z) ; (3.1.5) 
k=1 


Two matrices A and B are equal if and only if aj; = bj; for all possible i and j and 
they have the same dimensions. 

Having defined a matrix, let us explore some of its arithmetic properties. For two 
matrices A and B with the same dimensions (conformable for addition), the matrix C = 
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A+ B contains the elements cj; = aij + bij. Similarly, C = A — B contains the elements 
ci; = aij — bij. Because the order of addition does not matter, addition is commutative: 
A-4- B — B- A. 

Consider now a scalar constant k. The product kA is formed by multiplying every 
element of A by k. Thus the matrix kA has elements kaij. 

So far the rules for matrix arithmetic conform to their scalar counterparts. However, 
there are several possible ways of multiplying two matrices together. For example, we might 
simply multiply together the corresponding elements from each matrix. As we will see, the 
multiplication rule is designed to facilitate the solution of linear equations. 

We begin by requiring that the dimensions of A be m x n while for B they are n x p. 
That is, the number of columns in A must equal the number of rows in B. The matrices A 
and B are then said to be conformable for multiplication. If this is true, then C = AB isa 
matrix m x p, where its elements equal 


n 
Cij = X an bij. (3.1.6) 
k=1 


The right side of Equation 3.1.6 is referred to as an inner product of the ith row of A and the 
jth column of B. Although Equation 3.1.6 is the method used with a computer, an easier 
method for human computation is as a running sum of the products given by successive 
elements of the ith row of A and the corresponding elements of the jth column of B. 

The product AA is usually written A?; the product AAA, A3, and so forth. 


e Example 3.1.2 


If 
- (7 a aud 5-(i 25 (3.1.7) 
then 
(CDOS (710 - ((4)) (11. 14 
AB - (layne) Dattan) "(or 8): (3.1.8) 


Checking our results using MATLAB, we have that 
>> A = [-1 4; 2 -3]; 

>> B= [1 2; 3 4]; 

>> C = A*B 


C= 
11 14 
-7 —8 


Note that there is a tremendous difference between the MATLAB command for matrix 
multiplication * and element-by-element multiplication .*. 


Matrix multiplication is associative and distributive with respect to addition: 
(kA)B = k(AB) = A(kB), (3.1.9) 


A(BC) = (AB)C, (3.1.10) 
(A+ B)C = AC + BC, (3.1.11) 
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and 
C(A+ B) 2 CA CB. (3.1.12) 


On the other hand, matrix multiplication is not commutative. In general, AB 4 BA. 
e Example 3.1.3 


Does AB — BA if 


1 0 1 1 
= = ? 
am (19). ant ne (1 2) uu 


a sus 
SEC snas 


AB + BA. (3.1.16) 


Because 


and 


e Example 3.1.4 
Given 


1 1 —1 1 
P TIL qum 
find the product AB. 


Performing the calculation, we find that 


ap=(4 3 G A)-G 3 (3.1.18) 


The point here is that just because AB = 0, this does not imply that either A or B equals 
the zero matrix. 


We cannot properly speak of division when we are dealing with matrices. Nevertheless, 
a matrix A is said to be nonsingular or invertible if there exists a matrix B such that 
AB = BA = I. This matrix B is the multiplicative inverse of A or simply the inverse of A, 
written A^!. An n x n matrix is singular if it does not have a multiplicative inverse. 


e Example 3.1.5 


If 
101 
A=|3 3 4], (3.1.19) 
2 2 3 
let us verify that its inverse is 
DL. 2k -—3 
A2|-1 1 -1i[. (3.1.20) 
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We perform the check by finding AA~! or ATHA, 
10 1 1 2 —3 100 
AA -—|[3.3 A -1 1 -1)=[0 10]. (3.1.21) 
2 2 3 0 -2 3 0 0 1 


In a later section we will show how to compute the inverse, given A. 


Another matrix operation is transposition. The transpose of a matrix A with dimen- 
sions m x n is another matrix, written AT, where we interchanged the rows and columns 
from A. In MATLAB, AT is computed by typing A’. Clearly, (A7)7 = A as well as 
(A+ B)? = AT + BT, and (kA)? = kAT. If A and B are conformable for multiplica- 
tion, then (AB)? = B? AT. Note the reversal of order between the two sides. To prove this 
last result, we first show that the results are true for two 3 x 3 matrices A and B and then 
generalize to larger matrices. 

Having introduced some of the basic concepts of linear algebra, we are ready to rewrite 
Equation 3.1.1 in a canonical form so that we can present techniques for its solution. We 
begin by writing Equation 3.1.1 as a single column vector: 


Q113231 + 041232 + ccc + Qlntn bi 
4231321 + 02232 + ° + Qnn b» 
: e ss (3.1.22) 
QmitXi + Am2%Q + + Amnn bm 
We now use the multiplication rule to rewrite Equation 3.1.22 as 
ail  Q12 Qin Ti bi 
a21 Q22 Q2n, T2 b» 
E T =|], (3.1.23) 
Am1 Am2 Amn In bm 
or 
Ax =b, (3.1.24) 


where x is the solution vector. If b = 0, we have a homogeneous set of equations; otherwise, 
we have a nonhomogeneous set. In the next few sections, we will give a number of methods 
for finding x. 


e Example 3.1.6: Solution of a tridiagonal system 


A common problem in linear algebra involves solving systems such as 


biyi + eio = di, (3.1.25) 

a2y1 + b2ya + Coys = do, (3.1.26) 

an—1YN-2 + bn-1yn-1 + Cn-1YN = AN-1, (3.1.27) 
bNUN-1 + CNUN = dy. (3.1.28) 
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Such systems arise in the numerical solution of ordinary and partial differential equations. 
We begin by rewriting Equation 3.1.25 through Equation 3.1.28 in the matrix notation: 


bi Ci 0 (0) 0 0 yı dı 
ag bo C2 0 0 0 Y2 d» 
0 a3 b3 0 0 0 Ua da 
E = MD (3.1.29) 
0 0 0 +++ aw. by- CN- UN-1 dn—1 
0 0 0 $us 0 an bn UN dyn 


The matrix in Equation 3.1.29 is an example of a banded matrix: a matrix where all of 
the elements in each row are zero except for the diagonal element and a limited number 
on either side of it. In the present case, we have a tridiagonal matrix in which only the 
diagonal element and the elements immediately to its left and right in each row are nonzero. 
Consider the nth equation. We can eliminate an by multiplying the (n — 1)th equation 
by an/bn—ı and subtracting this new equation from the nth equation. The values of b,, and 
dn become 
b, = bn = AnCn—1/bn-1, and di, = dn = andn—1/bn-1 (3.1.30) 
for n = 2,3,..., N. The coefficient cn is unaffected. Because elements a4 and cy are never 
involved, their values can be anything or they can be left undefined. The new system of 
equations may be written 


oc 0-0 0 0 Vi 1 

0 b c 0 0 0 Y2 A 

0 0 5 0 0 0 0 y : 

: ; : : i = s . (3.1.31) 
0 0 0 es 0 bui CN-1 YN-1 Nd 

0 0 0 :-- 0 0 by YN dy 


The matrix in Equation 3.1.31 is in upper triangular form because all of the elements 
below the principal diagonal are zero. This is particularly useful because yn can be computed 
by back substitution. That is, we first compute yy. Next, we calculate yw, in terms of 
yw. The solution yy—2 can then be computed in terms of yy and yw-.1. We continue this 
process until we find y; in terms of yw, yn—1,---,Y2- In the present case, we have the rather 
simple: 
and Yn = (di, — Cn n41)/05 


yn = dy /bN; (3.1.32) 


for n= N—1,N—2,...,2,1. 

As we shall show shortly, this is an example of solving a system of linear equations 
by Gaussian elimination. For a tridiagonal case, we have the advantage that the solution 
can be expressed in terms of a recurrence relationship, a very convenient feature from 
a computational point of view. This algorithm is very robust, being stable? as long as 
la; + ci| < |b;|. By stability, we mean that if we change b by Ab so that x changes by Ax, 
then ||Ax|| < Me, where || Ab|| € e, 0< M < co, for any N. 


2 Torii, T., 1966: Inversion of tridiagonal matrices and the stability of tridiagonal systems of linear 
systems. Tech. Rep. Osaka Univ., 16, 403-414. 
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e Example 3.1.7: Linear transformation 
Consider a set of linear equations 


Yı = 01121 + 01212 + A13%3 + 01424, 


Y2 = 2121 + A22X2Q + A23%3 + A144, (3.1.33) 


Y3 = 43121 + 03122 + 033133 + 03424. 


Each of the right-side expressions is called a linear combination of £1, £2, £3, and z4: a sum 
where each term consists of a constant times x; raised to the first power. An expression 
such as a41127 + à1222 + a13%3 + a1477 is an example of a nonlinear combination. Note that 
we are using 4 values of x; to find only 3 values of y;. 

If we were given values of x;, we can determine a set of values for y; using Equation 
3.1.33. Such a set of linear equations that yields values of y; for given z;'s is called a linear 
transform of x into y. The point here is that given x, the corresponding y will be evaluated. 

Matrix notation and multiplication are very convenient in expressing linear transfor- 
mation. In the present case, we would have that 


T2 G11 412 Q13 414 Ui 
y = Ax, where x= zh A= | an G22 az az |, y= | yo]. (3.1.34) 
d 431 432 433  Q34 Ua 


In general, a transformation A(x) is a linear transformation that satisfies two conditions: 
(1) A(x + y) = A(x) + A(y) and (2) A(kx) = KA(x), where k is a scalar. 


Problems 
Given A= (1 )rand B= (5 z) na 
1A+B,B+A 2.A—B, B—A 3. 3A — 2B, 3(2A — B) 
4. AT BU ABE YE 5. (A+ B)?, AT + BT 6. B+ BT, B- BT 
7. AB,AT B,BA,BT A 8. A2, B? 9. BBT, BT B 
10. 4? - 3A +I 11. A -2A 12. A* — A4? +21 


by hand and using MATLAB. 


Can multiplication occur between the following matrices? If so, compute it. 


3 51 2 1 —2 4 142 3 2 
13. ( ) 4 1 14. | —4 6](1 2 3) 15.[0 0 4 1 1 
1 3 —6 1 0 1 2 2 1 


4 6\(1 3 6 6 4 2\/3 14 
ie. (1 2s 2 J im 2 =e 0 S) 


1 1 
If A= | 1 2], verify that 
3 1 
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18. 7A = 4A + 3A, 19. 10A = 5(2A), 20. (AT)T =A 
by hand and using MATLAB. 


2 1 -2 11 . 
ra=(3 ess and = (1 ELIT 


21. (A-B) CC 2 A - (B 4- C), 22. (AB)C = A(BC), 


23. A(B+C) = AB + AC, 24. (A + B)C = AC + BC 
by hand and using MATLAB. 


Verify that the following A^! are indeed the inverse of A: 


010 010 
5 a-(* 23 -(5 3 26. A=|1 0 0| At-2[10 0 
001 00 1 

by hand and using MATLAB. 


Write the following linear systems of equations in matrix form: Ax = b. 


27. 124—225 = 5 321 -z9-—1 
28. 221 cx 4x3 =2 Ax, an 2x2 + 5x3 —6 621 ian 322 + 5x3 —2 
29. Zot 2x3 or 324 =2 321 a 4x3 = 4x4 =5 £1 + T2 + £3 + T4 = 3 


22, — 9159 + T3 — 324 =T 
3.2 DETERMINANTS 


Determinants appear naturally during the solution of simultaneous equations. Con- 
sider, for example, two simultaneous equations with two unknowns 2, and 22, 


a1171 + à41222 = by, (3.2.1) 


and 


42171 + a22%2 = bə. (3.2.2) 


The solution to these equations for the value of zı and x» is 


b — a13b 
ns 1022 — 41292 (3.2.3) 


, 
011022 — 012021 


and " " 
à11— à 
Bye ce HC REN. (3.2.4) 
011022 — 012021 

Note that the denominator of Equation 3.2.3 and Equation 3.2.4 is the same. This term, 
which always appears in the solution of 2 x 2 systems, is formally given the name of deter- 
minant and written 
Q11 Q12 
421 Q22 


det(A) = = 411422 — 012021. (3.2.5) 
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MATLAB provides a simple command det (A), which computes the determinant of A. 
For example, in the present case, 
>> A= [2 -1 2; 132; 5 1 6]; 
>> det (A) 
ans = 

0 

Although determinants have their origin in the solution of systems of equations, any 
square array of numbers or expressions possesses a unique determinant, independent of 
whether it is involved in a system of equations or not. This determinant is evaluated (or 
expanded) according to a formal rule known as Laplace’s expansion of cofactors. The 
process revolves around expanding the determinant using any arbitrary column or row of 
A. If the ith row or jth column is chosen, the determinant is given by 


det (A) = ayAjy + Q42 Aj +. + Qin Ain = ay; Ay; + 025 Ag; fee Hp OnjAnj; (3.2.6) 
where A;;, the cofactor of aij, equals (—1)/*7 M;;. The minor Mj; is the determinant of the 
(n — 1) x (n — 1) submatrix obtained by deleting row i, column j of A. This rule, of course, 
was chosen so that determinants are still useful in solving systems of equations. 


e Example 3.2.1 


Let us evaluate 


2 -1 2 
1 3 2 
5 1 6 
by an expansion in cofactors. 
Using the first column, 
2 -1 2 
i 3 2) -3([.1)7 a +1(-1)° ot +5(—1)4 E (3.2.7) 
1 6 1 6 3 2 
5 1 6 
= 2(16) — 1(—8) + 5(—8) = 0. (3.2.8) 


The greatest source of error is forgetting to take the factor (—1)/*7 into account during the 
expansion. 


Although Laplace's expansion does provide a method for calculating det(A), the num- 
ber of calculations equals n!. Consequently, for hand calculations, an obvious strategy is to 
select the column or row that has the greatest number of zeros. An even better strategy 
would be to manipulate a determinant with the goal of introducing zeros into a particular 
column or row. In the remaining portion of this section, we show some operations that may 
be performed on a determinant to introduce the desired zeros. Most of the properties follow 
from the expansion of determinants by cofactors. 


e | Rule 1 |: For every square matrix A, det( AT) = det(A). 


? Laplace, P. S., 1772: Recherches sur le calcul intégral et sur le système du monde. Hist. Acad. R. 
Sci., II* Partie, 267-376. Œuvres, 8, pp. 369—501. See Muir, T., 1960: The Theory of Determinants in 
the Historical Order of Development, Vol. I, Part 1, General Determinants Up to 1841. Dover Publishers, 
pp. 24-33. 
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'The proof is left as an exercise. 


e | Rule 2 |: If any two rows or columns of A are identical, det(A) = 0. 


To see that this is true, consider the following 3 x 3 matrix: 


bi bi C1 
b» b» C2 | = cı (b2b3 - b3b2) LI c2(b1b3 EN b3b1) + c3(b1b2 "E b5b1) = 0. (3.2.9) 
b3 b3 c3 


e | Rule 3 |: The determinant of a triangular matrix is equal to the product of its diagonal 
elements. 


If A is lower triangular, successive expansions by elements in the first column give 


a m 0 022 DE 0 
det(A) =| . l l . | =an| : : © | +++ = 41d29°++ Ann. (3.2.10) 


Qn2 °** Ann 
Qni  Qn2 `° Ann 


If A is upper triangular, successive expansions by elements of the first row prove the prop- 
erty. 


e | Rule 4 |: If a square matrix A has either a row or a column of all zeros, then det(A) = 0. 


The proof is left as an exercise. 


e | Rule 5 |: If each element in one row (column) of a determinant is multiplied by a number 
c, the value of the determinant is multiplied by c. 


Suppose that |B| has been obtained from |A| by multiplying row i (column j) of |A| by 
c. Upon expanding |B| in terms of row i (column j), each term in the expansion contains 
c as a factor. Factor out the common c, and the result is just c times the expansion |A| by 
the same row (column). 


e | Rule 6 |: If each element of a row (or a column) of a determinant can be expressed as a 
binomial, the determinant can be written as the sum of two determinants. 


To understand this property, consider the following 3 x 3 determinant: 


artdı b c a, b aà di b & 
az +d bo c9|-—|a9 bə c2|+|d2 bə csl. (3.2.11) 
a3 T d3 bs C3 3 bs C3 d3 bs C3 


The proof follows by expanding the determinant by the row (or column) that contains the 
binomials. 


e Rule 7 |: If B is a matrix obtained by interchanging any two rows (columns) of a square 
matrix A, then det(B) = — det(A). 
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The proof is by induction. It is easily shown for any 2 x 2 matrix. Assume that this 
rule holds for any (n — 1) x (n — 1) matrix. If A is n x n, then let B be a matrix formed by 
interchanging rows 7 and j. Expanding |B| and |A| by a different row, say k, we have that 

|B] = 35 (-0 *b. My, and |A| = Y (C0 ars Nis, (3.2.12) 


s—l s—l 


where My, and N;, are the minors formed by deleting row k, column s from |B| and |A], 


respectively. For s = 1,2,...,n, we obtain N;,, and Mks by interchanging rows 7 and j. By 
the induction hypothesis and recalling that Ni; and My; are (n— 1) x (n— 1) determinants, 
Ny, = —Mks for s = 1,2,...,n. Hence, |B| = —|A]. Similar arguments hold if two columns 


are interchanged. 


e | ule 8 |: If one row (column) of a square matrix A equals to a number c times some 
other row (column), then det(A) = 0. 


Suppose one row of a square matrix A is equal to c times some other row. If c = 0, 
then |A| = 0. If c 4 0, then |A| = c|B|, where |B| = 0 because |B| has two identical rows. 
A similar argument holds for two columns. 


e | Rule 9 |: The value of det(A) is unchanged if any arbitrary multiple of any line (row or 
column) is added to any other line. 


To see that this is true, consider the simple example: 


1 bi C1 cbi bi Ci Q1 Tr cbi bi C1 
a2 b2 co|-F|cb9 09 co|—|aa-- cb bo co], (3.2.13) 
a3 bs C3 cba b3 C3 a3 T cbs bs C3 


where c Z 0. The first determinant on the left side is our original determinant. In the 
second determinant, we again expand the first column and find that 


cbi b Cy by b cy 

cb» b» C2 | = C b» b» C2 = 0. (3.2.14) 

cbs b3 C3 bs bs C3 

e Example 3.2.2 
Let us evaluate 

1 2 3 4 
—1 1 2 3 
1 -1 1 2 
-1 1 -1 5 


using a combination of the properties stated above and expansion by cofactors. 
By adding or subtracting the first row to the other rows, we have that 


1 2 3 4 


4 
3 5 7 
-1 1 2 3| l0 3 5 7 
E lp ee es sale P. = (3.2.15) 
eur 1 «ed 5| uo 3 2 9 
3-5. T 
-|[0 3 5 =3| 3, T j-e (3.2.16) 
r3. m 
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Problems 


Evaluate the following determinants. Check your answer using MATLAB. 


3 12 4 3 0 
i m 73 EN $22 5 48 2 2 
145 5 -2 -4 
i33 Cua oio Qs 0 2 2 
5.4 d duro 6&ji 3 3| 7 8. 
23 5 A 16 1 0 -1 2 —1 1 
1 1 —2 3 -3 2 3 1 


9. Using the properties of determinants, show that 


1 1 1 1 

b d 
a2 p 2 g|=@-a(c- a(d- a)(c— b)(d— b)(d— o). 
a? b cg dæ 


This determinant is called Vandermonde’s determinant. 


10. Show that 


a b+e 1 
b a+c 1|=0. 
c a+b 1 


11. Show that if all of the elements of a row or column are zero, then det(A) = 0. 
12. Prove that det(AT) = det(A). 


3.3 CRAMER'S RULE 


One of the most popular methods for solving simple systems of linear equations is 
Cramer's rule.* It is very useful for 2 x 2 systems, acceptable for 3 x 3 systems, and of 
doubtful use for 4 x 4 or larger systems. 

Let us have n equations with n unknowns, Ax = b. Cramer’s rule states that 


u det(41) ET det(A2) u det(A,) 
T de(A)' PT de(A)' "" T gea" 


zı (3.3.1) 


where A; is a matrix obtained from A by replacing the ith column with b and n is the 
number of unknowns and equations. Obviously, det(A) Z 0 if Cramer's rule is to work. 


^ Cramer, G., 1750: Introduction à l'analyse des lignes courbes algébriques. Geneva, p. 657. 
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To prove? Cramer's rule, consider 


44171 012 Q13 ` Qin 
02111 Q22 Q23 `° Gan 

X1 det(A) = | 43171 Q32 033 ``’  Q3n (3.3.2) 
Ani an2 an3 ubi ann 


by Rule 5 from the previous section. By adding x2 times the second column to the first 
column, 


411%, T 0122. 012. Q13 ^^^ Qin 
2111 T 422172 022 Q23 :** Gan 

x, det(A) = | 493171 T 03212 Q32 033 `’ Q3n |, (3.3.3) 
Anii T Qn2:12  Qn2  Qn3 Ut Ann 


Multiplying each of the columns by the corresponding x; and adding it to the first column 
yields 


à11*1 T 41233 t:::-F ainn A12 G13 co^ Ain 
42134 + à2232 d-::: + d2nX34 Q22 Q23 +++) Aan 

xı det(A) = | 43121 + a3222 + °°: + @3nIn Q32 Q33 co^ azn], (3.3.4) 
Anıtı An2 T2 "aep Annn An2 an3 us. ann 


The first column of Equation 3.3.4 equals Ax and we replace it with b. Thus, 


bi a2 G13 > Gin 
by  d22 G23 `° Gan 
zı det(A) =|3 432 33 o 03» | = det(A), (3.3.5) 
bn Qn2 Qn3 ‘'' Ann 
or det(Ay) 
_ et 1 
qq det(A) (3.3.6) 


provided det(A) 4 0. To complete the proof we do exactly the same procedure to the jth 
column. 


e Example 3.3.1 
Let us solve the following system of equations by Cramer’s rule: 
221 + £2 + 2x3 = —1, (3.3.7) 


zı X3 = —1, (3.3.8) 


5 First proved by Cauchy, L. A., 1815: Mémoire sur les fonctions quine peuvent obtemir que deux valeurs 
égales et de signes contraires par suite des transportations opérées entre les variables qüelles renferment. J. 
l'Ecole Polytech., 10, 29-112. 
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and 
—zı + 3x2 = 2x3 = 1 (3.3.9) 


From the matrix form of the equations, 


2r Ai. 2 zı —1 
1 0 1 z2 |2|-1], (3.3.10) 
-1 3 -2 T3 7 
we have that 
2 1 2 
det(A)=]| 1 0 1 |=1, (3.3.11) 
-1 3 -2 
-1 1 2 
det(A;) 2|-1 0 1 | =2, (3.3.12) 
7 3 -2 
2 -1 2 
det(A3) 2| 1 -1 1|-2l, (3.3.13) 
-1 7 -2 
and 
2 1 -1 
det(A3) 2| 1 0 -1|—-3. (3.3.14) 
-1 3 7 
Finally, 
2 1 — 
gp 2, t= m 1, and z3-— = =—3. (3.3.15) 


You can also use MATLAB to perform Cramer’s rule. In the present example, the script 
is as follows: 
clear; % clear all previous computations 
A= [2 1 2; 101; -1 3 -2]; % input coefficient matrix 
b = [-1 ; -1; 7]; % input right side 


A1 = A; A1(:,1) = b; % compute A 1 
A2 = A; A2(:,2) = b; % compute A2 
A3 = A; A8(:,3) = b; % compute A.3 


^4 compute solution vector 
x = [det(A1), det(A2), det(A3)] / det (A) 


Problems 


Solve the following systems of equations by Cramer’s rule: 


1. xı + 222 = 3, 32, + £2 = 6 

2. 24, + T2 = —3, £ı — %2 = 1 

3. £1 + 219 — 213 = 4, 24, + £2 + £3 = —2, —%, + T2 — 23 = 2 
4. 2241 + 333 — £3 = —1, Z4 — 2% + T3 = 5, 321 — T2 = —2. 


Check your answer using MATLAB. 
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3.4 ROW ECHELON FORM AND GAUSSIAN ELIMINATION 


So far, we assumed that every system of equations has a unique solution. This is not 
necessarily true, as the following examples show. 


e Example 3.4.1 
Consider the system 
zı +x2=2 (3.4.1) 


and 
2x1 4-223 = —1. (3.4.2) 


This system is inconsistent because the second equation does not follow after multiplying 
the first by 2. Geometrically, Equation 3.4.1 and Equation 3.4.2 are parallel lines; they 
never intersect to give a unique xı and z2. 


e Example 3.4.2 


Even if a system is consistent, it still may not have a unique solution. For example, 
the system 
zı 229 = 2 (3.4.3) 


and 


224 ia 2x9 —4 (3.4.4) 


is consistent, with the second equation formed by multiplying the first by 2. However, there 
are an infinite number of solutions. 


Our examples suggest the following: 


Theorem: A system of m linear equations in n unknowns may: (1) have no solution, in 
which case it is called an inconsistent system, or (2) have exactly one solution (called a 
unique solution), or (3) have an infinite number of solutions. In the latter two cases, the 
system is said to be consistent. 


Before we can prove this theorem at the end of this section, we need to introduce some 
new concepts. 

The first one is equivalent systems. Two systems of equations involving the same 
variables are equivalent if they have the same solution set. Of course, the only reason 
for introducing equivalent systems is the possibility of transforming one system of linear 
systems into another that is easier to solve. But what operations are permissible? Also, 
what is the ultimate goal of our transformation? 

From a complete study of possible operations, there are only three operations for trans- 
forming one system of linear equations into another. These three elementary row operations 
are 


(1) interchanging any two rows in the matrix, 
(2) multiplying any row by a nonzero scalar, and 


(3) adding any arbitrary multiple of any row to any other row. 
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Armed with our elementary row operations, let us now solve the following set of linear 
equations: 


vy 3x2 + 7x3 = 2, (3.4.5) 
221 + 4x2 — 323 = —1, (3.4.6) 
and 


We begin by writing Equation 3.4.5 through Equation 3.4.7 in matrix notation: 


1.2857 xy 2 
a 288 z2 )={-1). (3.4.8) 
-1 13 MP ET 2 


The matrix in Equation 3.4.8 is called the coefficient matrix of the system. 
We now introduce the concept of the augmented matriz: a matrix B composed of A 
plus the column vector b or 


EP NE A 
B=[2 4 -3]|-|. (3.4.9) 
-1 13 —21| 2 


We can solve our original system by performing elementary row operations on the augmented 
matrix. Because x; functions essentially as a placeholder, we can omit them until the end 
of the computation. 

Returning to the problem, the first row can be used to eliminate the elements in the 
first column of the remaining rows. For this reason the first row is called the pivotal row 
and the element a1, is the pivot. By using the third elementary row operation twice (to 
eliminate the 2 and —1 in the first column), we have the equivalent system 


Hx z2 
Beo 10 S17) = |. (3.4.10) 
OF 10 su 4 


At this point we choose the second row as our new pivotal row and again apply the third 
row operation to eliminate the last element in the second column. This yields 


fx Lem 
B-[|o0 10 -17,—5|. (3.4.11) 
00 349 


'Thus, elementary row operations transformed Equation 3.4.5 through Equation 3.4.7 into 
the triangular system: 


Ly 3X2 + 7x3 m 2, (3.4.12) 
323 = 9, (3.4.14) 


which is equivalent to the original system. The final solution is obtained by back substitution, 
solving from Equation 3.4.14 back to Equation 3.4.12. In the present case, x3 = 3. Then, 
1022 = 17(3) — 5, or x2 = 4.6. Finally, xı = 312 — 7z3 + 2 = —5.2. 
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In general, if an n x n linear system can be reduced to triangular form, then it has a 
unique solution that we can obtain by performing back substitution. This reduction involves 
n — 1 steps. In the first step, a pivot element, and thus the pivotal row, is chosen from the 
nonzero entries in the first column of the matrix. We interchange rows (if necessary) so that 
the pivotal row is the first row. Multiples of the pivotal row are then subtracted from each 
of the remaining n — 1 rows so that there are 0’s in the (2, 1), ..., (n, 1) positions. In the 
second step, a pivot element is chosen from the nonzero entries in column 2, rows 2 through 
n, of the matrix. The row containing the pivot is then interchanged with the second row 
(if necessary) of the matrix and is used as the pivotal row. Multiples of the pivotal row are 
then subtracted from the remaining n — 2 rows, eliminating all entries below the diagonal 
in the second column. The same procedure is repeated for columns 3 through n — 1. Note 
that in the second step, row 1 and column 1 remain unchanged, in the third step the first 
two rows and first two columns remain unchanged, and so on. 

If elimination is carried out as described, we arrive at an equivalent upper triangular 
system after n — 1 steps. However, the procedure fails if, at any step, all possible choices 
for a pivot element equal zero. Let us now examine such cases. 

Consider now the system 


£1 + 219 + 23 = —1, (3.4.15) 
221 + 4x2 ar 2x3 = —2, (3.4.16) 
zı + 4x2 ED 213 =). (3.4.17) 
Its augmented matrix is 
12 1; -1 
B=|2 4 2|-2). (3.4.18) 
142 2 


iude 
B-[000|0], (3.4.19) 
02 1/3 
Or 
1 2 ep est 
B-[|02 1|3]. (3.4.20) 
0 0 ofo 


The difficulty here is the presence of the zeros in the third row. Clearly any finite numbers 
satisfy the equation 02; +0z2 J- 0x3 = 0 and we have an infinite number of solutions. Closer 
examination of the original system shows an underdetermined system; Equation 3.4.15 and 
Equation 3.4.16 differ by a multiplicative factor of 2. An important aspect of this problem 
is the fact that the final augmented matrix is of the form of a staircase or echelon form 
rather than of triangular form. 

Let us modify Equation 3.4.15 through Equation 3.4.17 to read 


zı + 219 + 23 = —1, (3.4.21) 
221 + 4x2 =r 2x3 = 3, (3.4.22) 
zı + 4x2 es 213 = 2; (3.4.23) 
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then the final augmented matrix is 


1 2 qe 
B=[0 2 1|3 |. (3.4.24) 
0 0 0] 5 


We again have a problem with the third row because 0x4 --0z3--0x3 = 5, which is impossible. 
There is no solution in this case and we have an inconsistent system. Note, once again, that 
our augmented matrix has a row echelon form rather than a triangular form. 

In summary, to include all possible situations in our procedure, we must rewrite the 
augmented matrix in row echelon form. We have row echelon form when: 


(1) The first nonzero entry in each row is 1. 


(2) If row k does not consist entirely of zeros, the number of leading zero entries in 
row k + 1 is greater than the number of leading zero entries in row k. 


(3) If there are rows whose entries are all zero, they are below the rows having 
nonzero entries. 


The number of nonzero rows in the row echelon form of a matrix is known as its rank. In 
MATLAB, the rank is easily found using the command rank( ). Gaussian elimination is 
the process of using elementary row operations to transform a linear system into one whose 
augmented matrix is in row echelon form. 


e Example 3.4.3 


Each of the following matrices is not of row echelon form because they violate one of 
the conditions for row echelon form: 


2.2.3 
021 i 7 ob 3; (3.4.25) 
0 04 


e Example 3.4.4 


The following matrices are in row echelon form: 


1 2 3 146 1340 
pb. dise. dy dis [001 3l. (3.4.26) 
0. 0 1 00 0 0000 


e Example 3.4.5 


Gaussian elimination can also be used to solve the general problem AX — B. One of 
the most common applications is in finding the inverse. For example, let us find the inverse 
of the matrix 


2 2 
A=|-2 -4 4 (3.4.27) 
8 
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by Gaussian elimination. 
Because the inverse is defined by AAT! = J, our augmented matrix is 


4 —2 2|1 0 0 
-2 —4 wd 0]. (3.4.28) 
—4 2 BiG 0 1 


4 —2 2|1 0 0 Za cd 4|0 1 0 
SA —4 40 1 oļ=| 4 mod 00 (3.4.29) 
-4 2 8/001 —4 2 8ļ0 0 1 
-2 —4 4|0 1 0 
=| 4 -2 2/100 (3.4.30) 
Qr 0 10|1 0 1 
-2 —4 4|0 10 
=[{ 0 -10 10|1 2 0 (3.4.31) 
0 0 10/1 01 
- —4 wg 0 
=| 0 -10 0/0 2 -1 (3.4.32) 
0 0 10/1 0 1 
-2 —4 0ļ|-2/5 1 -2/5 
=| o0 xps 0| 0 2 est (3.4.33) 
o o0 10) d 0 A 
-2 0 0ļ]—2/5 1/55 0 
=| 0 S10 0f o 2 ~i (3.4.34) 
o o0 10} 1 0 1 


1 0 0| 1/5 -1/10 0 
=|0 1 of 0  -1/5 1/10]. (3.4.35) 
0 0 1/1/10 0 1/10 


Thus, the right half of the augmented matrix yields the inverse and it equals 


1/5 -1/10 0 
A!-[| 0 -1/5 1/10}. (3.4.36) 
1/10 0 1/10 


MATLAB has the ability of doing Gaussian elimination step by step. We begin by 
typing 
>>% input augmented matrix 
>>aug = [4 -2 2 1 00 ; -2 -4 4 0 1 0;-4 28001]; 
»»rrefmovie(aug); 

The MATLAB command rrefmovie(A) produces the reduced row echelon form of A. 
Repeated pressing of any key gives the next step in the calculation along with a statement 
of how it computed the modified augmented matrix. Eventually you obtain 


A= 
1 0 0 1/5 -1/10 0 
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0 1 0 0 -1/5 1/10 
0 0 1 1/10 0 1/10 


You can read the inverse matrix just as we did earlier. 


Gaussian elimination may be used with overdetermined systems. Overdetermined sys- 
tems are linear systems where there are more equations than unknowns (m > n). These 
systems are usually (but not always) inconsistent. 


e Example 3.4.6 


Consider the linear system 


X1 + T = 1, (3.4.37) 
—21 2x5 = —2, (3.4.38) 


After several row operations, the augmented matrix 


il T. 
edo melee (3.4.40) 
1 -1|4 
becomes 
1 1| 1 
0 1| 2 (3.4.41) 
0 0|-7 


From the last row of the augmented matrix, Equation 3.4.41, we see that the system is 
inconsistent. 
If we test this system using MATLAB by typing 
>>% input augmented matrix 
>>aug = [1 1 1 ; -1 2 -2; 1-1 4]; 
>>rrefmovie (aug) ; 


eventually you obtain 


A= 
1 0 
0 1 0 
0 1 


Although the numbers have changed from our hand calculation, we still have an inconsistent 
system because zı = z9 = 0 does not satisfy r4 + £2 = 1. 
Considering now a slight modification of this system to 


£ı + T2 = 1, (3.4.42) 
—2; + Ino = 5, (3.4.43) 
zı = —1, (3.4.44) 
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the final form of the augmented matrix is 
1 1|1 
0 1|2], (3.4.45) 
0 0/0 


which has the unique solution zı = —1 and x» = 2. 
How does MATLAB handle this problem? Typing 


>>% input augmented matrix 
>>aug = [1 1 1 ; -12 5; 10 -1]; 
»»rrefmovie(aug); 


we eventually obtain 


A= 
1 0 =1 
1 2 
0 0 0 


This yields z; = —1 and x2 = 2, as we found by hand. 
Finally, by introducing the set: 


21 29 — 1, (3.4.46) 
221 EL 2x2 = 2, (3.4.47) 
321 Ty 323 = 3, (3.4.48) 


the final form of the augmented matrix is 
1 1/1 
0 010]. (3.4.49) 
0 0/0 


There are an infinite number of solutions: x; = 1 — o, and z2 =a. 
Turning to MATLAB, we first type 


>>% input augmented matrix 
>>aug = [11 1 ; 222; 33 3]; 
>>rrefmovie(aug) ; 


and we eventually obtain 


A= 
1 1 1 

0 0 

0 0 0 


This is the same as Equation 3.4.49 and the final answer is the same. 


Gaussian elimination can also be employed with underdetermined systems. An under- 
determined linear system is one where there are fewer equations than unknowns (m « n). 
These systems usually have an infinite number of solutions although they can be inconsis- 
tent. 
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e Example 3.4.7 
Consider the underdetermined system: 


224 + 2x9 + da = —1, (3.4.50) 


Its augmented matrix can be transformed into the form: 
2 2 1|-1 
[S Ü ] E ó (3.4.52) 
Clearly this case corresponds to an inconsistent set of equations. On the other hand, if 
Equation 3.4.51 is changed to 


4a, + 49 + 2x3 = —2, (3.4.53) 


then the final form of the augmented matrix is 


De ait ST 
b: A j 3 (3.4.54) 


and we have an infinite number of solutions, namely z3 = a, £2 = 8, and 22, = —1—o- 2. 


Consider now one of the most important classes of linear equations: the homogeneous 
equations Ax = 0. If det(A) 4 0, then by Cramer's rule z; = £2 = z3 = ++: = Tn = 0. 
Thus, the only possibility for a nontrivial solution is det(A) = 0. In this case, A is singular, 
no inverse exists, and nontrivial solutions exist but they are not unique. 


e Example 3.4.8 


Consider the two homogeneous equations: 


zı 33 = 0, (3.4.55) 
zı — 29 = 0. (3.4.56) 


Note that det(A) = —2. Solving this system yields zı = z2 = 0. 
However, if we change the system to 


zı +22 — 0, (3.4.57) 
£i + T2 = 0, (3.4.58) 
which has the det(A) = 0 so that A is singular. Both equations yield z; = -z2 = o, 


any constant. Thus, there is an infinite number of solutions for this set of homogeneous 
equations. 


We close this section by outlining the proof of the theorem, which we introduced at 
the beginning. 

Consider the system Ax = b. By elementary row operations, the first equation in this 
system can be reduced to 


X1 + 031242 T: + Ann = py. (3.4.59) 
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'The second equation has the form 

Xp -FO2p41Up41 T: + AanIn = Bo, (3.4.60) 
where p > 1. The third equation has the form 

Tq + O3q1Tq4A + +++ + A3ntn = f3, (3.4.61) 
where q > p, and so on. To simplify the notation, we introduce z; where we choose the 
first k values so that zj = £1, 22 = £p, Z3 = £q, .... Thus, the question of the existence 


of solutions depends upon the three integers: m, n, and k. The resulting set of equations 
have the form: 


Lue. See We Tirti et Fin Z1 By 
O bo oe yog io) Se a Vn 22 f2 
0 0 1 Ykk+1 CCCo Ven Zk = Br . (3.4.62) 
0 0 0 0 0 Žk+1 Bett 
0 0 . O 0 e 0 Zn Bm 
Note that 6441, ..., Bm need not be all zero. 
'There are three possibilities: 
(a) k < m and at least one of the elements 6,41,..., 8m is nonzero. Suppose that an 


element p is nonzero (p > k). Then the pth equation is 
0z1 + 022 + +++ +02 = bp £0. (3.4.63) 


However, this is a contradiction and the equations are inconsistent. 

(b) k = n and either (i) k < m and all of the elements 8441,...,Gm are zero, or 
(ii) k = m. Then the equations have a unique solution that can be obtained by back- 
substitution. 


(c) k < n and either (i) k < m and all of the elements $441,..., Bm are zero, or (ii) 
k =m. Then, arbitrary values can be assigned to the n — k variables zy41,...,254. The 
equations can be solved for z1, 22,...,Zķ and there is an infinity of solutions. 


For homogeneous equations b = 0, all of the 8; are zero. In this case, we have only 
two cases: 

(b^ k = n, then Equation 3.4.62 has the solution z = 0, which leads to the trivial 
solution for the original system Ax = 0. 

(c) k < n, the equations possess an infinity of solutions given by assigning arbitrary 
values to Zy41,... Zn 


Problems 


Solve the following systems of linear equations by Gaussian elimination. Check your answer 
using MATLAB. 


1. 2x1 + z2 = 4, bea 
2. X1 +22 = 0, 321 — 4z = 1 
3. —%1 + 29 + 223 = 0, 321 + 4x2 + z3 = 0, —41-F £2 + 2x3 = 0 
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4. 4%, + 612 + 233 = 2, 224 + 29 — 4T3 = 3, 321 — 222 +5273 = 8 
5. 914 + T2 — 213 = —3, £1 — T2 + 2x3 = —1, —4z4, + 349 — 674 = 4 
6. £1 — 319 + 723 = 2, 2211 + 4x9 — 3213 = —1, —821 + 725 + 273 = 3 
7. 341 — 29 + 3£3 = 5, 2:1 — 4x3 + 7x3 = T, 4x1 — 919 4-213 = —15 
8. £1 + £2 + £3 + z4 — —1, 2r41— z2 + 3233 = 1, 

212 + 3x4 = 15, —41 + 222 + z4 = —2 


Find the inverse of each of the following matrices by Gaussian elimination. Check your 
answers using MATLAB. 


19 2 -—9 1 2 5 
9. t n 10. S 2 11. | -4 -1 2 12. |0 -1 2 
-2 0 1 2 4 11 
13. Does (42)! = (A~!)?? Justify your answer. 
Project: Construction of a Finite Fourier Series 


In Example 5.1.1 we show that the function f(t) given by Equation 5.1.8 can be reex- 
pressed 


f(t) = P + dm cos(nt) + bn sin(nt), —m«í«m, 
i +1 
ne” aee og E 
2 n?n n 


There we stated the Fourier series fits f(t) in a “least squares sense.” In Section 5.7 we will 
show that we could approximate f(t) with the finite Fourier series 


M-1 
f(t) = $40 + 5 Ay, cos(kt) + By sin(kt) + Ay cos( Mt), 
k=1 


if we sample f(t) at tm = (2m + 1 — M)r/M, where m = 0,1,2,...,M — 1 and M is an 

even integer. Then we can use Equation 5.7.12 and Equation 5.7.13 to compute A; and Bx. 

Because MATLAB solves linear equations in a least-squares sense, this suggests that we could 

use MATLAB as an alternative method for finding a finite Fourier series approximation. 
Let us assume that 


A N 
f(t) = F + Y An cos(nt) + B, sin(nt). 


n=1 


Then sampling f(t) at the temporal points tm = —7t+(2m—1)7/M, we obtain the following 
system of linear equations: 


DES 
ES + 5 An cos(ntm) + Bn sin(ntm) = f (tm), 


n=1 


where m = 1,2,..., M. Write a MATLAB program that solves this system for given N and 
M and compare your results with the exact answers ao, a, and bn for various N and M. 
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Consider the case when M > 2N + 1 (overspecified system), M < 2N + 1 (underspecified 
system), and M = 2N +1 (equal number of unknowns and equations). Does this method 
yield any good results? If so, under which conditions? 


Project: Solving Fredholm Integral Equation of the Second Kind 


Fredholm equations of the second kind and their variants appear in many scientific and 
engineering applications. In this project you will use matrix methods to solve this equation: 


b 
u(x) =| K(a,t)u(t) dt + f (x), a «mz «b, 


where the kernel K(z,t) is a given real-valued and continuous function and u(x) is the 
unknown. One method for solving this integral equation replaces the integral with some 
grid-point representation. The goal of this project is examine how we can use linear algebra 
to solve this numerical approximation to Fredholm's integral equation. 


Step 1: Using Simpson's rule, show that our Fredholm equation can be written in the matrix 
form (I — KD)u = f, where 


Ao 0 0 0 
0 A 0 0 
D= : : F 

0 0 - An-1 0 

0 0 0 An 
K (xo, £o) K(ro,21) +++ K(2o,%n-1) K (x0, £n) 
K (21,20) K(21,21) us K (21,241) K (21,24) 

K= | : l 

K(@n_-1,%9) K(Za 1,21) +++ K(@n—1,2n-1) K(@n—1,2n) 
K (an, £0) K(£n,£1) © K(£n, 4-1) K (14,24) 


u= [u(xo), u(z1), id Aiea s and f= [f(xo), f(z), MS Fall: 


where Ao = An = h/3, A» A4 Eps An_2 2h/3, A1 As ES An-1 Ah/3, 
zi = th, and h = (b — a)/n. Here n must be an even integer. 


Step 2: Use MATLAB to solve our matrix equation to find u. Use the following known 
solutions: 


(a) K(x,t) = 42t’, f(z) = 0.927, u(z) = z?, 
(b) K(a,t) = e-d, f(z) c4 (1 -— zx)e*, u(x) = e”, 
(c) K(z,t) = 3e, f(x) = 36, u(x) =e*, 
(d) K(g,t) = —je 555, f(a) 2 ez + 1, uz) = e°”, 
(e) K(a,t) = —v (e** — 1), für) =e — zx, u(x) = 1, 
(f) Kn t) = 22, f(z) = 22, u(t) = 2, 


when 0 € x < 1. How does the accuracy of this method vary with n (or h)? What happens 
when n becomes large? Figure 3.4.1 shows the absolute value of the relative error in the 
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|relative error| 


5 10 15 20 25 30 35 40 


Figure 3.4.1: The absolute value of the relative error of the numerical solution of a Fredholm integral 
equation of the second kind as a function of n for test problems (a), (b), (d), and (e). 


numerical solution at z = 3 as a function of n. For test cases (c) and (f) the error was the 
same order of magnitude as the round-off error. 


Project: LU Decomposition 


In this section we showed how Gaussian elimination can be used to find solutions to sets 
of linear equations. A popular alternative involves rewriting the n x n coefficient matrix: 


11 412 Q13 -« 3 ` Qin 
a21 Q22 Q23 ' j © 02n 
A = 
Qij 
Qni n2 Q3 5 ^ x Ann 


as the product of a lower n x n triangular matrix: 


lı 0 0 > + + 0 

boy £225 0 . 2 $ 0 
L- bi 

lni Lng lag 3 j 4 Lan 


and an upper n x n triangular matrix: 
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so that A — LU. By simply doing the matrix multiplication, we find the following Crout 
algorithm to compute £;; and uij: 


j-1 
lg Sox — >. Fai JE = ya 
k=1 


and 
i-1 
Uij = fey- Nun] /t $m. J-—2:3;0 4m. 
k=1 


For the special case of j = 1, £j; = aj1; for i = 1, u4; = a1; /C11 = ai /ai1. Clearly we could 
write code to compute L and U given A. However, MATLAB has a subroutine for doing this 
factorization [L,U] = lu(A). 

How does this factorization help us to solve Ax = b? The goal of this project is to 
answer this question. 


Step 1: Show that Ly = b and Ux = y can be combined together to yield Ax = b. 


Step 2: Show that y; and 2; can be computed from yı = 61/11, 


i—1 
Ui = b; — So tb; / 132,9, vy 
J= 


and £n = Yn/Unn, 


n 
Ti = | Yi = 5 UijYj us i—n-—1l,n-—2,...,1. 


j=i+1 


Step 3: Write a MATLAB script to solve Ax = b using LU decomposition. 
Step 4: Check your program by resolving Problems 4, 6, 7, and 8. 


The principal reason that this scheme is so popular is its ecomony of storage. The 0’s 
in either L or U are not stored. Furthermore, after the element aj; is used, it never appears 
again. 


Project: QR. Decomposition 


In the previous project you discovered that by factoring the matrix A into upper and 
lower diagonal matrices, we could solve Ax — b. Here we will again factor the matrix A 
into the product QR but Q will have the property that QTQ = I (orthogonal matrix) and 
R is an upper triangular matrix. 


Step 1: Assuming that we can rewrite Ax = b as QRx = b, multiply both sides of this 
second equation by QT and show that you obtain Rx = QTb = y. 
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Step 2: Show that v; can be computed from £n = y, /r,;4, and 


TL 


Ti = |W— 5 TijYj [rs i—n-—1l,n-—2,...,1. 


j=i+1 


Step 3: Write a MATLAB script to solve Ax = b using QR decomposition. 
Step 4: Check your program by resolving Problems 4, 6, 7, and 8. 


What advantages does QR decomposition have over LU decomposition? First, solving 
Ax = b via Rx = QTb is as well-conditioned as the original problem. Second, QR decom- 
position finds the least-squares solutions when no exact solution exists. When there are 
exact solutions, it finds all of them. 


3.5 EIGENVALUES AND EIGENVECTORS 


One of the classic problems of linear algebra? is finding all of the A's that satisfy the 
n x n system 
Ax = Ax. (3.5.1) 


The nonzero quantity A is the eigenvalue or characteristic value of A. The vector x is the 
eigenvector or characteristic vector belonging to A. The set of the eigenvalues of A is called 
the spectrum of A. The largest of the absolute values of the eigenvalues of A is called the 
spectral radius of A. 

To find A and x, we first rewrite Equation 3.5.1 as a set of homogeneous equations: 


(A— AD)x — 0. (3.5.2) 


From the theory of linear equations, Equation 3.5.2 has trivial solutions unless its determi- 
nant equals zero. On the other hand, if 


det(A — AI) — 0, (3.5.3) 


there are an infinity of solutions. 

'The expansion of the determinant, Equation 3.5.3, yields an nth-degree polynomial in A, 
the characteristic polynomial. 'The roots of the characteristic polynomial are the eigenvalues 
of A. Because the characteristic polynomial has exactly n roots, A has n eigenvalues, some 
of which can be repeated (with multiplicity k < n) and some of which can be complex 
numbers. For each eigenvalue A;, there is a corresponding eigenvector x;. This eigenvector 
is the solution of the homogeneous equations (A — A;I)x; = 0. 

An important property of eigenvectors is their linear independence if there are n distinct 
eigenvalues. Vectors are linearly independent if the equation 


Q1X1 + 2X2 + +- + AnXn = 0 (3.5.4) 


can be satisfied only by taking all of the coefficients a,, equal to zero. 


6 The standard reference is Wilkinson, J. H., 1965: The Algebraic Eigenvalue Problem. Oxford Univer- 
sity Press, 662 pp. 
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This concept of linear independence or dependence actually extends to vectors in gen- 
eral, not just eigenvectors. Algebraists would say that our n linearly independent vectors 
form a basis that spans a vector space V. A vector space is simply a set V of vectors that 
can be added and scaled. The maximum number of linearly independent vectors in a vector 
space gives its dimension of V. A vector space V can have many different bases, but there 
are always the same number of basis vectors in each of them. 

Returning to the eigenvalue problem, we now show that in the case of n distinct eigen- 
values 41, Ào,..., An, each eigenvalue A; having a corresponding eigenvector x;, the eigen- 
vectors form a basis. We first write down the linear dependence condition 


04X4 + aQ3X2 +--+ + o Xx, = 0. (3.5.5) 
Premultiplying Equation 3.5.5 by A, 
01ÀX + Q3ÀX2 T c an ÀX, = Q1À1X1 + 03À3X2 d + AnAnXn = 0. (3.5.6) 
Premultiplying Equation 3.5.5 by A?, 
a A?x, + a A? x» 4- +++ + o A?x,, = oq Mx + AQ ARK] +--+ a4 Mx, = 0. (3.5.7) 


In a similar manner, we obtain the system of equations: 


1 1 ee 1 O1X| 0 
^ A2 c An Q2X2 0 
M Ay xe oe a3x3 | 2| O0. (3.5.8) 
FS D NS dg 0 
Because 
1 1 m 1 
Pora CEE Oed onus 3 
A Aj 0e M | 02 — AUS — A2) (Aa — Ar) (Aa — A3 359 
: : : (Ag — Az) +++ (An — Ai) 20, (9,59) 
XX doe que ane 
since it is a Vandermonde determinant, a,x, = Q2X2 = A3X3 = ::: = AnXn = 0. Because 
the eigenvectors are nonzero, a1 a2 a3 EE An 0, and the eigenvectors are 


linearly independent. 
This property of eigenvectors allows us to express any arbitrary vector x as a linear 
sum of the eigenvectors x;, or 


X = CX] + CX +++: + CnXn.- (3.5.10) 
We will make good use of this property in Example 3.5.3. 
e Example 3.5.1 


Let us find the eigenvalues and corresponding eigenvectors of the matrix 


A= = EU (3.5.11) 
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We begin by setting up the characteristic equation: 


det(A — AI) = E A m | =o (3.5.12) 


Expanding the determinant, 
(-4— \)(-1— A) +2 = M + 5A +6 = (AF 3)(A4 2) =0. (3.5.13) 


Thus, the eigenvalues of the matrix A are \; = —3, and Ag = —2. 
To find the corresponding eigenvectors, we must solve the linear system: 


a E tU "S Ge (3.5.14) 


For example, for A, = —3, 


or 


Thus, any nonzero multiple of the vector (1) is an eigenvector belonging to A, = —3. 


1 


SE : . , 1 
Similarly, for Ag = —2, the eigenvector is any nonzero multiple of the vector 1 
Of course, MATLAB will do all of the computations for you via the command eig, which 
computes the eigenvalues and corresponding eigenvalues. In the present case, you would 
type 
>> A = [-4 2; -1 -1]; % load in array A 
>> % find eigenvalues and eigenvectors 
>> [eigenvector,eigenvalue] = eig(A) 


This yields 


eigenvector = 
-0.8944 -0.7071 
-0.4472 -0.7071 
and 
eigenvalue = 
=3 0 
0 =2; 


The eigenvalues are given as the elements along the principal diagonal of eigenvalue. 
The corresponding vectors are given by the corresponding column of eigenvector. As 
this example shows, these eigenvectors have been normalized so that their norm, Equation 
3.1.5, equals one. Also, their sign may be different than any you would choose. We can 
recover our hand-computed results by dividing the first eigenvector by —0.4472 while in 
the second case we would divide by —0.7071. Finally, note that the product eigenvec- 
tor*eigenvalue*inv(eigenvector) would yield A. 
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e Example 3.5.2 


Let us now find the eigenvalues and corresponding eigenvectors of the matrix 


—4 5 5 
A=|-5 6 5]. (3.5.17) 
-5 5 6 
Setting up the characteristic equation: 
det(A — AI) 
-4—-A 5 5 -4-A 5 5 
=| -5 6-A 5 |=| -5 6-A 5 (3.5.18) 
-5 5 6-A 0 A-1 1-2 
—4—) 5 5 -1 1 0 
=(A-1) 5 6—À §/SOA—-1)?/-5 6—A 5 (3.5.19) 
0 i -1 0 1 -1 
-1 0 0 
det(A — AI) =(A-1)?|-5 6—A 0 | —(A-1Y(6—-23)- 0. (3.5.20) 
0 1 -1 


Thus, the eigenvalues of the matrix A are A1, = 1 (twice), and A3 = 6. 
'To find the corresponding eigenvectors, we must solve the linear system: 


(—4 — A)ri + 522 + 523 = 0, (3.5.21) 

—5q4 S (6 = A) x2 os 5x3 = 0, (3.5.22) 
and 

—5q4 m 5x9 + (6 = AÀ)zs =0. (3.5.23) 


For A3 = 6, Equations 3.5.21 through 3.5.23 become 


—10z, + 5x2 + 5x3 = 0, (3.5.24) 
—521 T 5x3 = 0, (3.5.25) 
and 
—524 zn 5x5 = 0. (3.5.26) 
1 
Thus, z; = x2 = z3 and the eigenvector is any nonzero multiple of the vector | 1 


1 
The interesting aspect of this example centers on finding the eigenvector for the eigen- 
value À1,5 = 1. If 412 = 1, then Equations 3.5.21 through 3.5.23 collapse into one equation, 


=z + T2 + 73 = 0, (3.5.27) 


and we have two free parameters at our disposal. Let us take r9 = a, and z3 = B. Then 
1 1 

the eigenvector equals a | 1 | -8| 0] for^12— I. 
0 1 
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1 1 
In this example, we may associate the eigenvector | 1 | with A; = 1, and | 0 | with 
0 1 
1 
À2 = 1 so that, along with the eigenvector | 1 | with A3 = 6, we still have n linearly 
1 
independent eigenvectors for our 3 x 3 matrix. However, with repeated eigenvalues this is 


not always true. For example, 
1 -1 
A= T 1 ) (3.5.28) 


has the repeated eigenvalues A1, = 1. However, there is only a single eigenvector (4) for 
both i and dQ. 
What happens in MATLAB in the present case? Typing in 


>> A= [45 5; -5 6 5; -5 5 6]; % load in array A 
>> % find eigenvalues and eigenvectors 


>> [eigenvector,eigenvalue] = eig(A) 
we obtain 
eigenvector = 
-0.8165 0.5774 0.4259 
-0.4082 0.5774 -0.3904 
-0.4082 0.5774 0.8162 
and 
eigenvalue = 
1 0 0 
0 6 0 
0 0 1 


The second eigenvector is clearly the same as the hand-computed one if you normalize it with 
0.5774. The equivalence of the first and third eigenvectors is not as clear. However, if you 
choose a = f = —0.4082, then the first eigenvector agrees with the hand-computed value. 
Similarly, taking a = —0.3904 and 6 = 0.8162 result in agreement with the third MATLAB 
eigenvector. Finally, note that the product eigenvector*eigenvalue*inv (eigenvector) 
would yield A. 


e Example 3.5.3 


When we discussed the stability of numerical schemes for the wave equation in Section 
7.6, we will examine the behavior of a prototypical Fourier harmonic to variations in the 
parameter cAt/Az. In this example we shall show another approach to determining the 
stability of a numerical scheme via matrices. 

Consider the explicit scheme for the numerical integration of the wave equation, Equa- 
tion 7.6.11. We can rewrite that single equation as the coupled difference equations: 


uH E 2(1 m r2)u? +4 r? (uf. i1 $ up, i) ES Up; (3.5.29) 


and 
qb = u? (3.5.30) 
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where r = cAt/Az. Let u*,,, = ei^*u^.. and u*, = e-^*u^, where 8 is real. Then 
Equation 3.5.29 and Equation 3.5.30 become 


A 
untl 9 | — 2r? sin? (=) Um — Um) (3.5.31) 
and 
yrtl Lan. (3.5.32) 


or in the matrix form 


— on sin? (BA2\] — 
wie (l 2r a 3 ) Te (3.5.33) 


e 
3343 


where u7, = i J The eigenvalues A of this amplification matriz are given by 


M -2 f — 2r? sin? (=*)| A+1=0, (3.5.34) 


Aig = 1 — 2r° sin? c3 + 2r 26 y” sin? c3 =f (3.5.35) 


Because each successive time step consists of multiplying the solution from the previous 
time step by the amplification matrix, the solution is stable only if u7, remains bounded. 
This occurs only if all of the eigenvalues have a magnitude less or equal to one, because 


Or 


uy, = 5 Ck A" Xk = 5 Ck AR Xk, (3.5.36) 
k k 


where A denotes the amplification matrix and x; denotes the eigenvectors corresponding to 
the eigenvalues Aj. Equation 3.5.36 follows from our ability to express any initial condition 
in terms of an eigenvector expansion 


lig = Y ox (3.5.37) 
k 
In our particular example, two cases arise. If r? sin?(GAa/2) < 1, 


Àna—1- 2r? sin? (=) tri s (22) i) — r? sin? (=) (3.5.38) 


and [A4 »| = 1. On the other hand, if r? sin?(@Aa/2) > 1, |A1,2| > 1. Thus, we have stability 
only if cAt/Az < 1. 
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we Ax œ 


Figure 3.5.1: Schematic for finite-differencing a Sturm-Liouville problem into a set of difference equations. 
Problems 


Find the eigenvalues and corresponding eigenvectors for the following matrices. Check your 
answers using MATLAB. 


2 —3 1 
1 4- (3 A 2a-(i 7) 3. A-[1 -2 1 
1 -3 2 
010 11 1 12 1 
4 A—-|0 0 1 5.A-2[0 2 1 6 A—-|0 3 1 
000 0 0 1 0 5 -1 
4 —5 1 —2 0 1 
T. A—-|1 0 -4 8 A=| 3 0 -1 
0 1 -1 0 1 1 


Project: Numerical Solution of the Sturm-Liouville Problem 


You may have been struck by the similarity of the algebraic eigenvalue problem to the 
Sturm-Liouville problem. (See Section 6.1.) In both cases nontrivial solutions exist only 
for characteristic values of A. The purpose of this project is to further deepen your insight 
into these similarities. 

Consider the Sturm-Liouville problem 


y" 4- Ay — 0, y(0) = y(r) = 0. (3.5.39) 


We know that it has the nontrivial solutions Ay, = m7, Ym(x) = sin(mz), where m = 
le ee 


Step 1: Let us solve this problem numerically. Introducing centered finite differencing and 
the grid shown in Figure 3.5.1, show that 


1 py Ynti 7 Pyn ail mele us 


(Aa)? 


N, (3.5.40) 
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Table 3.5.1: Eigenvalues Computed from Equation 3.5.42 as a Numerical Approximation 
of the Sturm-Liouville Problem, Equation 3.5.39 


N Mt A2 Aa A4 A5 A6 À7 

1 0.81057 

2 0.91189 2.73567 

3 0.94964 3.24228 5.53491 

4 0.96753 3.50056 6.63156 9.16459 

5 0.97736 3.64756 7.29513 10.94269 13.61289 

6 0.98333 3.73855 7.71996 12.13899 16.12040 18.87563 

7 0.98721 3.79857 8.00605 12.96911 17.93217 22.13966 24.95100 
8 0.98989 3.84016 8.20702 13.56377 19.26430 24.62105 28.98791 
20 0.99813 3.97023 8.84993 15.52822 23.85591 33.64694 44.68265 
50 


0.99972 3.99498 8.97438 15.91922 24.80297 35.59203 48.24538 


where Az = «/(N + 1). Show that the finite-differenced form of Equation 3.5.39 is 
—h yng + 2h99, — hys 4 = Ayn (3.5.41) 
with yo = yv41 = 0, and h = 1/(Ax). 


Step 2: Solve Equation 3.5.41 as an algebraic eigenvalue problem using N — 1,2,.... Show 
that Equation 3.5.41 can be written in the matrix form of 


2n? —h? 0 ee 0 0 0 Ui yı 
—h? 2h? —-h? --- 0 0 0 Y2 ya 
0 -AM 2h? ... 0 0 0 Y3 Y3 
"E NUM "opea oJ. (3.5.42) 
0 0 0 e. =f? Qh? —h? YN-1 YN-1 
0 0 0 ee 0 —h? 2h? YN UN 


Note that the coefficient matrix is symmetric. 


Step 3: You are now ready to compute the eigenvalues. For small N this could be done by 
hand. However, it is easier just to write a MATLAB program that will handle any N > 2. 
Table 3.5.1 has been provided so that you can check your program. 

With your program, answer the following questions: How do your computed eigenvalues 
compare to the eigenvalues given by the Sturm-Liouville problem? What happens as you 
increase N? Which computed eigenvalues agree best with those given by the Sturm-Liouville 
problem? Which ones compare the worst? 


Step 4: Let us examine the eigenfunctions now. Starting with the smallest eigenvalue, use 
MATLAB to plot Cy; as a function of x; where y; is the jth eigenvector, 7 = 1,2,...,N, 
zi = iAz, i = 1,2,..., N, and C is chosen so that C^Az * ^; y;(z;) = 1. On the same 
plot, graph y;(zx) = \/2/m sin(jz). Why did we choose C as we did? Which eigenvectors 
and eigenfunctions agree the best? Which eigenvectors and eigenfunctions agree the worst? 
Why? Why are there N eigenvectors and an infinite number of eigenfunctions? 
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Step 5: The most important property of eigenfunctions is orthogonality. But what do we 
mean by orthogonality in the case of eigenvectors? Recall from three-dimensional vectors 
we had the scalar dot product 


a-b= aıbı + azb2 + a3b3. (3.5.43) 


For n-dimensional vectors, this dot product is generalized to the inner product 
TL 
x- y= 5 TkYk- (3.5.44) 
k=1 


Orthogonality implies that x- y = 0 if x Z y. Are your eigenvectors orthogonal? How 
might you use this property with eigenvectors? 


Project: Singular Value Decomposition and Linear Least Squares 


In the previous section we showed two ways that linear equations can be solved by 
factoring the matrix A in Ax = b into a product of two matrices. The LU and QR 
decompositions are not the only possible factorization. One popular version rewrites the 
square matrix A as PDP- 1, where D is a diagonal matrix with the n eigenvalues along the 
principal diagonal and P contains the eigenvectors in the transition matrix P. MATLAB's 
routine eig yields both D and P via [P,D] = eig(A). See Example 3.6.4. In this project we 
focus on singular value decomposition, possibly the most important matrix decomposition 
of them all. It is used in signal processing, statistics, and numerical methods and theory. 

Singular value decomposition factorizes a matrix A of dimension m x n into the product 
of three matrices: A = U DVT, where U is an m x m orthogonal matrix, V is an n x n 
orthogonal matrix, and D is an m x n diagonal matrix. The diagonal entries of D are called 
the singular values of A. The rank of a matrix equals the number of non-zero singular 
values. 


Step 1: Consider the matrix A given by 


45  —108 36  —45 
21  —68 26 —33 
A—| 72 -32 -16 24 
—56 64 —8 8 
50  —32 —6 10 


Using MATLAB’s subroutine svd, confirm that it can factorize the array A into U, D, and 
V. Then check that A = U DVT and find the rank of this matrix. 


The goal of this project is to find the parameters m and c so that the line y = mx +c 
gives the best fit to n data points. If there are only two data points, there is no problem 
because we could immediately find the slope m and the intercept c. However, if n > 2 
we cannot hope to choose these coefficients so that the straight line fits them. How does 
singular value decomposition come to the rescue? 

To find the answer, we begin by noting that each data point (£i, yi) must satisfy the 
linear equation mz; + c = yi, or 
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Figure 3.5.2: Using 50 “data points," the singular value decomposition provides a least-squares fit to the 
data. 


We can write this over-determined system of linear equations Ax — b. Let the residual r be 
a vector defined by r = Ax — b. The vector x* that yields the smallest possible residual in 
the least-squares sense is ||r|| = || Ax* — b|| € || Ax — b||, where ||- || denotes the Euclidean 
norm. Although a least-squares solution always exists, it might not be unique. However, 
the least-squares solution x with the smallest norm ||x|| is unique and equals AT Ax = ATb 
or x = (AT A)-1 ATb. This solution can be found using singular value decomposition as 
x= VD} !UTb, where the n x m matrix D7‘ has the diagonal terms: 


=i = 1/3, if D; > €, 
(Do J ={ 0, otherwise, 


even when A is singular or ill-conditioned. 


Step 2: In Step 1, we found that the rank for the matrix A is 2. What values of x does 
singular value decomposition give if b = (—2 44 6 —4)"? This solution equals the least- 
squares solution of minimum length. 


Step 3: Returning to our original goal of finding the best linear fit to data, create data for 
your numerical experiment. One way would use the simple line y = mz + c (with arbitrary 
chosen values of m and c) to create the initial data and then use the random number 
generator rand to modify this initial data (both x and y) so that both have “noise.” 


Step 4: Construct the array A and column vector y. Using the MATLAB routine svd, find 
xc yu hs. 


Step 5: Construct the least-squares fit for the data and plot this curve and your data on 
the same figure. See Figure 3.5.2. 


3.6 SYSTEMS OF LINEAR DIFFERENTIAL EQUATIONS 


In this section we show how we may apply the classic algebraic eigenvalue problem to 
solve a system of ordinary differential equations. 
Let us solve the following system: 


x = T1 3x2, (3.6.1) 
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and 
x = 3x1 + 22, (3.6.2) 


where the primes denote the time derivative. 
We begin by rewriting Equation 3.6.1 and Equation 3.6.2 in matrix notation: 


x’ = Ax, (3.6.3) 


x= [o and A= t i) (3.6.4) 


siy dfa) y 
(5)-2(2)-» T 


Assuming a solution of the form 


where 


Note that 


x = xe, where Xo = @ (3.6.6) 


is a constant vector, we substitute Equation 3.6.6 into Equation 3.6.3 and find that 
Ae" xo = Ae™xo. (3.6.7) 
Because eò does not generally equal zero, we have that 
(A — AD)xo = 0, (3.6.8) 


which we solved in the previous section. This set of homogeneous equations is the classic 
eigenvalue problem. In order for this set not to have trivial solutions, 


deta -an =|* 5 aed (3.6.9) 
Expanding the determinant, 
(1—A)?-9=0 or A—-2,4 (3.6.10) 
Thus, we have two real and distinct eigenvalues: A = —2 and 4. 


We must now find the corresponding xo or eigenvector for each eigenvalue. From 
Equation 3.6.8, 


(1 — A)a + 3b — 0, (3.6.11) 
and 

3a 4- (1 — A)b — 0. (3.6.12) 
If À = 4, these equations are consistent and yield a = b = cj. If A = —2, we have that 
a = —b = c2. Therefore, the general solution in matrix notation is 


xa (1) gt + co e e t. (3.6.13) 
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To evaluate cı and co, we must have initial conditions. For example, if 71(0) = x2(0) = 


iten 
(1)-a(1)*«(À). (3.6.14) 


Solving for cı and co, c1 = 1, c2 = 0, and the solution with this particular set of initial 


conditions is 
x= (1) p (3.6.15) 


e Example 3.6.1 


Let us solve the following set of linear ordinary differential equations 


X = —T2 + T3, (3.6.16) 
xh = 4r, — £2 — 413, (3.6.17) 
and 
xh = —3z4— Lo + 423; (3.6.18) 
or in matrix form, 
0 —1 1 X1 
x ={ 4 -1-4|x x= | x |]. (3.6.19) 
—3 -1 4 x3 
Assuming the solution x = xoe*, 
0 -1 1 
4 —] -4 xo = AXo, (3.6.20) 
-3 -1 4 
or 
—A -1 1 
4 -1-A -4 |xo=0. (3.6.21) 
3 1 4—A 
For nontrivial solutions, 
—A -1 1 0 0 1 
4 —1—A —4 |= 4 —4A —5—A —4 |=0, (3.6.22) 
3 1 4—A 3+4\-—d? 3-A 4-A 
and 
(A= 1)(A-—3)(A41) =0, or A—-11,3. (3.6.23) 


To determine the eigenvectors, we rewrite Equation 3.6.21 as 
—Aa—b+c=0, (3.6.24) 


4a — (1+ A)b — 4c = 0, (3.6.25) 


and 
—8a — b + (4—A)c — 0. (3.6.26) 
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For example, if A = 1, 


—a—b+c=0, (3.6.27) 
4a — 2b — 4c = 0, (3.6.28) 
and 
—3a — b 4- 3c = 0; (3.6.29) 
1 
or a = c, and b = 0. Thus, the eigenvector for A = 1 is xo = | 0 |. Similarly, for \ = —1, 
1 
1 1 
Xo = | 2 |; and for \=3, x» = | —1 |. Thus, the most general solution is 
1 2 
1 1 1 
x=c, |0] é+ |2 |e +e | -1 | e*. (3.6.30) 
1 1 2 


e Example 3.6.2 


Let us solve the following set of linear ordinary differential equations: 


x] = t1 — 219, 3.6.31) 
and 
xy = 2x, — 312; 3.6.32) 
or in matrix form, 
P 1 —2 = X1 
x — t _3 )% Ec s 3.6.33) 
Assuming the solution x = xge*, 
br 324) 79. (3.6.34) 
For nontrivial solutions, 
LA = (A+1)? =0 (3.6.35) 
2 —3—4A j Ae 
'Thus, we have the solution 
x=C} n e™. (3.6.36) 


The interesting aspect of this example is the single solution that the traditional ap- 
proach yields because we have repeated roots. To find the second solution, we try the 


solution 
[96 Y 4 
x= pur . (3.6.37) 
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We guessed Equation 3.6.37 using our knowledge of solutions to differential equations when 
the characteristic polynomial has repeated roots. Substituting Equation 3.6.37 into Equa- 
tion 3.6.33, we find that c = d = 2c2, and a — b = cg. Thus, we have one free parameter, 
which we choose to be b, and set it equal to zero. This is permissible because Equation 


1+ 2) e^t. The first term can 


1 


3.6.37 can be broken into two terms: b 1 e^! and c ( m 


be incorporated into the c (1) e * term. Thus, the general solution is 


x—a ({)ette (jet +20 et (3.6.38) 


e Example 3.6.3 


Let us solve the system of linear differential equations: 


vy = 2x1 — 319, 3.6.39) 
and 
xy = 3x1 + 212; 3.6.40) 
or in matrix form, 
"NN 2 —3 = X1 
x zr 9 E E 3.6.41) 
Assuming the solution x = xe", 
i 3 A m Xo = 0. (3.6.42) 
For nontrivial solutions, 
re js = (2—A)?+9=0, (3.6.43) 


a 


b 


and A = 2 3i. If xo = ( 


the general solution is 


), then b= -ai fA = 2:3 and b = ai if A = 2 — 3i. Thus, 


x= ( ! ) et 316 + e e) QUE (3.6.44) 


—i 
where cı and co are arbitrary complex constants. Using Euler relationships, we can rewrite 


Equation 3.6.44 as 
u cos(3t) | s, sin(3t) | » 
X — 03 bee | e dea UT cos(3t) e”, (3.6.45) 


where c3 = c1 + c2 and c4 = i(c — c2). 
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e Example 3.6.4: Diagonalization of a matrix A 
Let x1,X2,::-,X4 denote the eigenvectors, with corresponding eigenvalues A1, À5,-::, 
An, of an n x n matrix A. If we introduce a matrix X = [x1x2---x,] (the eigenvectors form 
the columns of X) and recall that Ax; = Ajx;, then 
AX = Abaxo-:- Xn] = [AxiAxo:-- AXn] = [A1x1A2K2° +: AnXn]- (3.6.46) 


Therefore, AX = X D, where 


A 0 0 
0 à 0 

D=]. vx. x as Je (3.6.47) 
0 0 Xn 


Because X has rank n, X-! exists and X! AX = X! XD = D. Thus, X AX = Disa 
process whereby we can diagonalize the matrix A using the eigenvectors of A. Diagonalizable 
matrices are of interest because diagonal matrices are especially easy to use. Furthermore, 


we note that 
D? = DRX “TAX X-TAX = X AAX X A?X. (3.6.48) 


Repeating this process, we eventually obtain the general result that D" = X~'A™X. 
To verify X~1AX = D, let us use 


3 4 
a=(3 J (3.6.49) 
This matrix has the eigenvalues 1,2 = 1,5 with the corresponding eigenvectors x; = 
(2 1)" andx,=(2 —1)". Therefore, 
(2 2 -1_ (1/4 -A/2 
xa(2 P) ea xt (Q IP). oso 
Therefore, 
E | (1/4 -1/2 3 4 2 2 
X Ax - (0 y, os irm (3.6.51) 
_ (1/4 -1/2 2 10\ /1 0 
x VM ae 37 58:02) 
Problems 


Find the general solution of the following sets of ordinary differential equations using matrix 
technique. You may find the eigenvalues and eigenvectors either by hand or use MATLAB. 


l. z,-—z1-42z2 Uy = 2a, + T2 
2. qz = T1 — 4x9 Ly = 3x1 — 6x2 


3. £) = 41+ x vy = 4m, + T2 
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4. q^ = £1 5x9 zh = —2x1 — 6x2 

5. a, = 32) — 2x Uy = 2a, + $22. 

6. x} = —3z4 — 2x zh = 2a, + T2 

T. Y? = 44 — T2 Lo = T1 329 

8. x, = 3x1 + 222 Ly = —2T1 — T2 

9. x} = —22z4 — 13x2 zh = £1 + 4T2 

10. xz} = 3x1 — 229 zh = 5x1 — 329 

11. z} = 4x1 — 223 zh = 25x1 — 10x2 

12. x} = —3z, — 4x» Ly = 2x4 + T2 

13. z/, = 3x1 + 4x2 Lh = —2T1 — T2 

14. x^, + 5x1 + zh + 322 = 0 2x^ Exi +z + z2 =0 

15. x} — zi + zh — 2x2 = 0 xi — 5x1 +225 — Tro = 0 

16. x} = x1 — 2x2 ry =0 vs = —5z4 + Tz3. 
17. z/, = 22, Lh = T1 2x3 X5 = T3. 

18. xv = 3x1 — 2x3 Uy = —T1 + 22 + T3 x, = 4x1 — 373 
19. v = 321 — T3 Uy = —2a4 + 22:3 + 23 x5 = 8x1 — 343 


3.7 MATRIX EXPONENTIAL 


In the previous section we solved initial-value problems involving systems of linear 
ordinary differential equations via the eigenvalue problem. Here we introduce an alternative 
method based on the matrix exponential, defined by 


e^t = I+ Att AAP +--+ PASH. (3.7.1) 


Clearly 


e = e^0 =], and £ (e^t) = Ae“, (3.7.2) 


Therefore, using the matrix exponential function, the solution to the system of homogeneous 
linear first-order differential equations with constant coefficients 


x’ = Ax, x(0) = Xo, (3.7.3) 


A 


is x(t) = e?! xp. 


140 Advanced Engineering Mathematics with MATLAB 


'The question now arises as to how to compute this matrix exponential. There are 
several methods. For example, from the concept of diagonalization of a matrix (see Example 
3.6.4) we can write A = PDP~', where P are the eigenvectors of A. Then, 


A wvS(PDP* <= p^ E c. D* ET D p-1 
=y I BRX RI P 255r Pt = Pe? P7}, (3.7.4) 
k=0 k=0 k=0 
where 
e^ 0 0 
0 e? DX 0 
peu ou cu cds (3.7.5) 
0 0 e € 


Because many software packages contain routines for finding eigenvalues and eigenvectors, 
Equation 3.7.4 provides a convenient method for computing e^. In the case of MATLAB, 
we just have to invoke the intrinsic function expm(-). 

In this section we focus on a recently developed method by Liz,’ who improved a 
method constructed by Leonard.? The advantage of this method is that it uses techniques 
that we have already introduced. We will first state the result and then illustrate its use. 

'The main result of Liz's analysis is: 


Theorem: Let A be a constant n x n matrix with characteristic polynomial p(A) = A" + 
C4 1A% 1 Tec oA-c c9. Then 


e^t = qi (t)I +ao(t)A+---+ay(t)A™, (3.7.6) 
where 
a(t) qi(t) 
v»(t) | ex) 
= B; . , (3.7.7) 
"NO ext) 
wit) v e Pa) 
pg-|e0 A0 - eO ie 
eu h(t) e vi D) 


and S = (qvu(t),qa(t),..., Pn(t)} being a fundamental system of solutions for the homoge- 
neous linear differential equations whose characteristic equation is the characteristic equa- 
tion of A, p(A) = 0. The proof is given in Liz's paper. Note that for this technique to work, 
x1(0) = 1 and z3(0) = z3(0) =--- = z4(0) = 0. 


e Example 3.7.1 
Let us illustrate this method of computing the matrix exponential by solving 


x —2r—y-cz, (3.7.9) 


T Liz, E., 1998: A note on the matrix exponential. SIAM Rev., 40, 700-702. 


8 Leonard, I. E., 1996: The matrix exponential. SIAM Rev., 38, 507-512. 
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y —3y-z, (3.7.10) 
and 
z =Qe+y43z. (3.7.11) 
The solution to this system of equations is x = e^'xo, where 
2-1 1 x(t) 
A—-|0 3 -1 and x= | y(t) (3.7.12) 
2 1 3 z(t) 


The vector xo is the value of x(t) at t = 0. 
Our first task is to compute the characteristic polynomial p(A) = 0. This is simply 


ll-A|2| 0 A-3 1 |[-(A-2Y(A-4)-0. (3.7.13) 


Consequently, A = 2 twice and A = 4, and the fundamental solutions are S = {e?!, te?! , e*t , 
Therefore, 


ett 4ett 16e*t 
B exe get 4e?! ; (3.7.14) 
te e?t + 2te** 4e% + Ate?! 
and 
6 —4 6 1 4 16 -1 0 -4 
A—-|[-2 8 -6],Bo=(1 2 4]|,BHj!—-|-1 1 3 |. (3.7.15) 
10 4 10 01 4 EM qe 
The inverse B ! can be found using either Gaussian elimination or MATLAB. 
To find z1(t), x(t) and za(t), we have from Equation 3.7.7 that 
x(t) -1 0 -4 e*t e^t — Ate?! 
zo6(t | -|-1 1 8 et |=| e-etx-3t[p295 |, (3.7.16) 
x3(t) i -i 7-3 tet tett — iet — Ste?! 
or 
y(t) = e** — 4te™, xo(t) = e” — e** + 3te, x(t) = 1e” — le” — lte”. (3.7.17) 
Note that z1(0) = 1 while x2(0) = x3(0) = 0. 
Finally, we have that 
100 De ole d 6 —4 6 
e" —g,(t|0 1 O}] +a0(t)| 0 3 -1|-ez0)|-2 8 -6]. (3.718) 
0.0 1 2 1 3 10 4 10 
Substituting for xı (t), z2(t) and x3(t) and simplifying, we finally obtain 
; eft p et — e? ogg gab et 
e^t = 5 e% — eft + Qte?* 2(t-- 1)e?!  e?t — et |; (3.7.19) 


ett — e2t + 2te?t 21e2t e^t p e2t 
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or 
x(t) = (ie + ie? — te") x1 (0) — te?'z(0) + (že — $e?) z3(0), (3.7.20) 


@(t) = (Ze — le“ + te?) z4 (0) + (t + 1)e”%zxa(0) + (Ze% — lett) x3(0), (3.7.21) 


x(t) = (4e — 1e?* + te?) 21 (0) + te**x2(0) + (1e** + de”) x3 (0). (3.7.22) 


e Example 3.7.2 


The matrix exponential can also be used to solve systems of first-order, nonhomoge- 
neous linear ordinary differential equations. To illustrate this, consider the following system 
of linear ordinary differential equations: 


x! = x — 4y + e”*, 3.7.23) 
and 
y =x+5y+t. 3.7.24) 
We can rewrite this system as 
x’ = Ax + b, 3.7.25) 
where 
|. [1 -4 cq | (a(t) 
4-(1 a bs and xe ; 3.7.26) 


We leave as an exercise the computation of the matrix exponential and find that 


3t 3t 3t 
At [€ — 2te —Ate 
€ = ( test e op, 21e3t ) y (3.7.27) 


Clearly the homogeneous solution is xy (t) = e4*C, where C is the arbitrary constant 
that is determined by the initial condition. But how do we find the particular solution, 
Xp(t)? Let x(t) = e^ty(t). Then 


x (t) = Ae^ty(t) + e^ty'(t), 3.7.28) 
Or 
x (t) = Ax,(t) + e^'y' (t). 3.7.29) 
Therefore, 
e"y (t) -b(t, or y(t)-—e “* b(t), 3.7.30) 


since (e^) ^ — e^^, Integrating both sides of Equation 3.7.29 and multiplying through by 
e^t, we find that 


x) f. et». as- f e^*b(t — s) ds. (3.7.31) 
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Returning to our original problem, 


t t 3E ^^ 3s EP 3s 2(t— s) 
As = = e 2se 4se e 
f e^*b(t — s) ds = n ( "n ga im) ( Bou ) ds (3.7.32) 


t 2t E s 3s 

O e” (e$ — 2se*) — As(t — s)e 

i f E +(t— s)e?5 4- 2s(t — s)e3s ds (3.7.33) 
2s eX 3e?t 2 test At = 

~ (3.7.34) 


28 ,3t | 2t 114.:3f 1 pb 
297€ te” + yte gt + oF 


The final answer consists of the homogeneous solution plus the particular solution. 


Problems 
Find e“ for the following matrices A: 

11 0 
tisli " asp 5) ASI 0. 3d 
0 0 1 

23 4 1 2 0 

4. A=|0 2 3 5. A=|0 1 2 

0 0 2 0 0 1 


For each of the following A’s and b’s, use the matrix exponential to find the general solution 
for the system of first-order, linear ordinary differential equations x’ = Ax + b: 
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Further Readings 


Bronson, R., and G. B. Costa, 2007: Linear Algebra: An Introduction. Academic Press, 
520 pp. Provides a step-by-step explanation of linear algebra. 


144 Advanced Engineering Mathematics with MATLAB 


Davis, H. T., and K. T. Thomson, 2000: Linear Algebra and Linear Operators in Engi- 
neering with Applications in Mathematica. Academic Press, 547 pp. Advanced textbook 
designed for first-year graduate students in the physical sciences and engineering. 


Hoffman, J., 2001: Numerical Methods for Engineers and Scientists. Mc-Graw Hill, 823 pp. 
A first course in numerical methods that is both lucid and in depth. 


Munakata, T., 1979: Matrices and Linear Programming with Applications. Holden-Day, 
469 pp. Provides the basic concepts with clarity. 


Noble, B., and J. W. Daniel, 1977: Applied Linear Algebra. Prentice Hall, 494 pp. Excellent 
conceptual explanations with well-motivated proofs. 


Strang, G., 2009: Linear Algebra and Its Applications. Academic Press, 584 pp. A good 
supplement to a formal text that provides intuitive understanding. 


Wilkinson, J. H., 1988: The Algebraic Eigenvalue Problem. Clarendon Press, 662 pp. The 
classic source book on the eigenvalue problem 


Chapter 4 


Vector Calculus 


Physicists invented vectors and vector operations to facilitate their mathematical ex- 
pression of such diverse topics as mechanics and electromagnetism. In this chapter we focus 
on multivariable differentiations and integrations of vector fields, such as the velocity of a 
fluid, where the vector field is solely a function of its position. 


4.1 REVIEW 


The physical sciences and engineering abound with vectors and scalars. Scalars are 
physical quantities that only possess magnitude. Examples include mass, temperature, 
density, and pressure. Vectors are physical quantities that possess both magnitude and 
direction. Examples include velocity, acceleration, and force. We shall denote vectors by 
boldfaced letters. 

Two vectors are equal if they have the same magnitude and direction. From the limitless 
number of possible vectors, two special cases are the zero vector 0, which has no magnitude 
and unspecified direction, and the unit vector, which has unit magnitude. 

The most convenient method for expressing a vector analytically is in terms of its 
components. A vector a in three-dimensional real space is any order triplet of real numbers 
(components) a1, a2, and aa such that a = aii + a»j + ask, where aji, a2j, and a3k are 
vectors that lie along the coordinate axes and have their origin at a common initial point. 
The magnitude, length, or norm of a vector a, |a|, equals \/a? + a2 + a3. A particularly 
important vector is the position vector, defined by r = xi + yj + zk. 

As in the case of scalars, certain arithmetic rules hold. Addition and subtraction are 
very similar to their scalar counterparts: 


a+b = (aj + b1)i + (a2 + b2)j + (a3 + b3)k, (4.1.1) 
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and 


a— b= (ai bi)i | (ag b3)j t (a3 b3)k. (4.1.2) 


In contrast to its scalar counterpart, there are two types of multiplication. The dot 
product is defined as 


a- b= Ja||b|cos(0) = a1bı + a202 + agbs, (4.1.3) 


where 0 is the angle between the vector such that 0 < 0 < m. The dot product yields a 
scalar answer. A particularly important case is a: b = 0 with |a| 4 0, and |b| Z 0. In this 
case the vectors are orthogonal (perpendicular) to each other. 

The other form of multiplication is the cross product, which is defined by a x b = 
|a||b| sin()n, where 0 is the angle between the vectors such that 0 < 0 < 7, and n is a unit 
vector perpendicular to the plane of a and b, with the direction given by the right-hand 
rule. A convenient method for computing the cross product from the scalar components of 
a and b is 


i j k 
ag a3 |s a1 a3 |.» ay ag 
axbc = z k. 4.1.4 
“a a Wr dal Tlo ba 3 Bp. b pu 
Bu d de 


Two nonzero vectors a and b are parallel if and only if a x b — 0. 

Most of the vectors that we will use are vector-valued functions. These functions are 
vectors that vary either with a single parametric variable t or multiple variables, say x, y, 
and z. 

'The most commonly encountered example of a vector-valued function that varies with 
a single independent variable involves the trajectory of particles. If a space curve is param- 
eterized by the equations x = f(t), y = g(t), and z = h(t) with a € t < b, the position 
vector r(t) = f(t)i-- g(t)j + A(t)k gives the location of a point P as it moves from its initial 
position to its final position. Furthermore, because the increment quotient Ar/At is in the 
direction of a secant line, then the limit of this quotient as At — 0, r'(t) gives the tangent 
(tangent vector) to the curve at P. 


e Example 4.1.1: Foucault pendulum 


One of the great experiments of mid-nineteenth-century physics was the demonstration 
by J. B. L. Foucault (1819-1868) in 1851 of the earth's rotation by designing a (spherical) 
pendulum, supported by a long wire, that essentially swings in an nonaccelerating coor- 
dinate system. This problem demonstrates many of the fundamental concepts of vector 
calculus. 

The total force! acting on the bob of the pendulum is F = T + mG, where T is 
the tension in the pendulum and G is the gravitational attraction per unit mass. Using 


Newton's second law, 
d?r T 
= +G 4.1.5 
dg TG, (4.1.5) 


inertial m 


where r is the position vector from a fixed point in an inertial coordinate system to the bob. 
This system is inconvenient because we live on a rotating coordinate system. Employing 


1 See Broxmeyer, C., 1960: Foucault pendulum effect in a Schuler-tuned system. J. Aerosp. Sci., 27, 
343-347. 
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the conventional geographic coordinate system,? Equation 4.1.5 becomes 


2 
UTE 9g E 


T 
dB ft 2x (xr) =—+G, (4.1.6) 


where €? is the angular rotation vector of the earth and r now denotes a position vector in 
the rotating reference system with its origin at the center of the earth and terminal point at 
the bob. If we define the gravity vector g = G — Q x (Q x r), then the dynamical equation 
is 

d?r r T 

EP PLE: (4.1.7) 
where the second term on the left side of Equation 4.1.7 is called the Coriolis force. 

Because the equation is linear, let us break the position vector r into two separate 

vectors: ro and ri, where r = ro + ri. The vector rg extends from the center of the earth 
to the pendulum's point of support, and r; extends from the support point to the bob. 
Because r is a constant in the geographic system, 


— +29 x —=—+g. (4.1.8) 
m 


If the length of the pendulum is L, then for small oscillations rı ~% xi + yj + Lk and 
the equations of motion are 


d^y ; dx Ty 
and d " 
y z 
20 —-g=— 4.1.11 
cos) Hg = © (41.11) 


where A denotes the latitude of the point and Q is the rotation rate of the earth. The 
relationships between the components of tension are T; = zT;/L, and Ty = yT,/L. From 
Equation 4.1.11, 
Te = 20 a) uo (4.1.12) 
+g= cos ao 1. 
Substituting the definitions of Ty, Ty, and Equation 4.1.12 into Equation 4.1.9 and Equation 
4.1.10, 


d'y g . dy 
dg = T + 2€ sin(A)-y = 0, (4.1.13) 
and 2 F 
Yy g ; T 
Y, NET T nfi 4.1.14 
"ORA, ML rr ( ) 
The approximate solution to these coupled differential equations is 
a(t) & Ag cos[Q sin(A)t] sin( g/L t) (4.1.15) 
and 
y(t) z Ag sin{Q sin(A)t] sin( g/L t) (4.1.16) 


? For the derivation, see Marion, J. B., 1965: Classical Dynamics of Particles and Systems. Academic 
Press, Sections 12.2-12.3. 
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Figure 4.1.1: For a two-dimensional field T(x, y), the gradient is a vector that is perpendicular to the 
isotherms T(x, y) = constant and points in the direction of most rapidly increasing temperatures. 


if Q? < g/L. Thus, we have a pendulum that swings with an angular frequency \/g/L. 
However, depending upon the latitude A, the direction in which the pendulum swings 
changes counterclockwise with time, completing a full cycle in 27/[Qsin(A)]. This result is 
most clearly seen when A = 7/2 and we are at the North Pole. There the earth is turning 
underneath the pendulum. If initially we set the pendulum swinging along the 0? longitude, 
the pendulum will shift with time to longitudes east of the Greenwich median. Eventually, 
after 24 hours, the process repeats itself. 


Consider now vector-valued functions that vary with several variables. A vector func- 
tion of position assigns a vector value for every value of x, y, and z within some domain. 
Examples include the velocity field of a fluid at a given instant: 


v = u(z, y, z)i + v(z,y, 2) + w(z, y, 2k. (4.1.17) 


Another example arises in electromagnetism where electric and magnetic fields often vary 
as a function of the space coordinates. For us, however, probably the most useful example 
involves the vector differential operator, del or nabla, 


V= E + 5j + E (4.1.18) 
which we apply to the multivariable differentiable scalar function F(x,y,z) to give the 
gradient VF. 

An important geometric interpretation of the gradient-one which we shall use frequent- 
ly-is the fact that V f is perpendicular (normal) to the level surface at a given point P. To 
prove this, let the equation F(r,y,z) = c describe a three-dimensional surface. If the 
differentiable functions x — f(t), y — g(t), and z — h(t) are the parametric equations of a 
curve on the surface, then the derivative of F[f(t), g(t), h(t)] = c is 


DH ORO SIC e d (4.1.19) 
Ox dt Oy dt Ozdt ^" i 


Or 
VF.r' =0. (4.1.20) 


When r’ # 0, the vector VF is orthogonal to the tangent vector. Because our argument 
holds for any differentiable curve that passes through the arbitrary point (x, y, z), then VF 
is normal to the level surface at that point. 
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Figure 4.1.1 gives a common application of the gradient. Consider a two-dimensional 
temperature field T(z, y). The level curves T(x, y) = constant are lines that connect points 
where the temperature is the same (isotherms). The gradient in this case VT' is a vector that 
is perpendicular or normal to these isotherms and points in the direction of most rapidly 
increasing temperature. 


e Example 4.1.2 


Let us find the gradient of the function f(x,y,z) = z?2? sin(4y). 
Using the definition of gradient, 
Vf = O[a?z? sin(4y)] i 8[z?2? sin(4y)] " Ox? z? sin(4y)] 
Ox Oy Oz 
= 2x2? sin(4y)i + 4x?2? cos(4y)j + 2x°z sin(4y)k. (4.1.22) 


k (4.1.21) 


e Example 4.1.3 


Let us find the unit normal to the unit sphere at any arbitrary point (2, y, z). 
The surface of a unit sphere is defined by the equation f(x,y,z) = z? +y? +2? — 1. 
Therefore, the normal is given by the gradient 


N = Vf = 2xi+ 2yj + 2zk, (4.1.23) 


and the unit normal 
V. 2xi + 2yj + 2zk 
E NUR xa, DLL ME ER TS (4.1.24) 


—dVf| 4/4? + Ay? + Az? 
y 


because x? + y? +27 — 1. 


e Example 4.1.4 


In Figure 4.1.2, MATLAB has been used to illustrate the unit normal of the surface 
z—4-—a?-—3?^. Here f(x,y,z) = z +x? +y? = 4 so that Vf = 2ri + 2yj +k. The 
corresponding script is: 
clear ^ clear variables 
clf ^ clear figures 
[x,y] = meshgrid(-2:0.5:2); % create the grid 
z=4-x.°2 - y.^2; % compute surface within domain 
^ compute the gradient of f(x,y,z) = Zz + x^°2 + y°2=4 
% the x, y, and z components are u, v, and w 
u = 2*x; v = 2*y; w= 1; 
^ find magnitude of gradient at each point 
magnitude = sqrt(u.*u + v.*v + w.*w); 
^ compute unit gradient vector 
u = u./magnitude; v = v./magnitude; w = w./magnitude; 
mesh(x,y,z) % plot the surface 
axis square 
xlabel(’x’); ylabel(’y’) 
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Figure 4.1.2: MATLAB plot of the function z = 4 — z? — y?. The arrows give the unit normal to this 
surface. 


hold on 
^ plot the unit gradient vector 
quiver3(x,y,z,u,v,w,0) 

This figure clearly shows that gradient gives a vector which is perpendicular to the 
surface. 


A popular method for visualizing a vector field F is to draw space curves that are 
tangent to the vector field at each x, y, z. In fluid mechanics these lines are called streamlines 
while in physics they are generally called lines of force or flux lines for an electric, magnetic, 
or gravitational field. For a fluid with a velocity field that does not vary with time, the 
streamlines give the paths along which small parcels of the fluid move. 

To find the streamlines of a given vector field F with components P(x,y, z), Q(x, y, z), 
and R(r,y,z), we assume that we can parameterize the streamlines in the form r(t) — 
x(t)i + y(t)j + z(t)k. Then the tangent line is r'(t) = z'(t)i + y'(t)j + z'(t)k. Because the 
streamline must be parallel to the vector field at any t, r'(t) = AF, or 


dx dy _ dz — 
dt = AP(z,y, z), dt = AQ(z, y, 2), and dt EE AR(x, y, z), (4.1.25) 
E d. d d 
a ANE E (4.1.26) 


P(z,y,z) Q@,y,z) R(a,y,2)' 


The solution of this system of differential equations yields the streamlines. 
e Example 4.1.5 


Let us find the streamlines for the vector field F = sec(r)i — cot(y)j + k that passes 
through the point (7/4, 7,1). In this particular example, F represents a measured or com- 
puted fluid’s velocity at a particular instant. 

From Equation 4.1.26, 


= =, (4.1.27) 
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'This yields two differential equations: 


cos(x) dz = -2o dy, and dz = -2 d (4.1.28) 
Integrating these equations gives 
sin(x) = ln | cos(y)| + cı, and z=In|cos(y)| + c2. (4.1.29) 
Substituting for the given point, we finally have that 
sin(x) = In|cos(y)| + V2/2, and z=In|cos(y)| +1. (4.1.30) 


e Example 4.1.6 


Let us find the streamlines for the vector field F = sin(z)j + e"k that passes through 
the point (2,0,0). 
From Equation 4.1.26, 


0 sin(z) e St 
This yields two differential equations: 
dz = 0, and sin(z) dz = e? dy. (4.1.32) 
Integrating these equations gives 
L=Ci, and e” = — cos(z) + c2. (4.1.33) 
Substituting for the given point, we finally have that 
eS 2, and e" = 2 — cos(z). (4.1.34) 


Note that Equation 4.1.34 only applies for a certain strip in the yz-plane. 
Problems 

Given the following vectors a and b, verify that a- (a x b) = 0, and b- (a x b) = 0: 

l.a=4i-—2j+5k, b=3i+j—k 2,.a=i-3j+k, b=2i+4k 

3.a=i+jt+k, b= —5i + 2j+3k 4. a= 8i +j- 6k, b=i-—2j+ 10k 

5. a= 2i+7j— 4k, b=i+j-k. 

6. Prove a x (b x c) = (a: c)b — (a: b)c. 


7. Prove a x (bx c) + b x (cx a) - c x (ax b) = 0. 
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Find the gradient of the following functions: 

8. f(x,y,z) = zy? /z23 9. f(x,y,z) = xy cos(yz) 10. f(z, y, z) = In(z? + y? + 27) 
11. f(x,y,z) = z?y? (2z + 1)? 12. f(x,y,z) 222 — y? + 2. 

Use MATLAB to illustrate the following surfaces as well as the unit normal. 

13.2—3 14. 22 --y? —4 15. 2 = 22 +y? 16. 2 = /2? +y? 

I fee 18.¢+y+z=1 19. z m, 

Find the streamlines for the following vector fields that pass through the specified point: 
20. F=i+j+k; (0,1,1) 21. F = 2i — y?j + zk; (1,1,1) 

22. F = 327i — y?j + z?k; (2,1,3) 23. F = z?i y?j — zk; (1,1,1) 

24. F = (1/x)i + e”j — k; (2,0,4). 


25. Solve the differential equations, Equation 4.1.13 and Equation 4.1.14 with the initial 
conditions z(0) = y(0) = y'(0) = 0, and z'(0) = Ao4/g/L assuming that 2? « g/L. 


26. If a fluid is bounded by a fixed surface f(x,y,z) = c, show that the fluid must satisfy 
the boundary condition v - V f = 0, where v is the velocity of the fluid. 


27. A sphere of radius a is moving in a fluid with the constant velocity u. Show that the 
fluid satisfies the boundary condition (v — u) - (r — ut) = 0 at the surface of the sphere, if 
the center of the sphere coincides with the origin at t = 0 and v denotes the velocity of the 
fluid. 


4.2 DIVERGENCE AND CURL 


Consider a vector field v defined in some region of three-dimensional space. The func- 
tion v(r) can be resolved into components along the i, j, and k directions, or 


v(r) = u(z, y, zu + v(z, y, z)) + w(z, y, z)k. (4.2.1) 


If v is a fluid’s velocity field, then we can compute the flow rate through a small (differential) 
rectangular box defined by increments (Az, Ay, Az) centered at the point (x,y,z). See 
Figure 4.2.1. The flow out from the box through the face with the outwardly pointing 
normal n — —j is 

v: (—j) = —v(z;y — A^y/2, z) Ax Az, (4.2.2) 


and the flow through the face with the outwardly pointing normal n — j is 
v-j=v(a,y + Ay/2, z) AxAz. (4.2.3) 
The net flow through the two faces is 


[v(z, y + Ay/2, z) — v(x, y — Ay/2, z)) Ax Az © vy (m, y, z) Ax Ay Az. (4.2.4) 
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uta ax 


Figure 4.2.1: Divergence of a vector function v(x, y, z). 


A similar analysis of the other faces and combination of the results give the approximate 
total flow from the box as 


[us (a, y, 2) + vun y, 2) + wa (x,y, z)] AAA. (4.2.5) 


Dividing by the volume AzAyAz and taking the limit as the dimensions of the box tend 
to zero yield uz + v, + wz as the flow out from (x,y,z) per unit volume per unit time. This 
scalar quantity is called the divergence of the vector v: 


div(v) =V-v= (Fi + = + ox) : (ui + vj + wk) = uz + vy + wz. (4.2.6) 
Thus, if the divergence is positive, either the fluid is expanding and its density at the point 
is falling with time, or the point is a source at which fluid is entering the field. When the 
divergence is negative, either the fluid is contracting and its density is rising at the point, 
or the point is a negative source or sink at which fluid is leaving the field. 

If the divergence of a vector field is zero everywhere within a domain, then the flux 
entering any element of space exactly balances that leaving it and the vector field is called 
nondivergent or solenoidal (from a Greek word meaning a tube). For a fluid, if there are no 
sources or sinks, then its density cannot change. 

Some useful properties of the divergence operator are 


V-(F+G)=V-F+V-G, (4.2.7) 
V- (pF) =9V-F+F-Vo (4.2.8) 

and 
V79p=V-Ve= Gar + Quy + zz- (4.2.9) 


Equation 4.2.9 is very important in physics and is given the special name of the Laplacian.? 


e Example 4.2.1 


If F = z?zi— 2y?z?j + zy?zk, compute the divergence of F. 


p ĉe, 2 3,2), 9 (2, _ 62.2 2 
V-F= az?) 4 ay | 2y2^) + az (ay z) = Qaz — 6y?2* + ay’. (4.2.10) 


3 Some mathematicians write A instead of V?. 
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— 4 v 
a+ El A 


——- 
no divergence divergence no divergence divergence 
no curl no curl curl and curl 


Figure 4.2.2: Examples of vector fields with and without divergence and curl. 


e Example 4.2.2 


If r = zi + yj + zk, show that r/|r|? is nondivergent. 


v.(.E \= o x 2 o y 
Ir3/ Ox | (a2 +y? + 22)38/2| © Oy | (x? + y? + 22)272 
ju f Z | (4.2.11) 


Oz x? + y? + 22)3/2 


- : Les chu E 4.2.12 
= (£2 + y? + 22)3/2 (x2 + y2 + z2)5/2 =U. (4.2.12) 


Another important vector function involving the vector field v is the curl of v, written 
curl(v) or rot(v) in some older textbooks. In fluid flow problems it is proportional to the 
instantaneous angular velocity of a fluid element. In rectangular coordinates, 


curl(v) = V x v = (wy — v;)i + (uz — We)j + (vs — uy)k, (4.2.13) 


where v = ui+ vj + wk as before. However, it is best remembered in the mnemonic form: 


ijk 
VxF=|2 E Z| = (wy — vz)it (uz — w;)j + (v; — uy)k. (4.2.14) 
u v w 


If the curl of a vector field is zero everywhere within a region, then the field is irrotational. 

Figure 4.2.2 illustrates graphically some vector fields that do and do not possess diver- 
gence and curl. Let the vectors that are illustrated represent the motion of fluid particles. 
In the case of divergence only, fluid is streaming from the point, at which the density is 
falling. Alternatively the point could be a source. In the case where there is only curl, 
the fluid rotates about the point and the fluid is incompressible. Finally, the point that 
possesses both divergence and curl is a compressible fluid with rotation. 

Some useful computational formulas exist for both the divergence and curl operations: 


Vx(F+G)=VxF+VxG, (4.2.15) 


V x Vo — 0, (4.2.16) 
V-VxF=0, (4.2.17) 
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V x (pF) = yV x F+V¢ xF, 4.2.18) 
V(F-G)=(F-V)G+(G-V)F+Fx (V x G)+Gx (Vx F), 4.2.19) 
V x (Fx G) = (G - V)F — (F - V)G + F(V - G) - G(V - F), 4.2.20) 
Vx(VxF)-2V(V-.F)-(V-V)E, 4.2.21) 

nd 
i V.(FxG)-G.VxF-F.VxG. 4.2.22) 


In this book the operation VF is undefined. 
e Example 4.2.3 


If F = z2?i — 22?yzj + 2yz4k, compute the curl of F and verify that V. V x F = 0. 
From the definition of curl, 


i j k 
VxF=|2 5 2 (4.2.23) 
zz? —2az?yz 2yz4 
= [$ Cu) - & C222) i- D Qu) - £ (e2) 
+ E (—22?y2) - & («29)] k (4.2.24) 
= (22* + 2a7y)i — (0 — 3z2°)j + (—4xyz — 0)k (4.2.25) 
= (22° + 2zx^y)i + 322^j — Axyzk. 4.2.26 
22^ + 23?y)i + 3222j — 4ryzk 


From the definition of divergence and Equation 4.2.26, 


ð ð ð 
V.VxF- a, Q7 + 2z°y) + 9, 927) Fa 42y) = 4ry +0— doy =0. (4.2.27) 


e Example 4.2.4: Potential flow theory 


One of the topics in most elementary fluid mechanics courses is the study of irrotational 
and nondivergent fluid flows. Because the fluid is irrotational, the velocity vector field v 
satisfies V x v = 0. From Equation 4.2.16 we can introduce a potential y such that 
v = Vy. Because the flow field is nondivergent, V. v = V2y = 0. Thus, the fluid flow 
can be completely described in terms of solutions to Laplace’s equation. This area of fluid 
mechanics is called potential flow theory. 


Problems 
Compute V-F,V x E, V. (V x F), and V(V - F), for the following vector fields: 


1. F = z?zi + yz?j + xy?k 2. F = Az?y?i + (2x + 2yz)j + (3z + y?)k 


3. F = (x — y)’ i + e 7j + zze?"k 4. F = 3zyi + 2x2?j + Pk 


5. F = 5yzi + z?zj + 32°k 6. F = y?i + (z?y? — zy)j — (x?yz — xzz)k 
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7. F = ze i yz?j 3e ^k 8. F = yIn(z)i + (2 — 3yz)j + zyz?k 
9. F = zyzi-ra?yze^j + zye^k 10. F = (zy? — z*)i + 4aty?zj — y*z?k 
11. F = zy?i4- zyz?j + zy cos(z)k 12. F = zy?i + zyz?j + zy sin(z)k 


13. F = xy?i + xyzj + zy cos(z)k 
14. (a) Assuming continuity of all partial derivatives, show that 
V x(VxF)2 V(V.F)- V?F. 
(b) Using F = 3xyi + 4yzj + 2zk, verify the results in part (a). 


15. If E = E(z, y, z,t) and B = B(z, y, z,t) represent the electric and magnetic fields in a 
vacuum, Maxwell’s field equations are: 


1 
V-E=0, Vemma” 
c Ot 
10E 
-B= VxB=-— 
V 0, x cr 


where c is the speed of light. Using the results from Problem 14, show that E and B satisfy 


1 E 1 B 
VIE- VB = 
im c2 0t" ane BS c2 Ot” 


16. If f and g are continuously differentiable scalar fields, show that V f x Vg is solenoidal. 
Hint: Show that Vf x Vg = V x (fVg). 


17. An inviscid (frictionless) fluid in equilibrium obeys the relationship Vp = pF, where 
p denotes the density of the fluid, p denotes the pressure, and F denotes the body forces 
(such as gravity). Show that F.V x F = 0. 


4.3 LINE INTEGRALS 


Line integrals are ubiquitous in physics. In mechanics they are used to compute work. 
In electricity and magnetism, they provide simple methods for computing the electric and 
magnetic fields for simple geometries. 

The line integral most frequently encountered is an oriented one in which the path C 
is directed and the integrand is the dot product between the vector function F(r) and the 
tangent of the path dr. It is usually written in the economical form 


[Pam | Pe ane Qi y dy + Roa 2) dz, (43.1) 
C C 


where F = P(x,y, z)i+ Q(z, y, z)) + R(x, y, z)k. If the starting and terminal points are the 
same so that the contour is closed, then this closed contour integral will be denoted by fo- 
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oo 


Figure 4.3.1: Diagram for the line integration in Example 4.3.1. 


In the following examples we show how to evaluate the line integrals along various types of 
curves. 


e Example 4.3.1 


If F = (3x? + 6y)i — 14yzj + 20z?k, let us evaluate the line integral Jo F - dr along 
the parametric curves x(t) = t, y(t) = t?, and z(t) = t? from the point (0,0,0) to (1,1,1). 
Using the MATLAB commands 
>> clear 
>> t = 0:0.02:1 
>> stem3(t,t.^2,t.^3); xlabel(’x’,’Fontsize’ ,20); 

ylabel(’y’,’Fontsize’,20); zlabel(’z’,’Fontsize’ ,20); 
we illustrate these parametric curves in Figure 4.3.1. 

We begin by finding the values of t, which give the corresponding endpoints. A quick 
check shows that t = 0 gives (0,0,0) while ¢ = 1 yields (1,1,1). It should be noted that 
the same value of t must give the correct coordinates in each direction. Failure to do so 
suggests an error in the parameterization. Therefore, 


f F- dr = n + 6t?) dt — 14t? (t°) d(t?) + 20t(9)?d(t3) (4.3.2) 
C 0 


1 
= n 9t? dt — 284° dt + 60t? dt = (306 — 4t + 669) |) = 5. (4.3.3) 
0 


e Example 4.3.2 


Let us redo the previous example with a contour that consists of three “dog legs,” 
namely straight lines from (0,0,0) to (1,0,0), from (1,0,0) to (1, 1,0), and from (1, 1,0) to 
(1, 1, 1). See Figure 4.3.2. 

In this particular problem we break the integration down into integrals along each of 


the legs: 
[ra Par. Fe. | F - dr. (4.3.4) 
C Ci C2 C3 
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Figure 4.3.2: Diagram for the line integration in Example 4.3.2. 


For Cj, y = z = dy = dz = 0, and 
1 1 
i Pedr = | (32? +6-0)dr—14-0-0-0+202-07-0= f 3z?dr —1. (4.3.5) 
C; 0 0 
For C5, x = 1 and z = dx = dz = 0, so that 
I 
| F.dr- f (3-1? + 6y)-0~1dy-0-dy +20-1-0?-0=0. (4.3.6) 
C2 0 
For C3, x = y = 1 and dx = dy = 0, so that 
1 1 
/ F-dr= f (3-1+6-1)-0-14-1-2-0420-1-2de= | 2027 dz = Æ. (4.3.7) 
C3 0 0 


Therefore, 
n dr = 23, (4.3.8) 


e Example 4.3.3 


For our third calculation, we redo the first example where the contour is a straight line. 
The parameterization in this case is x = y = z = t with 0 < t € 1. See Figure 4.3.3. Then, 


f F- dr = n + 6t) dt — 14(t) (t) dt + 20t(t)? dt (4.3.9) 
C 0 


1 
= I (3t? + 6t — 14? + 208%) dt = 2. (4.3.10) 
0 


An interesting aspect of these three examples is that, although we used a common 
vector field and moved from (0, 0,0) to (1,1, 1) in each case, we obtained a different answer 
in each case. Thus, for this vector field, the line integral is path dependent. This is generally 
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Figure 4.3.3: Diagram for the line integration in Example 4.3.3. 


true. In the next section we will meet conservative vector fields where the results will be 
path independent. 


e Example 4.3.4 
If F = (x? + y?)i — 2ayj + zk, let us evaluate fo F -dr if the contour is that portion of 
the circle z? + y? = a? from the point (a,0,3) to (—a,0,3). See Figure 4.3.4. 


The parametric equations for this example are x = acos(0), dx = —a sin(0) d0, y = 
asin(0), dy = acos(0) d0, z = 3, and dz = 0 with 0 < 0 € m. Therefore, 


f Paka | "e? cos? (8) + a? sin?(0)]|-a sin(6) d£] 
C 0 


— 2a? cos(0) sin(0)[a cos(0) d6] + a cos(0) - 0 (4.3.11) 
= -a f sin(0) dO — 2a f cos? (0) sin(0) d0 (4.3.12) 
0 0 
= a? cos(0)|; + 2a? cos*(6)|> = —2a? — $a? = — 3Qa?. (4.3.13) 


e Example 4.3.5: Circulation 


Let v(x,y,z) denote the velocity at the point (x,y,z) in a moving fluid. If it varies 
with time, this is the velocity at a particular instant of time. The integral fc v: dr around a 
closed path C is called the circulation around that path. The average component of velocity 
along the path is 


"E $o v. ds u fov: dr 


Ds (4.3.14) 
S S 


where s is the total length of the path. The circulation is thus fo v -dr = Uss, the product 
of the length of the path and the average velocity along the path. When the circulation is 
positive, the flow is more in the direction of integration than opposite to it. Circulation is 
thus an indication and to some extent a measure of motion around the path. 
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X 


Figure 4.3.4: Diagram for the line integration in Example 4.3.4. 
Problems 
Evaluate f. c E: dr for the following vector fields and curves: 


1. F = ysin(nz)i + z?e"j + 3vzk and C is the curve x = t, y = t?, and z = t? from (0,0,0) 
to (1,1, 1). Use MATLAB to illustrate the parametric curves. 


2. F = yi + zj + zk and C consists of the line segments from (0,0,0) to (2,3,0), and from 
(2, 3,0) to (2,3, 4). Use MATLAB to illustrate the parametric curves. 


3. F = e*i + ze*Vj + zye*"*k and C is the curve z = t, y = t?, and z = t? with 0 < t < 2. 
Use MATLAB to illustrate the parametric curves. 


4. F = yzi + zzj + zyk and C is the curve z = t, y = t?, and z = t with 1 < t < 2. Use 
MATLAB to illustrate the parametric curves. 


5. F = yi — zj + 3ryk and C consists of the semicircle x? + y? = 4, z = 0, y > 0, and the 
line segment from (—2,0,0) to (2,0,0). Use MATLAB to illustrate the parametric curves. 


6. F = (x + 2y)i + (6y — 2x)j and C consists of the sides of the triangle with vertices 
at (0,0,0), (1,1, 1), and (1, 1,0). Proceed from (0,0,0) to (1, 1, 1) to (1, 1,0) and back to 
(0,0,0). Use MATLAB to illustrate the parametric curves. 


7. F = 2xzi+4y?j+27k and C is taken counterclockwise around the ellipse z?/44-y?/9 = 1, 
z= 1. Use MATLAB to illustrate the parametric curves. 


8. F = 2ri + yj + zk and C is the contour x = t, y = sin(t), and z = cos(t) + sin(t) with 
0 € t € 2x. Use MATLAB to illustrate the parametric curves. 


9. F = (2y? + z)i + 4xyj + xk and C is the spiral x = cos(t), y = sin(t), and z = t 
with 0 € t € 2r between the points (1,0,0) and (1,0,27). Use MATLAB to illustrate the 
parametric curves. 


10. F = x7i+ y?j + (2? + 2zy)k and C consists of the edges of the triangle with vertices 
t (0,0,0), (1, 1,0), and (0,1,0). Proceed from (0,0,0) to (1, 1,0) to (0, 1,0) and back to 
(0,0,0). Use MATLAB to illustrate the parametric curves. 
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4.4 THE POTENTIAL FUNCTION 


In Section 4.2 we showed that the curl operation applied to a gradient produces the 
zero vector: V x Vy = 0. Consequently, if we have a vector field F such that V x F = 0 
everywhere, then that vector field is called a conservative field and we can compute a 
potential y such that F = Vy. 


e Example 4.4.1 


Let us show that the vector field F = ye?" cos(z)i + xe*" cos(z)j — e*" sin(z)k is con- 
servative and then find the corresponding potential function. 
To show that the field is conservative, we compute the curl of F or 


i j k 
a a a 
VxF= En Em Dz = 0. (4.4.1) 
ye?” cos(z) xe""cos(z) —e*Y sin(z) 


To find the potential we must solve three partial differential equations: 


Px = ye"? cos(z) = F - i, (4.4.2) 

Py = ze^? cos(z) = F -j, (4.4.3) 
and 

pz = —e"?sin(z) = F-k. (4.4.4) 


We begin by integrating any one of these three equations. Choosing Equation 4.4.2, 
p(x, y, z) = e"? cos(z) + fly, z). (4.4.5) 
To find f(y, z) we differentiate Equation 4.4.5 with respect to y and find that 
Py = 1x6" cos(z) + f(y, 2) = xe" cos(z) (4.4.6) 
from Equation 4.4.3. Thus, f, = 0 and f(y, z) can only be a function of z, say g(z). Then, 
p(x, y, z) = e™ cos(z) + g(z). (4.4.7) 


Finally, 
p: = —e"" sin(z) + g'(z) = —e*" sin(z) (4.4.8) 


from Equation 4.4.4 and g'(z) = 0. Therefore, the potential is 


p(x, y, z) = e7” cos(z) + constant. (4.4.9) 


Potentials can be very useful in computing line integrals, because 


] Fo | eode etae aso | dp = (8) - e) (4.4.10) 
C C C 
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where the point B is the terminal point of the integration while the point A is the starting 
point. T'hus, any path integration between any two points is path independent. 
Finally, if we close the path so that A and B coincide, then 


$ F-dr=0. (4.4.11) 
C 


It should be noted that the converse is not true. Just because fo E -dr = 0, we do not 
necessarily have a conservative field F. 

In summary then, an irrotational vector in a given region has three fundamental prop- 
erties: (1) its integral around every simply connected circuit is zero, (2) its curl equals zero, 
and (3) it is the gradient of a scalar function. For continuously differentiable vectors, these 
properties are equivalent. For vectors that are only piece-wise differentiable, this is not 
true. Generally the first property is the most fundamental and is taken as the definition of 
irrotationality. 


e Example 4.4.2 
Using the potential found in Example 4.4.1, let us find the value of the line integral 


Jo E- dr from the point (0,0,0) to (—1,2, 7). 
From Equation 4.4.9, 


(—1,2,7) 
f F - dr = [e™” cos(z) + constant] =-1- e°. (4.4.12) 
c (0,0,0) 


Problems 


Verify that the following vector fields are conservative and then find the corresponding 
potential: 


1.F = 2zyi + (x? + 2yz)j+ (y? + 4)k 2.F = (2x + 2ze?*)i + (2y — 1)j + e?7k 


3.F = yzi + xzzj + xyk 4.F = 2zi + 3y?j + 42°k 


5.F = [2x sin(y) + e??]i + x? cos(y)j + (31€?? +4)k 6.F = (2x + 5)i + 3y?j + (1/z)k 
7.F = e”i + 3y?j + 2re?”k 8.F = yi + (x + z)j + yk 

9.F = (z + y)i + zj + zk. 

4.5 SURFACE INTEGRALS 


Surface integrals appear in such diverse fields as electromagnetism and fluid mechanics. 
For example, if we were oceanographers we might be interested in the rate of volume of 
seawater through an instrument that has the curved surface S. The volume rate equals 
[fs v : ndo, where v is the velocity and ndo is an infinitesimally small element on the 
surface of the instrument. The surface element n do must have an orientation (given by n) 
because it makes a considerable difference whether the flow is directly through the surface 
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Figure 4.5.1: Diagram for the surface integration in Example 4.5.1. 


or at right angles. In the special case when the surface encloses a three-dimensional volume, 
then we have a closed surface integral. 

To illustrate the concept of computing a surface integral, we will do three examples 
with simple geometries. Later we will show how to use surface coordinates to do more 
complicated geometries. 


e Example 4.5.1 


Let us find the flux out the top of a unit cube if the vector field is F = zi + yj + zk. 
See Figure 4.5.1. 

The top of a unit cube consists of the surface z = 1 with 0 € x < 1 and 0 € y € 1. By 
inspection the unit normal to this surface is n = k, or n = —k. Because we are interested 
in the flux out of the unit cube, n — k, and 


1opl 
J| 5:22 - | | (xi + yj + k): kdzdy — 1, (4.5.1) 
S o Jo 


because z — 1. 


e Example 4.5.2 


Let us find the flux out of that portion of the cylinder y? + z? = 4 in the first octant 
bounded by z = 0, x = 3, y = 0, and z = 0. The vector field is F = zi + 27j + yk. See 
Figure 4.5.2. 

Because we are dealing with a cylinder, cylindrical coordinates are appropriate. Let 
y = 2cos(0), z = 2sin(0), and x = x with 0 < 0 € 7/2. To find n, we use the gradient in 
conjunction with the definition of the surface of the cylinder f(x,y,z) = y? 4- 2? = 4. Then, 


2yj + 22k 
YN BIA ee (4.5.2) 


(VEL \/4y? + Az? E 9 


because y? + 2? = 4 along the surface. Since we want the flux out of the surface, then 
n = yj/2 + zk/2, whereas the flux into the surface would require n = —yj/2 — zk/2. 
Therefore, 


n 


3 
F- n = (zi + 2zj + yk) - (5 + Zk) - T = 6 cos(0) sin(0). (4.5.3) 
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Figure 4.5.2: Diagram for the surface integration in Example 4.5.2. 


What is do? Our infinitesimal surface area has a side in the x direction of length dx and 
a side in the 0 direction of length 2 d0 because the radius equals 2. Therefore, do = 2 dx dé. 
Bringing all of these elements together, 


3 pr/2 3 — 3 
J| 5:22 f n I2cos(0) sin(£) d9 dz = 6 | |sim*(6)|; | a6 | dx = 18. 
S 0 0 0 0 
(4.5.4) 


As counterpoint to this example, let us find the flux out of the pie-shaped surface at 
x = 3. In this case, y = rcos(0), z = rsin(0), and 


7/2 p2 T/2 p2 
J| F2 - | n [3i + 2r sin(0)j + r cos(0)k] -irdrad — 3 | | r dr d0 = 3r. 
S 0 0 0 0 
(4.5.5) 


m 


e Example 4.5.3 


Let us find the flux of the vector field F = y?i + z?j + 5zk out of the hemispheric 
surface x? + y? + 2? = a?, z > 0. See Figure 4.5.3. 

We begin by finding the outwardly pointing normal. Because the surface is defined by 
f(z,y,z) 9a? +y? +27 =e’, 


2xi + 2yj + 2zk 
Vf _ 2xi+ 2yj + 2z ae LEN (4.5.6) 
a 


IVf] 4r? +4y2 +422 a a 
because x? + y? +27 = a?. This is also the outwardly pointing normal since n = r/a, where 
r is the radial vector. 

Using spherical coordinates, x = acos(y) sin(@), y = asin(q)sin(0), and z = acos(0), 
where y is the angle made by the projection of the point onto the equatorial plane, measured 
from the x-axis, and 0 is the colatitude or “cone angle" measured from the z-axis. To 
compute do, the infinitesimal length in the 0 direction is a d while in the q direction it 
is a sin(@) dy, where the sin(0) factor takes into account the convergence of the meridians. 
Therefore, do = a? sin(0) d0 dy, and 


m eee C eee 
F -ndo = (yi + z?j + 5zk) (“i+ =j+ =k) a^ sin(0) d0 dip (4.5.7) 
s o Jo a qa a 


n 
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Figure 4.5.3: Diagram for the surface integration in Example 4.5.3. 


2r pw /2 2 2 2 
J| 3 n= f fi (= pu E )e^sin6) dd do (4.5.8) 
S 0 0 a a a 


-m/2 pn 
= if ‘| [a* cos(y) sin? (p) sint (0) 
+ a* cos? (p) sin(y) sin*(8) + 5a? cos? (0) sin(@)| d dO — (4.5.9) 


n/2 a^ 2n at Qn 
= J B sin?(o)| sin'(8) — — cos?(u)  sin^(0) 
o 13 0 3 0 
+ 5a? cos? (8) sino | dé (4.5.10) 
n/2 1 3 7/2 1 3 
= 107a? f cos? (0) sin(0) d0 = — an cos? (8) = an : (4.5.11) 
0 0 


Although these techniques apply for simple geometries such as a cylinder or sphere, we 
would like a general method for treating any arbitrary surface. We begin by noting that 
a surface is an aggregate of points whose coordinates are functions of two variables. For 
example, in the previous example, the surface was described by the coordinates y and 60. 
Let us denote these surface coordinates in general by u and v. Consequently, on any surface 
we can reexpress x, y, and z in terms of u and v: x = z(u,v), y = y(u,v), and z = z(u,v). 

Next, we must find an infinitesimal element of area. The position vector to the surface 
is r = z(u, vi + y(u,v)j + z(u, v)k. Therefore, the tangent vectors along v = constant, ry, 
and along u = constant, ry, equal 


ry = Zul + Yuj + zuk, (4.5.12) 


and 


ry = Tyi + yuj + 2k. (4.5.13) 


Consequently, the sides of the infinitesimal area are r, du and r, dv. Therefore, the vectorial 
area of the parallelogram that these vectors form is 


ndo = r, X ry du dv (4.5.14) 
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* (2,00) 


Figure 4.5.4: Diagram for the surface integration in Example 4.5.4. 


and is called the vector element of area on the surface. Thus, we may convert F - n do into 
an expression involving only u and v and then evaluate the surface integral by integrating 
over the appropriate domain in the uv-plane. Of course, we are in trouble if r, x r, = 0. 
Therefore, we only treat regular points where r, x r, Z 0. In the next few examples, we 
show how to use these surface coordinates to evaluate surface integrals. 


e Example 4.5.4 
Let us find the flux of the vector field F = xi + yj + zk through the top of the plane 


3x + 2y + z = 6, which lies in the first octant. See Figure 4.5.4. 
Our parametric equations are z = u, y = v, and z = 6 — 3u — 2v. Therefore, 


r = ui + vj (6 — 3u — 2v)k, (4.5.15) 
so that 
ry, =i— 3k, ry, =j-— 2k, (4.5.16) 
and 
ru X ry = 314+ 2j +k. (4.5.17) 


Bring all of these elements together, 
2 p3—3u/2 2 p3—3u/2 
J| 5:22 - | | (3u +20 +6 — 3u — 20) dudu — 6 | | dv du (4.5.18) 
S o Jo 0 Jo 
2 
- 6 | (3 — 3u/2) du = 6 (3u — łu?) |; = 18. (4.5.19) 
0 


To set up the limits of integration, we note that the area in u,v space corresponds to the 
rzy-plane. On the zy-plane, z = 0 and 3u + 2v = 6, along with boundaries u = v = 0. 


e Example 4.5.5 


Let us find the flux of the vector field F = xi + yj + zk through the top of the surface 
z = ry + 1, which covers the square 0 < x < 1,0 < y € 1 in the xy-plane. See Figure 4.5.5. 

Our parametric equations are x = u, y = v, and z = w + 1 with 0 € u < 1 and 
0x v1. Therefore, 


r= ui vj t (uv + 1)k, (4.5.20) 
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Figure 4.5.5: Diagram for the surface integration in Example 4.5.5. 
so that 
ru —i-d vk, ry =j + uk, (4.5.21) 


and 
ry X Ty = —vi — uj + k. (4.5.22) 


Bring all of these elements together, 


J| E» E jJ ende (is » det nd Codd (4.5.23) 
-f f (1 — uv) du dv -f (u — 1u*v) |, dv (4.5.24) 
2 [ (1— àv) dv = (v - 4v?) | = 3. (4.5.25) 


e Example 4.5.6 


Let us find the flux of the vector field F = Azzi + zyz?j + 3zk through the exterior 
surface of the cone z? = x? + y? above the zy-plane and below z = 4. See Figure 4.5.6. 
A natural choice for the surface coordinates is polar coordinates r and 0. Because 
x —rcos(0) and y = rsin(0), z =r. Then, 
r = r cos(0)i + r sin(0)j + rk (4.5.26) 
with 0 € r < 4 and 0 € 0 < 27 so that 
r, = cos(0)i + sin(0)j + kro = —r sin(0)i + r cos(0)j, (4.5.27) 


and 


r, X rg = —rcos(0)i — rsin(0)j + rk. (4.5.28) 


168 


Figure 4.5.6 
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: Diagram for the surface integration in Example 4.5.6. 


'This is the unit area inside the cone. Because we want the exterior surface, we must take 
the negative of Equation 4.5.28. Bring all of these elements together, 


ui F -ndo = i [tur cos(6)|r[r cos(8)] 


+ [r? sin(0) cos(0)]r?[r sin(8)] — 3r?) d0 dr (4.5.29) 


4 
= | far [0 + i sin(20)] | tri sin? (9)|^" — wei dr (4.5.30) 
0 


4 
= f (4nr? — 6rr?) dr = (nr* — 227?) I; = 128m. (4.5.31) 


Problems 


Compute the surface integral ff; F ndo for the following vector fields and surfaces: 


1. F = zi— 
0 € y € 1. 
2.F — zi- 
zd 


zj + yk and the surface is the top side of the z = 1 plane where 0 € x € 1 and 


yj + zzk and the surface is the top side of the cylinder z? + y? = 9, z = 0, and 


3. F = xyi + zj + zk and the surface consists of both exterior ends of the cylinder defined 
by x? +y? = 4, z = 0, and z = 2. 


4. F=rxri+ 


zj + yk and the surface is the lateral and exterior sides of the cylinder defined 


by a? ey! =4, z = —3, and z =3. 


5. F = zyi-- 2?j + yk and the surface is the curved exterior side of the cylinder y? + 2? = 9 
in the first octant bounded by x = 0, z = 1, y = 0, and z = 0. 


6. F = yj 


| z?k and the surface is the exterior of the semicircular cylinder y? + z? = 4, 


z > 0, cut by the planes x = 0 and x = 1. 


T. F=zi+ 


xj + yk and the surface is the curved exterior side of the cylinder x? + y? = 4 


in the first octant cut by the planes z = 1 and z = 2. 
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8. F = z?i — z?j + yzk and the surface is the exterior of the hemispheric surface of 
x? + y? +2? = 16 above the plane z = 2. 


9. F = yi + zj + yk and the surface is the top of the surface z = x + 1, where —1 <x < 1 
and —1 «€ y & 1. 


10. F = zi + zj — 3zk and the surface is the top side of the plane x + y + z = 2a that lies 
above the square 0 < x < a, 0 < y < a in the zy-plane. 


11. F = (y? + 27)it (a? + 22)j + (a? + y?)k and the surface is the top side of the surface 
z=1-2? with —1 < x € 1 and -2 € y < 2. 


12. F = y?i-4 zzj-— k and the surface is the cone z = V/z? + y2, 0 € z < 1, with the normal 
pointing away from the z-axis. 


13. F = yi + z?j + 5zk and the surface is the top side of the plane z = y + 1, where 
-l<a<land-l<y<l. 


14. F = —yi + xj + zk and the surface is the exterior or bottom side of the paraboloid 
z = xz? +y, where 0 € z € 1. 


15. F = —yi + zj + 62?k and the surface is the exterior of the paraboloids z = 4 — x? — y? 
and z = z? + y?. 


4.6 GREEN'S LEMMA 


Consider a rectangle in the xy-plane that is bounded by the lines x = a, x = b, 
y — c, and y — d. We assume that the boundary of the rectangle is a piece-wise smooth 
curve that we denote by C. If we have a continuously differentiable vector function F — 
P(x,y)i + Q(x, y)j at each point of enclosed region R, then 


JZ dA = ie £5 2 a y= [ ab.nray— [ Qv f Qa 


(4.6.1) 
where the last integral is a closed line integral counterclockwise around the rectangle because 
the horizontal sides vanish, since dy — 0. By similar arguments, 


IF ess. f Plow ae (4.6.2) 


If, (5: i x) oe f. P(x, y) dz + Q(x, y) dy. (4.6.3) 


'This result, often known as Green's lemma, may be expressed in vector form as 


so that 


fra] V x F- kdA. (4.6.4) 
C R 
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Figure 4.6.1: Diagram for the verification of Green's lemma in Example 4.6.1. 


Although this proof was for a rectangular area, it can be generalized to any simply 
closed region on the zgy-plane as follows. Consider an area that is surrounded by simply 
closed curves. Within the closed contour we can divide the area into an infinite number of 
infinitesimally small rectangles and apply Equation 4.6.4 to each rectangle. When we sum 
up all of these rectangles, we find ffp V x F- kdA, where the integration is over the entire 
surface area. On the other hand, away from the boundary, the line integral along any one 
edge of a rectangle cancels the line integral along the same edge in a contiguous rectangle. 
Thus, the only nonvanishing contribution from the line integrals arises from the outside 
boundary of the domain $5 F - dr. 


e Example 4.6.1 


Let us verify Green's lemma using the vector field F — (3a — 8y”)i+ (4y — 6ry)j, and 
the enclosed area lies between the curves y = yx and y = z?. The two curves intersect at 
x = 0 and x = 1. See Figure 4.6.1. 


We begin with the line integral: 


1 
j F - dr = | (32? — 8x4) dz + (4x? — 6x?) (2x dz) 
C 0 
0 
«f (32? — 8x) dz + (Ax? — 623/2) (1571/2 dx) (4.6.5) 
1 


1 
= f (—20x* + 82? + lle — 2) dz = 8. (4.6.6) 


In Equation 4.6.6 we used y = x? in the first integral and y = yz in our return integration. 
For the areal integration, 


]].v»* kdA= [ ~ "i0oydyas= f 5y? Fass (z—a*)dz — 3 (4.6.7) 
0 


and Green's lemma is verified in this particular case. 
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Figure 4.6.2: Diagram for the verification of Green's lemma in Example 4.6.3. 


e Example 4.6.2 
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Let us redo Example 4.6.1 except that the closed contour is the triangular region defined 


by the lines x = 0, y = 0, and zt +y = 1. 
The line integral is 


1 
jre-[ (32? — 8-02)dz + (4-0 — 62-0). 0 
C 0 


«f [3(1 — y)? — 8y?](—dy) + [Ay — 6(1 — y)y] dy 


0 
«f (3-0? — 8?) -0 + (4y — 6- 0- y) dy 
1 


1 1 1 
7 32? d- | aydy | (—3 + 4y + 114°) dy 
0 0 0 


a |, - 2y?| + (739-29? + 9) E 


On the other hand, the areal integration is 


al 1-2 1 
[f vxE-xaa= f n 10ydydz = | by? |. ^ dz 
R 0 0 0 
1 
=5f (1-2)? dex =- §(1-2)3|, — 8 
0 


and Green's lemma is verified in this particular case. 


e Example 4.6.3 


Let us verify Green's lemma using the vector field F = (3x + 4y)i+ (2x — 3y)j, and the 
closed contour is a circle of radius two centered at the origin of the zy-plane. See Figure 


4.6.2. 
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Beginning with the line integration, 


f F= Pi 6 cos(8) + 8sin(£)]L-2:sin(8) d6] + [4cos(8) — 6sin(8)][2cos(6) d£] 
C 0 


(4.6.13) 
2n 
= | [—24 cos(0) sin(0) — 16 sin?(0) + 8 cos? (0)] d0 (4.6.14) 
0 
= 12 cos?(8)]" — 8 [9 — 1sin(26)] |" + 4 [8 + 1sin(26)]|" = —8r. (4.6.15) 


For the areal integration, 


2 27 
J VxF-kda= f | —2r d0 dr = —8n (4.6.16) 
R o Jo 


and Green's lemma is verified in the special case. 
Problems 
Verify Green's lemma for the following two-dimensional vector fields and contours: 


1. F = (a? + 4y)i + (y — x)j and the contour is the square bounded by the lines x = 0, 
y —0,x—1, and y — 1. 


2. F = (a — y)i + zyj and the contour is the square bounded by the lines £ = 0, y = 0, 
zx —1,andyc- l1. 


3. F = —y?i + 2?j and the contour is the triangle bounded by the lines x = 1, y = 0, and 
ya. 


4. F = (xy — x?)i + z2yj and the contour is the triangle bounded by the lines y = 0, x = 1, 
and y = x. 


5. F = sin(y)i + zcos(y)j and the contour is the triangle bounded by the lines x + y = 1, 
y —r-—]1,and y — O0. 


6. F = y?i + z?j and the contour is the same contour used in Problem 4. 


7. F = —y?i + z?j and the contour is the circle x? + y? = 4. 


8. F = —z?i + xry?j and the contour is the closed circle of radius a. 
9. F = (6y + x)i + (y + 2z)j and the contour is the circle (x — 1)? + (y — 2)? = 4. 


10. F = (x + y)i+ (22? — y?)j and the contour is the boundary of the region determined 
by the curves y = x? and y = 4. 


11. F = 3yi + 2aj and the contour is the boundary of the region determined by the curves 
y =Oand y = sin(x) with 0 € z € m. 
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12. F = —16yi + (4e" + 322)j and the contour is the pie wedge defined by the lines y = z, 
y = —z, x? + y? = 4, and y > 0. 


4.7 STOKES’ THEOREM? 


In Section 4.2 we introduced the vector quantity V x v, which gives a measure of the 
rotation of a parcel of fluid lying within the velocity field v. In this section we show how 
the curl can be used to simplify the calculation of certain closed line integrals. 

This relationship between a closed line integral and a surface integral involving the curl 
is 


Stokes’ Theorem: The circulation of F = Pi+ Qj+ Rk around the closed boundary C 
of an oriented surface S in the direction counterclockwise with respect to the surface’s unit 
normal vector n equals the integral of V x F -n over S, or 


frear=[[v«P- nde, (4.7.1) 


Stokes’ theorem requires that all of the functions and derivatives be continuous. 

The proof of Stokes’ theorem is as follows: Consider a finite surface S whose boundary 
is the loop C. We divide this surface into a number of small elements ndo and compute 
the circulation dl = fr F - dr around each element. When we add all of the circulations 
together, the contribution from an integration along a boundary line between two adjoining 
elements cancels out because the boundary is transversed once in each direction. For this 
reason, the only contributions that survive are those parts where the element boundaries 
form part of C. Thus, the sum of all circulations equals fo F - dr, the circulation around 
the edge of the whole surface. 

Next, let us compute the circulation another way. We begin by finding the Taylor 
expansion for P(z,y, z) about the arbitrary point (20, yo, zo): 


OP (xo, yo, 20) OP (xo, yo, 20) 


P(z,y,z) = P(zo; yo, zo) + (x — vo) Bas + (y — vo) ðy 
P 
yia OO) (4.7.2) 
Oz 
with similar expansions for Q(x, y, z) and R(x, y, z). Then 
P 
dr = f F: dr = Plo, v0: 20) $ dx + EOM fe zo) dz (4.7.3) 

L L Ox L 

OP(20, yo, OQ (T0, yo. Z 

4 Cowo) AC T EEE Q(xo; Yo I dodi 
Oy L Ox L 


where L denotes some small loop located in the surface S. Note that integrals such as f, dx 
and $, (a — xg) da vanish. 


4 For the history behind the development of Stokes’ theorem, see Katz, V. J., 1979: The history of 
Stokes’ theorem. Math. Mag., 52, 146-156. 
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Sir George Gabriel Stokes (1819-1903) was Lucasian Professor of Mathematics at Cambridge Uni- 
versity from 1849 until his death. Having learned of an integral theorem from his friend Lord Kelvin, 
Stokes included it a few years later among his questions on an examination that he wrote for the 
Smith Prize. It is this integral theorem that we now call Stokes’ theorem. (Portrait courtesy of the 
Royal Society of London.) 


If we now require that the loop integrals be in the clockwise or positive sense so that 
we preserve the right-hand screw convention, then 


n-kbo = do - 20) dy =~ $ (v - wo) de, (4.7.4) 


n- ja = $ (z-z) de =~ $ (0 zo) dz, (4.7.5) 


ida = $ (yy) dz =~ $ (z- 20) dy, (4.7.6) 


and 


-($- S Doo Q PY. a 5g 
m-(5 "3r nie + (57 zar nis + (78 7) nóg = V x F- nóc. 
(4.7.7) 


Therefore, the sum of all circulations in the limit when all elements are made infinitesimally 
small becomes the surface integral ffo V x F -ndo and Stokes’ theorem is proven. 


In the following examples we first apply Stokes’ theorem to a few simple geometries. 
We then show how to apply this theorem to more complicated surfaces.” 


5 Thus, different Stokes for different folks. 
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Figure 4.7.1: Diagram for the verification of Stokes’ theorem in Example 4.7.1. 
e Example 4.7.1 


Let us verify Stokes’ theorem using the vector field F = z?i -- 22j + z?k, and the closed 
curve is a square with vertices at (0,0, 3), (1,0, 3), (1, 1, 3), and (0, 1,3). See Figure 4.7.1. 
We begin with the line integral: 


jre-[ Fk. | Pe. | Fe. | F - dr, (4.7.8) 
C Cy C2 C3 C4 


where C1, C», C3, and C4 represent the four sides of the square. Along C, x varies while 
y =0 and z = 3. Therefore, 


1 
I F-dr= | de +20-0+9-0=5, (4.7.9) 
C1 0 
because dy = dz = 0, and z = 3. Along C5, y varies with x = 1 and z = 3. Therefore, 
1 
f F-dr= | 1?7-0+2-1-dy+9-0=2. (4.7.10) 
C» 0 
Along C3, x again varies with y = 1 and z = 3, and so, 
0 
n Pede= | atde+20-0+9-0=-3. (4.7.11) 
C3 1 


Note how the limits run from 1 to 0 because x is decreasing. Finally, for C4, y again varies 
with z — 0 and z = 3. Hence, 


0 
n 02.04-2-0-dy4-9-0— 0. (4.7.12) 
C4 1 


Hence, 
f F-dr = 2. (4.7.13) 
C 
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(0,a,1) 


Figure 4.7.2: Diagram for the verification of Stokes’ theorem in Example 4.7.2. 


Turning to the other side of the equation, 


ij k 
VXE=|=> £o c (4.7.14) 
zx? Qe z? 


Our line integral has been such that the normal vector must be n — k. Therefore, 


1 pl 
][ vxsse-[ | 2k - kdz dy = 2 (4.7.15) 
S 0 Jo 


and Stokes’ theorem is verified for this special case. 


e Example 4.7.2 


Let us verify Stokes’ theorem using the vector field F = (x? — y)i + 4zj + z?k, where 
the closed contour consists of the x and y coordinate axes and that portion of the circle 
x? + y? =a? that lies in the first quadrant with z = 1. See Figure 4.7.2. 

The line integral consists of three parts: 


fra Foart | Fe. | F dr. (4.7.16) 
C Cı C3 C3 


Along Ci, x varies while y = 0 and z = 1. Therefore, 


3 


f Pedr = | (2? —0)de+4-1-04+27-0=%. (4.7.17) 
C 0 3 


Along the circle C5, we use polar coordinates with x = acos(t), y = asin(t), and z = 1. 
Therefore, 


m/2 
| F - dr = J [a? cos? (t) — a sin(t)]|-a sin (t) dt] + 4- 1 - a cos(t) dt + a? cos? (t) - 0, 
Cs 0 


(4.7.18) 
"/2 
= | —a? cos? (t) sin(t) dt + a? sin? (t) dt + 4a cos(t) dt (4.7.19) 
0 
d m. g 1 7/2 1/2 
3 : š 
= — cos” (t) Tou | -= sio) + Aa sin(t) (4.7.20) 
3 0 2 2 0 0 
a? ar 
——— + —— + 4a, (4.7.21) 


3 4 
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Figure 4.7.3: Diagram for the verification of Stokes’ theorem in Example 4.7.3. 


because dx = —asin(t) dt, and dy = acos(t) dt. Finally, along C3, y varies with x = 0 and 
z = 1. Therefore, 


0 
| F-dr= f (0? y)-0-+4-1-dy-+0?-0=~4a, (4.7.22) 
C3 a 


so that 3 
$ F- dr = 2. (4.7.23) 
C 4 


Turning to the other side of the equation, 
i j k 
[o ð ə . š 
z?—y 4z mc 


From the path of our line integral, our unit normal vector must be n — k. Then, 


a pr/2 2 
II VX F-ndo = ri ri [-4i — 2r cos(0)j + k] -kr dô dr = Z- (4.7.25) 
S 0 0 


and Stokes’ theorem is verified for this case. oO 


e Example 4.7.3 


Let us verify Stokes’ theorem using the vector field F = 2yzi— (x-4-3y — 2)j - (x? + z)k, 
where the closed triangular region is that portion of the plane x +y + z = 1 that lies in the 
first octant. 

As shown in Figure 4.7.3, the closed line integration consists of three line integrals: 


jre-[ Foart | Pe. | F dr. (4.7.26) 
C Cy C3 C3 


Along Ci, z = 0 and y = 1 — x. Therefore, using x as the independent variable, 


0 
[ee 2(1—2)-0-dx—(2+3—3x—2)(—dx)+(x?+0)-0 = -a? | +a) =0. (4.7.27) 
1 1 
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Along Co, x = 0 and y = 1 — z. Thus, 


1 
[Pas f 2(1—2)2-0— (04-3—32 — 2)( dz) + (0? + z) dz = — 822474123) — 0. 
C2 0 


(4.7.28) 
Finally, along C3, y = 0 and z 2 1 — x. Hence, 


1 
f| za- 2-0- (1— g) dx—(x+0—2)-0+(2?+1-2)(—dr) = —iz? gti = 5, 
3 


(4.7.29) 
Thus, 
jre- (4.7.30) 
[o 
On the other hand, 
i j k 
VxF=|2 E & |-(-2x + 2y)j + (-1-— 2z)k. (4.7.31) 
2yz r—3yct2 a+z 


To find ndo, we use the general coordinate system x = u, y = v, andz—1-— u- v. 
Therefore, r = ui + vj + (1 — u — v)k and 


ij k 
Tu Xr =|1 0 —1|=i+j+k. (4.7.32) 
0 1 -1 


Thus, 


1 1—u 
Jf vxP nae = | i, [((—2u + 2v)j + (-1— 2 + 2u + 2v)k] - [i +j + k] dv du 
S o Jo 


(4.7.33) 
= a ac = 3) dv du = fea 7 u)? = 3 = u)] diy (4.7.34) 
0 0 0 
s (—1— u + 24°) du = - (4.7.35) 
0 


and Stokes' theorem is verified for this case. 
Problems 
Verify Stokes’ theorem using the following vector fields and surfaces: 


1. F = 5yi — 5xj + 3zk and the surface S is that portion of the plane z = 1 with the square 
at the vertices (0,0, 1), (1,0, 1), (1, 1, 1), and (0, 1, 1). 


2. F = «i+ y?j + z?k and the surface S is the rectangular portion of the plane z = 2 
defined by the corners (0,0, 2), (2,0, 2), (2, 1, 2), and (0, 1,2). 


3. F = zi + xj + yk and the surface S is the triangular portion of the plane z = 1 defined 
by the vertices (0,0, 1), (2,0, 1), and (0, 2, 1). 
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4. F = 2zi — 3xj + 4yk and the surface S is that portion of the plane z = 5 within the 
cylinder a? + y? = 4. 


5. F = zi + zj + yk and the surface S is that portion of the plane z = 3 bounded by the 
lines y = 0, x = 0, and z? + y? = 4. 


6. F = (2z + 2)i+ (y — z)j + (x + y)k and the surface S is the interior of the triangularly 
shaped plane with vertices at (1,0,0), (0, 1,0), and (0,0, 1). 


T. F = zi + xj + yk and the surface S is that portion of the plane 2x + y + 2z = 6 in the 
first octant. 


8. F = zi + zzj + yk and the surface S is that portion of the paraboloid z = 9 — x? — y? 
within the cylinder x? + y? = 4. 


4.8 DIVERGENCE THEOREM 


Although Stokes' theorem is useful in computing closed line integrals, it is usually very 
difficult to go the other way and convert a surface integral into a closed line integral because 
the integrand must have a very special form, namely V x F-n. In this section we introduce 
a theorem that allows with equal facility the conversion of a closed surface integral into a 
volume integral and vice versa. Furthermore, if we can convert a given surface integral into 
a closed one by the introduction of a simple surface (for example, closing a hemispheric 
surface by adding an equatorial plate), it may be easier to use the divergence theorem 
and subtract off the contribution from the new surface integral rather than do the original 
problem. 

'This relationship between a closed surface integral and a volume integral involving the 
divergence operator is 


The Divergence or Gauss's Theorem: Let V be a closed and bounded region in three- 
dimensional space with a piece-wise smooth boundary S that is oriented outward. Let 
F = P(r,y,z)-Q(r,yz)j-- R(x,y,z)k be a vector field for which P, Q, and R are 
continuous and have continuous first partial derivatives in a region of three-dimensional 
space containing V. Then 


fron - fff vw (4.8.1) 


Here, the circle on the double integral signs denotes a closed surface integral. 

A nonrigorous proof of Gauss’s theorem is as follows. Imagine that our volume V is 
broken down into small elements dr of volume of any shape so long as they include all of 
the original volume. In general, the surfaces of these elements are composed of common 
interfaces between adjoining elements. However, for the elements at the periphery of V, 
part of their surface will be part of the surface S that encloses V. Now d® = V.Fdr is 
the net flux of the vector F out from the element dr. At the common interface between 
elements, the flux out of one element equals the flux into its neighbor. Therefore, the sum 


of all such terms yields 
»- ff V.Fdr (4.8.2) 
V 
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Carl Friedrich Gauss (1777-1855), the prince of mathematicians, must be on the list of the greatest 
mathematicians who ever lived. Gauss, a child prodigy, is almost as well known for what he did not 
publish during his lifetime as for what he did. This is true of Gauss’s divergence theorem, which he 
proved while working on the theory of gravitation. It was only when his notebooks were published 
in 1898 that his precedence over the published work of Ostrogradsky (1801-1862) was established. 
(Portrait courtesy of Photo AKG, London, with permission.) 


and all of the contributions from these common interfaces cancel; only the contribution 
from the parts on the outer surface S is left. These contributions, when added together, 


give FfF -ndo over S and the proof is completed. 


e Example 4.8.1 


Let us verify the divergence theorem using the vector field F = 4zi — 2y?j + z?k and 
the enclosed surface is the cylinder x? + y? = 4, z = 0, and z = 3. See Figure 4.8.1. 
We begin by computing the volume integration. Because 


o(4x) | 0(-2y?) , A(z?) 
Yas "ay | Oz 


Ox 
J| 9 Pe = [[[ &— m 22v (4.8.4) 


3 p2 p2r 
E | Í J [4 — 4r sin(0) + 22] dO r dr dz (4.8.5) 
o Jo Jo 


—4-— 4y + 22, (4.8.3) 


8 p2 
= J | |ui + 4r cos(8) |^" + d r dr dz (4.8.6) 
o Jo 
3 p2 3 : 
= J (8r + Anz)r dr dz a 4n(2 + z)ir?|, dz (4.8.7) 
o Jo 0 


3 
= anf 2(2 + z) dz = 8n(2z + 12?)|) = 84m. (4.8.8) 
0 
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Figure 4.8.1: Diagram for the verification of the divergence theorem in Example 4.8.1. 


Turning to the surface integration, we have three surfaces: 


ffr ndo = || F- nds « |f F nds |f F ndo. (4.8.9) 
S Si S2 Ss 


The first integral is over the exterior to the cylinder. Because the surface is defined by 


V 2ai + 2yj 
DEL REL PET (4.8.10) 


IVA yAa?--4Ay? 2 


e 


N 


Therefore, 
2n 
JE ndo = Jf (22? — 4?) do 2 f (2(2cos(8)? — [2sin(0)? ) 240 dz — (4.8.11) 
E 
: 2T 
= =s f f fi 1 + cos(20)] — sin(0) + cos? (0) wae dé dz 4.8.12) 
2n 
= =16 f E + isin(20) + cos(8) — 0) dz 4.8.13) 
0 0 
3 
€ 16s | dz = 487, 4.8.14) 
0 


because x = 2cos(0), y = 2sin(0), and do = 2 d0 dz in cylindrical coordinates. 
Along the top of the cylinder, z — 3, the outward pointing normal is n — k, and 
do = r dr d0. Then, 


2T 2 
I F-ndo= ff 2 do = f i 9rdrd@ = 2n x 9 x 2 = 36r. (4.8.15) 
S2 S2 0 0 


However, along the bottom of the cylinder, z — 0, the outward pointing normal is n — —k 
and do — r dr d0. 'Then, 


27 2 
J F-ndo= |) 2do = f i Ordr dé = 0. (4.8.16) 
S3 S3 0 0 
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Consequently, the flux out of the entire cylinder is 


ff F -ndo = 48r + 367 + 0 = 847, (4.8.17) 
S 


and the divergence theorem is verified for this special case. 
e Example 4.8.2 


Let us verify the divergence theorem given the vector field F = 3z?y?i + yj — 6xy?zk 
and the volume is the region bounded by the paraboloid z = x? + y?, and the plane z = 2y. 
See Figure 4.8.2. 

Computing the divergence, 


O(3z?y?) , Aly) , 0(—-6xy?z) 


. F = = 2 = 2 -— 1. Oo. 
V ET + au + 8z 6xy^--1—6zry^-—1 (4.8.18) 
Then, 
nm p2sin(@) p2rsin(0) 
Jx 9v - [[[ av = | l l dz r dr d0 4.8.19) 
V V 0 0 T2 
m p2sin(0) 
= n n [2r sin(0) — r°] r dr d6 4.8.20) 
0 Jo 
T 2sin(0) 2sin(0) 
=| a sin(9) — ir* | d 4.8.21) 
0 0 0 
=| [4$ sin^(0) — 4sin*(6)] a= f 4 sin (0) d0 4.8.22) 
1 n [1 — 2cos(20) + cos?(20)] d0 4.8.23) 
= sfo —sin(20)| + 40| + $sin(40) | Eu 4.8.24) 
0 0 0 0 


The limits in the radial direction are given by the intersection of the paraboloid and plane: 
r? = 2r sin(0), or r = 2sin(0), and y is greater than zero. 
Turning to the surface integration, we have two surfaces: 


gpP nao = ff F-ndo+ |f F: ndo, (4.8.25) 
S Si 2 


where $1 is the plane z = 2y, and S» is the paraboloid. For either surface, polar coordinates 
are best so that « = rcos(0), and y = rsin(0). For the integration over the plane, z = 
2r sin(0). Therefore, 


r—rcos(0)i + rsin(0)j + 2r sin(0)k, (4.8.26) 
so that 
r, = cos(0)i + sin(0)j + 2sin(0)k, (4.8.27) 
and 
rg = —rsin(0)i + r cos(0)j + 2r cos(0)k. (4.8.28) 
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Figure 4.8.2: Diagram for the verification of the divergence theorem in Example 4.8.2. The dashed line 
denotes the curve r — 2sin(0). 


Then, 
i j k 
r, X rg =| cos(0) sin(0)  2sin(@) | 2 —2rj + rk. (4.8.29) 
—rsin(0) rcos(@) 2rcos(0) 


'This is an outwardly pointing normal so that we can immediately set up the surface integral: 


m p2sin(0) 
J F- ndo =| J (3r^ cos? (0) sin?(0)i + r sin(0)j 
Si 0 0 


— 6[2r sin()][r cos(6)][r? sin? (@)|k} - (—2rj+rk)drd@ — (4.8.30) 


2sin(0) 
= i | [-2r? sin(8) — 12r? sin?(8) cos(0)] dr dé (4.8.31) 
t 2sin(0) . 2sin(0) . 
-f ET ( ) sin(0) — 2r$[ ( ) sin*(6) eos) d0 (4.8.32) 
zi E sin^(0) — 128 sin? (0) eost) dé (4.8.33) 
0 


i -s[er — sin(20) MES ie[ + x sin(40)|" J — St sin! KOJN (4.8.34) 


= —2r. (4.8.35) 


For the surface of the paraboloid, 
r = rcos(0)i + r sin(0)j + r?k, (4.8.36) 


so that 
r, = cos(0)i + sin(0)j + 2rk, (4.8.37) 


and 
rg = —rsin(0)i + r cos(0)j. (4.8.38) 
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Then, 
i j k 
r.Xrg-—| cos(0) sin() 2r|— —2r? cos(0)i — 2r? sin(0)j + rk. (4.8.39) 
—rsin(0) rcos(0) 0 


This is an inwardly pointing normal, so that we must take the negative of it before we do 
the surface integral. Then, 


n -ndo = L rn P gj cos? (8) sin"()i + r sin(@)j — 6r?[r cos(6)][r^ sin? (6)]k] 
- [2r? cos(0)i + 2r? sin(0)j — rk] dr d8 (4.8.40) 


2sin 
= f | [6r? cos? (0) sin? (8) + 2r? sin? (0) + 6r? cos(0) sin? (0)] dr d8 


(4.8.41) 
- | [ero cos? (0) sin?(8) + 142 sin?(9) 
0 
+ 877/72) cos(0) sno) do (4.8.42) 
7 0 iss 


= LY = sin?(8)[1 — sin?(0)] cos(0) + 8 sin? (0) + 768 sin” (0) ext] dé 
0 


4.8.43) 
= 1596 sin'°(6)|5 — € sim? ()|" «f [1 — cos(20)]? d0 4.8.44) 
0 
= f fı — 3 cos(20) + 3 cos? (20) — cos(20)[1 — sin? 20)]}a0 4.8.45) 
0 


= - e| - 3 sin(20) )o + 3[0 + i sin(40)]|5 — į sin(20) )o + 2 + sin?(20)|7 4.8.46) 


=T +r = Šrī. 4.8.47) 


Consequently, 
ff F -ndo = —2n + 30 = im, (4.8.48) 
S 


and the divergence theorem is verified for this special case. 
e Example 4.8.3: Archimedes' principle 


Consider a solid? of volume V and surface S that is immersed in a vessel filled with a 
fluid of density p. The pressure field p in the fluid is a function of the distance from the 
liquid/air interface and equals 


P = po — pgz, (4.8.49) 


6 Adapted from Altintas, A., 1990: Archimedes’ principle as an application of the divergence theorem. 
IEEE Trans. Educ., 33, 222. 
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where g is the gravitational acceleration, z is the vertical distance measured from the in- 
terface (increasing in the k direction), and po is the constant pressure along the liquid/air 
interface. 

If we define F = —pk, then F -n dø is the vertical component of the force on the surface 
due to the pressure and fh F - ndo is the total lift. Using the divergence theorem and 
noting that V - F = pg, the total lift also equals 


ni V-.FdV = pg J dV = pgV, (4.8.50) 
V V 


which is the weight of the displaced liquid. This is Archimedes’ principle: The buoyant force 
on a solid immersed in a fluid of constant density equals the weight of the fluid displaced. 


e Example 4.8.4: Conservation of charge 


Let a charge of density p flow with an average velocity v. Then the charge crossing the 
element dS per unit time is pv- dS = J - dS, where J is defined as the conduction current 
vector or current density vector. The current across any surface drawn in the medium is 


ff; J : dS. 

The total charge inside the closed surface is Jf f\, p dV. If there are no sources or sinks 
inside the surface, the rate at which the charge decreases is — ff Sy pr dV. Because this 
change is due to the outward flow of charge, 


-f [iE dv = f J- ds. (4.8.51) 
Applying the divergence theorem, 


Ii (ev: J) v=o. (4.8.52) 


Because the result holds true for any arbitrary volume, the integrand must vanish identically 
and we have the equation of continuity or the equation of conservation of charge: 


—+V-J=0. (4.8.53) 


Problems 
Verify the divergence theorem using the following vector fields and volumes: 


1. F = z?i-- y?j + 27k and the volume V is the cube cut from the first octant by the planes 
x=1, y= 1, andz=1. 


2. F = xyi + yzj + zk and the volume V is the cube bounded by 0<z<1,0<y<1, 
and 0 € z € 1. 


3. F = (y—a2)i+(z-y)j+ x)k and the volume V is the cube bounded by —1 < x <1, 
< 


(y— 
-l<y<l,and-l<z<l. 
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4. F = z?i + yj + zk and the volume V is the cylinder defined by the surfaces x? + y? = 1, 
z — 0, and z — 1. 


5. F = z2i-- y?j4- z?k and the volume V is the cylinder defined by the surfaces x? + y? = 4, 
z=0,and z=1. 


6. F = yi + xz3j + (z — 1)?k and the volume V is the cylinder bounded by the surface 
x? + y? = 4, and the planes z = 1 and z = 5. 


T. F = 6ryi+4yzj+ae ?k and the volume V is that region created by the plane x+y+z = 1, 
and the three coordinate planes. 


8. F = yi + zyj — zk and the volume V is that solid created by the paraboloid z = x? + y? 
and plane z — 1. 


Further Readings 


Davis, H. F., and A. D. Snider, 1995: Introduction to Vector Analysis. Wm. C. Brown 
Publ., 416 pp. Designed as a reference book for engineering majors. 


Kendall, P. C., and D. E. Bourne, 1992: Vector Analysis and Cartesian Tensors. Wm. 
C. Brown, Publ., 304 pp. A clear introduction to the concepts and techniques of vector 
analysis. 


Matthews, P. C., 2005: Vector Calculus. Springer, 200 pp. A good book for self-study with 
complete solutions to the problems. 


Schey, H. M., 2005: Div, Grad, Curl, and All That. Chapman & Hall, 176 pp. A book to 
hone your vector calculus skills. 
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Chapter 5 


Fourier Series 


Fourier series arose during the eighteenth century as a formal solution to the classic 
wave equation. Later on, it was used to describe physical processes in which events re- 
cur in a regular pattern. For example, a musical note usually consists of a simple note, 
called the fundamental, and a series of auxiliary vibrations, called overtones. Fourier's the- 
orem provides the mathematical language that allows us to precisely describe this complex 
structure. 


5.1 FOURIER SERIES 


One of the crowning glories! of nineteenth-century mathematics was the discovery that 


the infinite series 
f(t) — > +E ancos( 17") + basin( 27E) (5.1.1) 


can represent a function f(t) under certain general conditions. This series, called a Fourier 
series, converges to the value of the function f(t) at every point in the interval [- L, L] with 
the possible exceptions of the points at any discontinuities and the endpoints of the interval. 


1 “Fourier’s Theorem ... is not only one of the most beautiful results of modern analysis, but may be 
said to furnish an indispensable instrument in the treatment of nearly every recondite question in modern 
physics. To mention only sonorous vibrations, the propagation of electric signals along a telegraph wire, 
and the conduction of heat by the earth's crust, as subjects in their generality intractable without it, is to 
give but a feeble idea of its importance." (Quote taken from Thomson, W., and P. G. Tait, 1879: Treatise 
on Natural Philosophy, Part 1. Cambridge University Press, Section 75.) 
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Because each term has a period of 2L, the sum of the series also has the same period. The 
fundamental of the periodic function f(t) is the n — 1 term while the harmonics are the 
remaining terms whose frequencies are integer multiples of the fundamental. We must now 
find some easy method for computing the coefficients a, and b, for a given function f(t). 
As a first attempt, we integrate Equation 5.1.1 term by term? from —L to L. On the right 
side, all of the integrals multiplied by a, and bn vanish because the average of cos(nat/L) 
and sin(nat/L) is zero. Therefore, we are left with 


L 
- s. f(t) dt. (5.1.2) 


Consequently ag is twice the mean value of f(t) over one period. 
We next multiply each side of Equation 5.1.1 by cos(mat/L), where m is a fixed integer. 
Integrating from —L to L, 


foo) fm) Ee fe) )« 


Dx X sin( 27) eos (77) dt. (5.1.3) 


The ap and b, terms vanish by direct integration. Finally, all of the a, integrals vanish 
when n Z m. Consequently, Equation 5.1.3 simplifies to 


= 2 f(t) cos( 27t) dt, (5.1.4) 


because Ji cos?(nzt/ L) dt = L. Finally, by multiplying both sides of Equation 5.1.1 by 
sin(mat/L) (m is again a fixed integer) and integrating from — L to L, 


- 2 f(t) sin“) dt. (5.1.5) 


Although Equation 5.1.2, Equation 5.1.4, and Equation 5.1.5 give us ao, an, and bpn for 
periodic functions over the interval [- L, L], in certain situations it is convenient to use the 
interval [r,r + 2L], where 7 is any real number. In that case, Equation 5.1.1 still gives the 


? We assume that the integration of the series can be carried out term by term. This is sometimes 
difficult to justify but we do it anyway. 
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Fourier series of f(t) and 
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7" F( cos( 27t) dt, (5.1.6) 
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These results follow when we recall that the function f(t) is a periodic function that extends 
from minus infinity to plus infinity. The results must remain unchanged, therefore, when 
we shift from the interval [—L, L] to the new interval [7,7 + 2L]. 

We now ask the question: what types of functions have Fourier series? Secondly, if a 
function is discontinuous at a point, what value will the Fourier series give? Dirichlet®:+ 
answered these questions in the first half of the nineteenth century. His results may be 
summarized as follows. 


Dirichlet's Theorem: If for the interval [—L,L] the function f(t) (1) is single-valued, 
(2) is bounded, (3) has at most a finite number of maxima and minima, and (4) has only 
a finite number of discontinuities (piecewise continuous), and if (5) f(t + 2L) = f(t) for 
values of t outside of |- L, L], then 


N 
ft) = b + 5 an cos( 27t) + by sin( 27t) (5.1.7) 


converges to f(t) as N — oo at values of t for which f(t) is continuous and to $[f(t*) + 
f(t-)] at points of discontinuity. The quantities t” and t^ denote points infinitesimally 
to the right and left of t. The coefficients in Equation 5.1.7 are given by Equation 5.1.2, 
Equation 5.1.4, and Equation 5.1.5. A function f(t) is bounded if the inequality |f(t)| < M 
holds for some constant M for all values of t. Because the Dirichlet’s conditions (1)-(4) are 
very mild, it is very rare that a convergent Fourier series does not exist for a function that 
appears in an engineering or scientific problem. 


e Example 5.1.1 


Let us find the Fourier series for the function 


0, —7<t<0, 
n - (e drm (5.1.8) 


We compute the Fourier coefficients a, and bn using Equation 5.1.6 by letting L = m 
and T = —7. We then find that 


pp mac = f idi 
0 


T T 


T 


2: (5.1.9) 


3 Dirichlet, P. G. L., 1829: Sur la convergence des séries trigonométriques qui servent à représenter une 
fonction arbitraire entre des limites données. J. Reine Angew. Math., 4, 157—169. 


^ Dirichlet, P. G. L., 1837: Sur l'usage des intégrales définies dans la sommation des séries finies ou 
infinies. J. Reine Angew. Math., 17, 57-67. 
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A product of the French Revolution, (Jean Baptiste) Joseph Fourier (1768-1830) held positions 
within the Napoleonic Empire during his early career. After Napoleon's fall from power, Fourier 
devoted his talents exclusively to science. Although he won the Institut de France prize in 1811 for 
his work on heat diffusion, criticism of its mathematical rigor and generality led him to publish the 
classic book Théorie analytique de la chaleur in 1823. Within this book he introduced the world to 
the series that bears his name. (Portrait courtesy of the Archives de l'Académie des sciences, Paris.) 


"  cos(nz)—1 EP uc ac: 


an = JE dt — 1 E ; 2 


: 2 


T T n n " n?gz nn 
(5.1.10) 
because cos(nz) = (—1)", and 
ft E i r —1)r+1 
bn = - | tsin(nt) dt = | FORD) X I M UNO (5.1.11) 
T Jo T n n o n 
for n = 1,2,3,.... Thus, the Fourier series for f(t) is 
e. (-1)}—1 —1)”+! 
f(t)— “ H X ( = cos(nt) + =a sin(nt) (5.1.12) 
n=1 
T 2«&«cos(2m— D] S GC. 
= aS 5 OnI 5 ~~ sin(nt). (5.1.13) 
m=1 n=1 


We note that at the points t = +(2n — 1), where n = 1,2,3,..., the function jumps 
from zero to m. To what value does the Fourier series converge at these points? From 
Dirichlet's theorem, the series converges to the average of the values of the function just 
to the right and left of the point of discontinuity, i.e., (m + 0)/2 = 7/2. At the remaining 
points the series converges to f(t). 
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Second to Gauss, Peter Gustav Lejeune Dirichlet (1805-1859) was Germany's leading mathematician 
during the first half of the nineteenth century. Initially drawn to number theory, his later studies 
in analysis and applied mathematics led him to consider the convergence of Fourier series. These 
studies eventually produced the modern concept of a function as a correspondence that associates 
with each real x in an interval some unique value denoted by f(x). (Taken from the frontispiece of 
Dirichlet, P. G. L., 1889: Werke. Druck und Verlag von Georg Reimer, 644 pp.) 


Figure 5.1.1 shows how well Equation 5.1.12 approximates the function by graphing 
various partial sums of this expansion as we include more and more terms (harmonics). The 
MATLAB script that created this figure is: 


clear; 

t = [-4:0.1:4]; 4 create time points in plot 

f = zeros(size(t)); % initialize function f(t) 

for k = 1:length(t) % construct function f(t) 
if t(k) < 0; f(k) = 0; else f(k) = t(k); end; 
if t(k) < -pi; f(k) = t(k) + 2*pi; end; 
if t(k) > pi ; f(k) 0; end; 

end 

^4 initialize Fourier series with the mean term 

fs = (pi/4) * ones(size(t)); 

clf % clear any figures 

for n = 1:6 

^4 create plot of truncated FS with only n harmonic 
fs = fs - (2/pi) * cos((2*n-1)*t) / (2*n-1)^72; 
fs = fs - (-1)7n * sin(n*t) / n; 
subplot(3,2,n), plot(t,fs,t,f,’--’) 
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— mean plus 1 term 
EL d 


— mean plus 2 terms 


-2 -2 
-4 -2 0 2 4 -4 -2 0 2 4 
4 4 
— meanplus3terms — — meanplus4terms — | 
LU ELLA NI 
2 ar N p 
i 
; l 
0 0 NS == J 
-2 -2 
-4 -2 0 2 4 -4 -2 0 2 4 


—— mean plus 6 terms 
--f0 


-4 -2 0 2 4 -4 -2 0 2 4 


Figure 5.1.1: Partial sum of the Fourier series for Equation 5.1.8. 


if n== 
legend(?mean plus 1 term’,’f(t)’); legend boxoff; 
else 
legend([’mean plus ?,num2str(n),? terms’],’f(t)’) 
legend boxoff 
end 
if n >= 5; xlabel(’t’); end; 
end 
As the figure shows, successive corrections are made to the mean value of the series, 7/2. 
As each harmonic is added, the Fourier series fits the function better in the sense of least 
squares: 


T+2L 
/ f(x) — fn (x)]? dz = minimum, (5.1.14) 


where fw(x) is the truncated Fourier series of N terms. 


e Example 5.1.2 
Let us calculate the Fourier series of the function f(t) — |t|, which is defined over the 


range -mn <t<7. 
From the definition of the Fourier coefficients, 


1 E T T T 
aja —tdt td| =- = 11 
ay = i |/ +f | T+ oa, (5.1.15) 


>R 


an = - Dr —t cos(nt) at f toos(nt) a| (5.1.16) 
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4 4 
— mean plus 1 term — mean plus 2 terms 
3| - - - f(t) ^ 3t 7 --ft) 
2 2 
1 1 
0 vg 0 z 
-4 -2 0 2 4 -4 -2 0 2 4 
4 4 
—— mean plus 3 terms —— mean plus 4 terms 
3 - - f(t) 3 - - f(t) 
2 2 
1 1 
0 0 
-4 -2 0 2 4 -4 -2 0 2 4 
4 4 
—— mean plus 5 terms —— mean plus 6 terms 
3 - - Wt 3 =f 
2 2 
1 1 
0 0 
-4 -2 0 2 4 -4 -2 0 2 4 
t t 
Figure 5.1.2: Partial sum of the Fourier series for f(t) = |t]. 


|. ntsin(nt) + cos(nt) |" nt sin(nt) + cos(nt) |” 
i nT me nt " 
2 
= < |(-1)} -1 
erei 
and 
1 0 T 
bn = — / —tsin(nt) dt +f tsin(nt) ai 

T -T 0 

| ntcos(nt) — sin(nt) |° nt cos(nt) — sin(nt) |" P 

i n?n » nn " B 


for n — 1,2,3,.... Therefore, 


acer lene ee 


for -r €t. 
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(5.1.17) 


(5.1.18) 


(5.1.19) 


(5.1.20) 


(5.1.21) 


In Figure 5.1.2 we show how well Equation 5.1.21 approximates the function by graph- 
ing various partial sums of this expansion. As the figure shows, the Fourier series does very 
well even when we use very few terms. The reason for this rapid convergence is the nature 


of the function: it does not possess any jump discontinuities. 


e Example 5.1.3 


Sometimes the function f(t) is an even or odd function.? Can we use this property to 


simplify our work? The answer is yes. 


5 An even function fe(t) has the property that fe(—t) = fe(t); an odd function fo(t) has the property 


that fo(—t) = —fo(t). 
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Let f(t) be an even function. Then 


L L 
E if, f(t) dt = i | f(t) dt, (5.1.22) 


an = ae f(t) cos( 575) at = AT f(t) cos( 27t) dt, (5.1.23) 
-if n f(t) )sin( J dt — 0. (5.1.24) 


Here we used the properties that f^, fe(«) dx = 2 ih fe(w) da and (^, fo(a)dx = 0. Thus, 
if we have an even function, we pry bud do and ay, a Equation 5.1. ? and Equation 
5.1.23, and b, 2 0. Because the corresponding series contains only cosine terms, it is often 


called a Fourier cosine series. 
Similarly, if f(t) is odd, then 


and 


whereas 


ao = an = 0, and =2 fx f(t) )sin( J dt. (5.1.25) 


Thus, if we have an odd function, we merely compute b, via Equation 5.1.25 and ap = ay = 
0. Because the corresponding series contains only sine terms, it is often called a Fourier 
Sine series. 


e Example 5.1.4 


In the case when f(x) consists of a constant and/or trigonometric functions, it is much 
easier to find the corresponding Fourier series by inspection rather than by using Equation 
5.1.6. For example, let us find the Fourier series for f(x) = sin? (x) defined over the range 
-T ILTIT. 

We begin by rewriting f(x) = sin? (x) as f(x) = 1[1— cos(2x)]. Next, we note that any 
function defined over the range —7 < x < m has the Fourier series 


f(x) = b t Y cos(na) + bn sin(nx) (5.1.26) 
= b + a4 cos(z) + bı sin(x) + a2 cos(2x) + bz sin(2x) +--+. (5.1.27) 

On the other hand, 
f(x) = $ — 5 c0s(2x) = 4 + Ocos(x) + Osin(x) — 5 cos(2x) + Osin(2x)+---. (5.1.28) 


Consequently, by inspection, we can immediately write that 
ag = 1, ay = by — 0, ag = -i, bo = 0, an = bn = 0, n > 3. (5.1.29) 


Thus, instead of the usual expansion involving an infinite number of sine and cosine terms, 
our Fourier series contains only two terms and is simply 


f(x) = à — $cos(2z), —rTEm-Xm. (5.1.30) 
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e Example 5.1.5: Quieting snow tires 


An application of Fourier series to a problem in industry occurred several years ago, 
when drivers found that snow tires produced a loud whine? on dry pavement. Tire sounds 
are produced primarily by the dynamic interaction of the tread elements with the road 
surface.’ As each tread element passes through the contact patch, it contributes a pulse of 
acoustic energy to the total sound field radiated by the tire. 

For evenly spaced treads we envision that the release of acoustic energy resembles the 
top of Figure 5.1.3. If we perform a Fourier analysis of this distribution, we find that 


1 —mn/2--e m/2+e 4 
ao = — / iae f Lg = *#, (5.1.31) 
T —n/2—e m/2—e T 


where € is half of the width of the tread and 


1 —mn/2-4€ m/2+e 
an = — / cos(nt) dt «f cos(nt) dt 5.1.32) 

T —r/2—e T/2—e 
1 " —m/2--e : m/2+e 

zx [sino 7/2 + sin(nt) r 5.1.33) 

R. sin( = +ne) sin( = ne) | sin( 7 + nc) sin( ne) 5.1.34) 
nT 2 2 2 2 
1 nT nT 4 nm 

= — ; j i = — — Jsi 5.1. 
= [2 cos( 5 ) 2cos( 5 J sin(ne) = cos( 5 ) sin(ne) 35) 


Because f(t) is an even function, b, = 0. 

The question now arises of how to best illustrate our Fourier coefficients. In Section 5.4 
we will show that any harmonic can be represented as a single wave A, cos(n7t/L + pn) or 
A, sin(nzt/ L + Yn), where the amplitude A, = ,/a? + 02. In the bottom frame of Figure 
5.1.3, MATLAB was used to plot this amplitude, usually called the amplitude or frequency 
spectrum 34/a2 + 62, as a function of n for an arbitrarily chosen e = 7/12. Although the 


value of e will affect the exact shape of the spectrum, the qualitative arguments that we 
1 


will present remain unchanged. We have added the factor 5 so that our definition of the 
frequency spectrum is consistent with that for a complex Fourier series stated after Equation 
5.5.12. The amplitude spectrum in Figure 5.1.3 shows that the spectrum for periodically 
placed tire treads has its largest amplitude at small n. This produces one loud tone plus 
strong harmonic overtones because the fundamental and its overtones are the dominant 
terms in the Fourier series representation. 

Clearly this loud, monotone whine is undesirable. How might we avoid it? Just as 
soldiers marching in step produce a loud uniform sound, we suspect that our uniform tread 
pattern is the problem. Therefore, let us now vary the interval between the treads so that 
the distance between any tread and its nearest neighbor is not equal, as illustrated in Figure 


5.1.4. Again we perform its Fourier analysis and obtain that 


1 —m/2--e m/4+e 4 
— | [ae Psa] =, (5.1.36) 
T T 


T —n/2—e /4—€ 


6 See Varterasian, J. H., 1969: Math quiets rotating machines. SAE J., 77(10), 53. 


7 Willett, P. R., 1975: Tire tread pattern sound generation. Tire Sci. Tech., 3, 252-266. 
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f(t) 


2€ 


—2n —1 T 2m t 


(5.1.37) 


(5.1.38) 


ne) (5.1.39) 


© 
— 0.05 | 
LJ 
0—e e e e pk sacs ace 
0 5 10 15 20 
n 
Figure 5.1.3: Temporal spacing (over two periods) and frequency spectrum of uniformly spaced snow tire 
treads. 
1 —mn/2--e m/4--e 
an = = / cos(nt) a+ [ cos(nt) dt 
T —1/2-€ m/A—e 
1 —mn/2--e m/4+e 
= — sin(nt) + — sin(nt) 
TT —m/2—e m/4—e 
1 [si (= ) á (m l J od [s = i ) a (= 
= ın nE 1n r TE 1 ın r nE 1n 
nc 2 2 nT 4 4 
NT . 
An = [cos ( ) cos( J sin(ne), 
nm 2 
and 


1 —m/2-4€ m/4--€ 
bn = — j. sin(nt) dt «f sin(nt) dt 
T 


—m/2—e m/A—e 


= = [cos ( 7 ne) cos( 7 } ne) 
2 . (mm . (NT ; 
Sa [sin( 1 ) sin( 7 J sin(ne). 


'The MATLAB script 


epsilon = pi/12; % set up parameter for fs coefficient 
n = 1:20; % number of harmonics 


l [cos ( did + ) cos ( Be 
ne 
T 4 4 
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Eid | | 
0 " | Uia. eso. 1 112] 
0 5 10 15 20 


n 


Figure 5.1.4: Temporal spacing and frequency spectrum of nonuniformly spaced snow tire treads. 


argl = (pi/2)*n; arg2 = (pi/4)*n; arg3 = epsilon*n; 
% compute the Fourier coefficient an 

an = (cos(arg1) + cos(arg2)).*sin(arg3); 

an = (2/pi) * an./n; 

^ compute the Fourier coefficient bn 

bn = (sin(arg2) - sin(arg1)).*sin(arg3) ; 

bn = (2/pi) * bn./n; 

^ compute the magnitude 

cn = 0.5 * sqrt(an.*an + bn.*bn); 

^ add in the à. 0 term 

cn = [2*epsilon/pi,cn]; 

n = [0,n]; 

clf % clear any figures 

axes(?FontSize?,20) % set font size 
stem(n,cn,’filled’) % plot spectrum 

set (gca,?PlotBoxAspectRatio',[8 4 1]) 4 set aspect ratio 
xlabel(’n’) % label x-axis 

ylabel(’?( à n^2 + bn°2 )^(1/2)/2?) % label y-axis, 
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was used to compute the amplitude of each harmonic as a function of n and the results 
were plotted. See Figure 5.1.4. The important point is that our new choice for the spacing 
of the treads has reduced or eliminated some of the harmonics compared to the case of 
equally spaced treads. On the negative side we have excited some of the harmonics that 
were previously absent. However, the net effect is advantageous because the treads produce 


less noise at more frequencies rather than a lot of noise at a few select frequencies. 
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If we were to extend this technique so that the treads occurred at completely random 
positions, then the treads would produce very little noise at many frequencies and the total 
noise would be comparable to that generated by other sources within the car. To find the 
distribution of treads with the whitest noise? is a process of trial and error. Assuming 
a distribution, we can perform a Fourier analysis to obtain its frequency spectrum. If 
annoying peaks are present in the spectrum, we can then adjust the elements in the tread 
distribution that may contribute to the peak and analyze the revised distribution. You are 
finished when no peaks appear. 


Problems 


Find the Fourier series for the following functions. Using MATLAB, plot the Fourier spec- 
trum. Then plot various partial sums and compare them against the exact function. 


En" —7<t<0 t, —7<t<0 
» 5- [5 Üct«m i 5- [5 0ct«m 
| ft, —7<t<0 1/2 +t, -1<t<0 
& (9-7 0ct«m : 50-112*7 0<t<1 
0, —7<t<0 0, —r < t< —n[2 
5. f= t  0Ostzm2 6. f(t) 2 4 sin(2t), -7/2 < t< 1/2 
n—t, n/2<t<a 0, n/2«XtX€m 
7 ft) 2e", | -L«t«L 8. f(t)=t+?, | —L«t«L 
0 —m7<t<0 1 1 
" ; Sts t, lt 
a r0 = asta, 0ctzm 10 HO as Pere? 
’ 2>*>2 
u 0, —a«t«0 0 —m«t«0 
u 10 [x 0ct«a 12 (0 - [e Üscsem 
13. f(t) = (1—10)/2, 0«t«2 14. f(t) =tcos(at/L), -L«t«L 
x (2L — x), 0<t<2L 


15. f(t) =sinh[a(a/2—|t|)], -m<t<m 16. f(t) = 
x? —6La4+8L?, 2b<t<4L 


5.2 PROPERTIES OF FOURIER SERIES 


In the previous section we introduced the Fourier series and showed how to compute 
one given the function f(t). In this section we examine some particular properties of these 
series. 


8 White noise is sound that is analogous to white light in that it is uniformly distributed throughout 
the complete audible sound spectrum. 
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| Differentiation of a Fourier series 


In certain instances we only have the Fourier series representation of a function f(t). 
Can we find the derivative or the integral of f(t) merely by differentiating or integrating the 
Fourier series term by term? Is this permitted? Let us consider the case of differentiation 
first. 

Consider a function f(t) of period 2L, which has the derivative f'(t). Let us assume 
that we can expand f(t) as a Fourier series. This implies that f'(t) is continuous except 
for a finite number of discontinuities and f(t) is continuous over an interval that starts at 
t = T and ends at t = 7 + 2L. Then 


MNA t t 
Iu e P t, cos( 2 ) s,sin( 7 ) (5.2.1) 


where we denoted the Fourier coefficients of f’(t) with a prime. Computing the Fourier 
coefficients, 


; 1 T+2L 1 
y=} f&-ilf(ce2-H0)-9. (5.2.2) 
if f(r + 2L) = f(r). Similarly, by integrating by parts, 
x x E j nnt 
d, =F f f (t) cos T dt (5.2.3) 
1 nztN1 7? nr [rb . [nrt 
NTDn 
= 5.2. 
E ( 5) 
and 
TN zi i . nnt 
b, = nad f (t) sin L dt (5.2.6) 
1 . (nat COLO ago (TO nat 
WT s 
— .2.8 
i (5.2.8) 


Consequently, if we have a function f(t) whose derivative f'(t) is continuous except for a 
finite number of discontinuities and f(T) = f(T + 2L), then 


f(t) = 2 a p cos( 22t) = sin( 22) (5.2.9) 


That is, the derivative of f(t) is given by a term-by-term differentiation of the Fourier series 


of f(t). 
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e Example 5.2.1 


'The Fourier series for the periodic function 


0, —1<t<0, 
f(t) = t O<t<n/2, — f(t)=f(t+2n), (5.2.10) 

T-t, m/2xt&m, 

is " 7 
(0-2 - »» Bd "n Z 3 E sin[2n — 1)¢] (5.2.11) 


Because f(t) is continuous over the entire interval (—7,7) and f(—7) = f(z) = 0, we can 
find f'(t) by taking the derivative of Equation 5.2.11 term by term: 


PA= : SS Aan Dt Z y -— cos|(2n — 1)t]. (5.2.12) 


This is the same Fourier series that we would obtain by computing the Fourier series for 


0, —-m«t«0, 
F= | 1, 0<t<7/2, (5.2.13) 
—1, n/2«t«m. 


| Integration of a Fourier series 


To determine whether we can find the integral of f(t) by term-by-term integration of 
its Fourier series, consider a form of the antiderivative of f(t): 


F(t) EJ [/m - =| dr. (5.2.14) 
Now 
t t+2L 

F+) = | [f - 2 ar+ f [f - 2 dr (5.2.15) 

L 
= F(t) e lig z 2 dr (5.2.16) 

"m 
= F(t) + UR f(r) dr — Lag = F(t), (5.2.17) 


so that F(t) has a period of 2L. Consequently we may expand F(t) as the Fourier series 


oo 


F(t) = a +>" An cos( 27t) 4B, ss (^). (5.2.18) 
n=1 
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For An, 
m=z fF Neos” z) a (5.2.19) 
1 sin(nat/L) 1 st Gol . f nmt 
z E nn/L || E 3 NT P HG B 3] sa( 2) dt (5.2.20) 
DE i 2.21 
a (5 ) 
Similarly, 
An 
Bn = .2.22 
nrt/L (5 ) 
Therefore, 
t 
_ aot a, sin(nmt/ L) — bn cos(nmt/L) 
/ fac SY Sum (5.2.23) 


This is identical to a term-by-term integration of the Fourier series for f(t). Thus, we can 
always find the integral of f(t) by a term-by-term integration of its Fourier series. 


e Example 5.2.2 


The Fourier series for f(t) = t for —r < t < v is 


o2: c 


sin(nt). (5.2.24) 


To find the Fourier series for f(t) = t?, we integrate Equation 5.2.24 term by term and find 
that : 
7? > (—1)" 
DUM we Se d 
n=1 
But $77 ,(-1)"/n? = —7?/12. Substituting and multiplying by 2, we obtain the final 


result that 
T’? 
erc 


| Parseval’s equality 


One of the fundamental quantities in engineering is power. The power content of a 
periodic signal f(t) of period 2L is |o f? (t) dt/ L. This mathematical definition mirrors 
the power dissipation I?R that occurs in a resistor of resistance R where I is the root mean 
square (RMS) of the current. We would like to compute this power content as simply as 
possible given the coefficients of its Fourier series. 


(5.2.25) 


y 


cos(nt). (5.2.26) 
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Assume that f(t) has the Fourier series 


29 y LAMP 2m 
OE PX Y ) + bnsin( L ) (5.2.27) 
'Then, 
inpr 2 a DUM rg, [tt nrt 
= t) dt = 2 t) dt Sn t 2T) 
;j fee-x[ OaD) ft) cos (575) a 
go 7T+2L 
. [nat 
2x4 f(t) sn( 2t) dt (5.2.28) 
a2 un 
=a + (am t+ bn): (5.2.29) 
n=1 


Equation 5.2.29 is Parseval’s equality.? It allows us to sum squares of Fourier coefficients 
(which we have already computed) rather than performing the integration D tcs J^ at 
analytically or numerically. 


e Example 5.2.3 


The Fourier series for f(t) = t? over the interval [—7, 7] is 


v= un + 133 = cos(nt) (5.2.30) 
; 45 n | 2. 
'Then, by Parseval's equality, 
tf tan | = eis E 1160 (5.2.31) 
"nj. dri, 18 an C7 5 18/7 an P 


Consequently, 


25/90 = M7 LN (5.2.32) 


| Gibbs phenomena 


In the actual application of Fourier series, we cannot sum an infinite number of terms 
but must be content with N terms. If we denote this partial sum of the Fourier series by 


9 Parseval, M.-A., 1805: Mémoire sur les séries et sur l'intégration complète d'une équation aux dif- 
férences partielles linéaires du second ordre, à coefficients constants. Mémoires présentés a l’Institut des 
sciences, lettres et arts, par divers savans, et lus dans ses assemblées: Sciences mathématiques et Physiques, 
1, 638-648. 
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Figure 5.2.1: The scanning function over 0 € x < 27 for N = 5. 


Syn (t), we have from the definition of the Fourier series: 


N 
Sn(t) = žao + a cos(nt) + b, sin(nt) 5.2.33) 
n=1 
1 2m 1 2m N 
m ; f(a) dx + =a f(z) [Seon cos(nx) + sin(nt) sin(nz)| dx (5.2.34) 


1 of is te 
-if fg + Scott al a 5.2.35) 


EE SRI + i)(z- t)] 
Tah IO sw - 0) 


— dz. 5.2.36) 
2m Jo 


The quantity sin[(N + 3)(z — t)]/ sin[3(x — t)] is called a scanning function. Over the 
range 0 < x < 2m it has a very large peak at x = t where the amplitude equals 2N + 1. 
See Figure 5.2.1. On either side of this peak there are oscillations that decrease rapidly 
with distance from the peak. Consequently, as N — oo, the scanning function becomes 
essentially a long narrow slit corresponding to the area under the large peak at x — t. If we 
neglect for the moment the small area under the minor ripples adjacent to this slit, then 
the integral, Equation 5.2.36, essentially equals f(t) times the area of the slit divided by 
2s. If 1/2» times the area of the slit equals unity, then the value of Sy (t) ~ f(t) to a good 
approximation for large N. 

For relatively small values of N, the scanning function deviates considerably from its 
ideal form, and the partial sum Sy(t) only crudely approximates f(t). As the partial sum 
includes more terms and N becomes relatively large, the form of the scanning function 
improves and so does the agreement between Sw(t) and f(t). The improvement in the 
scanning function is due to the large hump becoming taller and narrower. At the same time, 
the adjacent ripples become more numerous as well as narrower in the same proportion as 
the large hump does. 
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Figure 5.2.2: The finite Fourier series representation Sy (t) for the function, Equation 5.2.38, for the range 
—1 <t <7 for N = 27 and N = 8l. 


The reason why Sy (t) and f(t) will never become identical, even in the limit of N — oo, 
is the presence of the positive and negative side lobes near the large peak. Because 


. i " 
EET wait? > cosin(t — a)], (5.2.37) 


n=1 


an integration of the scanning function over the interval 0 to 27 shows that the total area 
under the scanning function equals 27. However, from Figure 5.2.1 the net area contributed 
by the ripples is numerically negative so that the area under the large peak must exceed 27 
if the total area equals 27. Although the exact value depends upon N, it is important to 
note that this excess does not become zero as N — oo. 

Thus, the presence of these negative side lobes explains the departure of our scanning 
function from the idealized slit of area 27. To illustrate this departure, consider the function: 


M (5.2.38) 
Then, 
4E fens pes 1 f? sin((N + $)(z—0)] 
ic E EID F Use n puo 
— 1 (7 fsi[(N-c 3)r-1£) , , sin + 3)(x  £) k 
-5 f l sin[1(x — t)] de sin[i(x + t)] de) E 
1 [77 sin((N + 1)6] 1 f7**' sin[(N + 296] 
EP. f. sin($0) d x sin(40) id procu 


The first integral in Equation 5.2.41 gives the contribution to Sy(t) from the jump dis- 
continuity at t = 0 while the second integral gives the contribution from t = m. In Figure 
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5.2.2 we have plotted S(t) when N = 27 and N = 81. Residual discrepancies remain 
even for very large values of N. Indeed, as N increases, this figure changes only in that 
the ripples in the vicinity of the discontinuity of f(t) proportionally increase their rate of 
oscillation as a function of t while their relative magnitude remains the same. As N — oo 
these ripples compress into a single vertical line at the point of discontinuity. True, these 
oscillations occupy smaller and smaller spaces but they still remain. Thus, we can never 
approximate a function in the vicinity of a discontinuity by a finite Fourier series without 
suffering from this over- and undershooting of the series. This peculiarity of Fourier series 
is called the Gibbs phenomena. Gibbs phenomena can only be eliminated by removing 
the discontinuity.!! 


Problems 


Additional Fourier series representations can be generated by differentiating or integrating 
known Fourier series. Work out the following two examples. 


1. Given 
—2 [(2 1)a 
n? i d n+ LN jus 
(2n + 1)? 
n=0 
obtain 
25 2 oo 8 2 1 
TI LS SUE )x] üccc m, 


(Qn+13 ' 


by term-by-term integration. Could we go the other way, i.e., take the derivative of the 
second equation to obtain the first? Explain. 


2. Given 
T32 Z cos(nx) 
= 1 n+l < < 
obtain 
r-r? — 5i pen sin(nz) d 
2  £ "NN "hes Per 


by term-by-term integration. Could we go the other way, i.e., take the derivative of the 
second equation to obtain the first? Explain. 


3. (a) Show that the Fourier series for the odd function: 


2t +t? —-2«t«0, . 32 = 1 . 
t — ? sal — * t — 
FO E psica, 05 f T 25(m-l18 a 


10 Gibbs, J. W., 1898: Fourier’s series. Nature, 59, 200; Gibbs, J. W., 1899: Fourier’s series. Nature, 
59, 606. For the historical development, see Hewitt, E., and R. E. Hewitt, 1979: The Gibbs-Wilbraham 
phenomenon: An episode in Fourier analysis. Arch. Hist. Exact Sci., 21, 129-160. 


11 For a particularly clever method for improving the convergence of a trigonometric series, see Kan- 
torovich, L. V., and V. I. Krylov, 1964: Approximate Methods of Higher Analysis. Interscience, pp. 77-88. 
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(b) Use Parseval's equality to show that 


ml om 2n — 1)6 


This series converges very rapidly to 19/960 and provides a convenient method for comput- 
; 6 
ing 7°. 


5.3 HALF-RANGE EXPANSIONS 


In certain applications, we will find that we need a Fourier series representation for 
a function f(x) that applies over the interval (0, D) rather than (—L, L). Because we are 
completely free to define the function over the interval (— L, 0), it is simplest to have a series 
that consists only of sines or cosines. In this section we shall show how we can obtain these 
so-called half-range expansions. 

Recall in Example 5.1.3 how we saw that if f(x) is an even function, then bn = 0 for all 
n. Similarly, if f(x) is an odd function, then ao = an = 0 for all n. We now use these results 
to find a Fourier half-range expansion by extending the function defined over the interval 
(0, L) as either an even or odd function into the interval (—L,0). If we extend f(a) as an 
even function, we will get a half-range cosine series; if we extend f(a) as an odd function, 
we obtain a half-range sine series. 

It is important to remember that half-range expansions are a special case of the general 
Fourier series. For any f(x) we can construct either a Fourier sine or cosine series over the 
interval (— L, L). Both of these series will give the correct answer over the interval of (0, L). 
Which one we choose to use depends upon whether we wish to deal with a cosine or sine 
series. 


e Example 5.3.1 
Let us find the half-range sine expansion of 
f(x) =1, O<a<n. (5.3.1) 


We begin by defining the periodic odd function 


7 —1, -r «m «0, 
f(x) = { L oca. (5.3.2) 


with f(x + 2r) = f(x). Because f(x) is odd, ay = an = 0 and 


Def 2 2 2 
by = Ji lsin(nz)dx = ——cos(nx ee = —— [cos(nr) — 1] = [(—1)" — 1]. 

T Jo no n nT 

(5.3.3) 

The Fourier half-range sine series expansion of f(x) is therefore 

2 & [I — (—1)*] 4 « sin[(2m — 1)z] 

= = 5.3.4 

fa) = 2 HH inn) = = y; SR — (5.3.4) 
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Figure 5.3.1: Partial sum of N terms in the Fourier half-range sine representation of a square wave. 


As counterpoint, let us find the half-range cosine expansion of f(z) 21,0 « x « m. 
Now, we have that bn = 0, 


9 T 
ao = F ldr = 2, (5.3.5) 
7T Jo 
and 
2 J" 2. T 
à, — — | cos(nx)dz = — sin(nz)|o =0. (5.3.6) 
T Jo no 


Thus, the Fourier half-range cosine expansion equals the single term: 
f(x) — 1, O<T<T. (5.3.7) 


This is perfectly reasonable. To form a half-range cosine expansion we extend f(x) as 
an even function into the interval (—7,0). In this case, we would obtain f(x) = 1 for 
—m < x « n. Finally, we note that the Fourier series of a constant is simply that constant. 
In practice it is impossible to sum Equation 5.3.4 exactly and we actually sum only the 
first N terms. Figure 5.3.1 illustrates f(x) when this Fourier series contains N terms. As 
seen from the figure, the truncated series tries to achieve the infinite slope at z — 0, but in 
the attempt, it overshoots the discontinuity by a certain amount (in this particular case, by 
17.996). This is another example of the Gibbs phenomena. Increasing the number of terms 
does not remove this peculiarity; it merely shifts it nearer to the discontinuity. 


e Example 5.3.2: Inertial supercharging of an engine 


An important aspect of designing any gasoline engine involves the motion of the fuel, 
air, and exhaust gas mixture through the engine. Ordinarily an engineer would consider the 
motion as steady flow; but in the case of a four-stroke, single-cylinder gasoline engine, the 
closing of the intake valve interrupts the steady flow of the gasoline-air mixture for nearly 
three quarters of the engine cycle. This periodic interruption sets up standing waves in the 
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intake pipe - waves that can build up an appreciable pressure amplitude just outside the 
input value. 

When one of the harmonics of the engine frequency equals one of the resonance fre- 
quencies of the intake pipe, then the pressure fluctuations at the valve will be large. If the 
intake valve closes during that portion of the cycle when the pressure is less than average, 
then the waves will reduce the power output. However, if the intake valve closes when the 
pressure is greater than atmospheric, then the waves will have a supercharging effect and 
will produce an increase of power. This effect is called inertia supercharging. 

While studying this problem, Morse et al.!? found it necessary to express the velocity 
of the air-gas mixture in the valve, given by 


0, =r < wt < —mn/4, 
f(t) = 4 v cos(2wt)/2, —mn/4 < wt < 7/4, (5.3.8) 
0, n/A « wt « m, 


in terms of a Fourier expansion. The advantage of working with the Fourier series rather 
than the function itself lies in the ability to write the velocity as a periodic forcing function 
that highlights the various harmonics that might resonate with the structure comprising 
the fuel line. 

Clearly f(t) is an even function and its Fourier representation will be a cosine series. 
In this problem 7 = —7/w, and L = m/w. Therefore, 


2 7 / Au) ndn 
inis 7 cos(2ut) dt = 4 sin(2wt) T7 = 1, (5.3.9) 
T =r /4w Tn 
and 
2 m/ Aw t 
ae = con(2ut) cos ( 27) dt (5.3.10) 
T J—m«/Aw m/w 
w m / Aw) 
25 {cos|(n + 2)wt] + cos|(n — 2)wt]) dt (5.3.11) 
—m/Aw 
(n2) | sm(n-2od| ^" 
sin[(n4-2)wt sin[(n—2)wt 
2(n+2) T 2(n—2) m n # 2, 
= | n/d (5.3.12) 
2 4 Sin (eot) n=2, 
—mn/ Aw 
4 (nm 
_ -= cos (24), n#2, (5.3.13) 
4i n=2 dui 


Plotting these Fourier coefficients using the MATLAB script: 
for m = 1:21; 
n = m-1; 4 compute the indices for the harmonic 
^4 compute the Fourier coefficients an 
if n == 2; an(m) = pi/4; else; 
an(m) = 4.*cos(pixn/4)/(4-n*n); end; 


12 Morse, P. M., R. H. Boden, and H. Schecter, 1938: Acoustic vibrations and internal combustion engine 
performance. I. Standing waves in the intake pipe system. J. Appl. Phys., 9, 16-23. 
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Figure 5.3.2: The spectral coefficients of the Fourier cosine series of the function given by Equation 5.3.9. 


end 

nn-0:20; 4 create indices for x-axis 
fzero-zeros(size(nn)); ^4 create the zero line 

clf ^ clear any figures 

axes(’FontSize’,20) % set font size 

stem(nn,an,’filled’) % plot spectrum 

hold on 

plot(nn,fzero,’-’) % plot the zero line 

set (gca,’PlotBoxAspectRatio’,[8 4 1]) ^ set aspect ratio 
xlabel(’n’) % label x-axis 


ylabel(?a n?) % label y-axis, 


we see that these Fourier coefficients become small rapidly (see Figure 5.3.2). For that 
reason, Morse et al. showed that there are only about three resonances where the acoustic 
properties of the intake pipe can enhance engine performance. These peaks occur when 
q = 30c/NL = 3,4, or 5, where c is the velocity of sound in the air-gas mixture, L is the 
effective length of the intake pipe, and N is the engine speed in rpm. See Figure 5.3.3. 
Subsequent experiments!? verified these results. 

Such analyses are valuable to automotive engineers. Engineers are always seeking ways 
to optimize a system with little or no additional cost. Our analysis shows that by tuning 
the length of the intake pipe so that it falls on one of the resonance peaks, we could obtain 
higher performance from the engine with little or no extra work. Of course, the problem is 
that no car always performs at some optimal condition. 


Problems 


Find the Fourier cosine and sine series for the following functions. Then, use MATLAB to 
plot the Fourier coefficients. 


1. f(t) =t, 0ct«m 2. f(t) =r —t, 0Oct«m 


3. f(t) ^ t(a — t), 0ct«a 470) =e", 0<t<a 


13 Boden, R. H., and H. Schecter, 1944: Dynamics of the inlet system of a four-stroke engine. NACA 
Tech. Note 935. 
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1220 RPM. 
2.56" diam. 


Figure 5.3.3: Experimental verification of inertial supercharging within a gasoline engine resulting from 
the resonance of the air-gas mixture and the intake pipe system. The peaks correspond to the n = 3,4, 
and 5 harmonics of the Fourier representation, Equation 5.3.13, and the parameter q is defined in the text. 
(From Morse, P., R. H. Boden, and H. Schecter, 1938: Acoustic vibrations and internal combustion engine 
performance. J. Appl. Phys., 9, 17 with permission.) 


t, 0<t<i t, O<t<l 
5. f) wee 6.) ={ 
1-t, $<t<l 1, 1<t<2 
0, jaiz 
7. f(t) 2? —8, 0«t«m 8. f(t) = A 
1, x=<t<a 
2 
a 
0, TES 0, Us ts 
a a 2a 3a 
that <t< Ls pat. xb 
a 2a a 
0 — <t 
3 uw bes ? 4 «a 
2t 
L. debet = 0<t<5 
Uy e 5 $ PAJA ui uv. og 
Z — < 
1, 3 <t<a oa sts 
t, Quis 
-t 
13. f(t) = 14. f(t) = —, O<t<a 
S. nexu “ 
2^ qw 


15. Using the relationships!^ that 


14 Gradshteyn, I. S., and I. M. Ryzhik, 1965: Table of Integrals, Series, and Products. Academic Press, 
Section 3.753, Formula 2 and Section 3.771, Formula 8. 
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1 u v 
cos(ax) T 3 cd yr (2u i 
~= dr = = — =A) T 2) Jp ; 
EY ae x 5 Jo(a), and K (u^ —z^)"^3 cos(ax) dx ae (v +4) J (au) 
with a > 0, u > 0, 8(v) > —5, obtain the following half-range expansions: 
z den SN Jeos(nmz), 0<r<1 
jE A nT nma), <l, 
vl-z? 2 = ? 
and 
2 
v1-z222 3 ae Ue E | cos|(2n — 1)z2/2], 0 « x « I. 


Here J,(-) denotes the Bessel function of the first kind and order v (see Section 6.5) and 
T(-) is the gamma function.? 


16. The function 


wiri oa optet) eat. 0ct« 
=1- —+(a—1)>+(a — -a— T 
"m ne T3 TE’ 


is a curve fit to the observed pressure trace of an explosion wave in the atmosphere. Because 
the observed transmission of atmospheric waves depends on the five-fourths power of the 
frequency, Reed!Ó had to re-express this curve fit as a Fourier sine series before he could 
use the transmission law. He found that 


f(t) = 23 L | sa : ? sin(2nt) 
]a 2 2(a — 1) 48a 
t T » 2n —1 | z?(2n - 1)?  m*(2n — 1)* spon Ld 


Confirm his result. 
5.4 FOURIER SERIES WITH PHASE ANGLES 


Sometimes it is desirable to rewrite a general Fourier series as a purely cosine or purely 
sine series with a phase angle. Engineers often speak of some quantity leading or lagging 
another quantity. Re-expressing a Fourier series in terms of amplitude and phase provides 
a convenient method for determining these phase relationships. 

Suppose, for example, that we have a function f(t) of period 2L, given in the interval 
[—L, L], whose Fourier series expansion is 


E IX ED sin( 27t). (5.4.1) 


15 Gradshteyn and Ryzhik, op. cit., Section 6.41. 


16 Reed, J. W., 1977: Atmospheric attenuation of explosion waves. J. Acoust. Soc. Am., 61, 39-47. 
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We wish to replace Equation 5.4.1 by the series: 


| ao = . f nnt 
f) > Bn sin( + en] (5.4.2) 
To do this we note that 
t t t 
B, sin( 27 + Pn) — a cos( 27t) + by sin( 27t) (5.4.3) 
. [nnt : nrt 
= Bn sin( ^) cos(Yn) + Bn sin(pn) cos( 27t) : (5.4.4) 


We equate coefficients of sin(nzt/ LL) and cos(nzt/ LL) on both sides and obtain 
an = B, sin(qa), and b, = Bn cos(Pn)- (5.4.5) 
Hence, upon squaring and adding, 
B, = Va +, (5.4.6) 
while taking the ratio gives 


Pn = tan” '(an/bn). (5.4.7) 


Similarly we could rewrite Equation 5.4.1 as 


ao = nrt 
t2 29 As cos| —— to), 4.8 
f(t) 2725 cos( 77 tes) (5.4.8) 
where 
An = ya? 402, and Yn = tan ^! (54/04), (5.4.9) 
and 
an = An cos(Yn), and bn = —A, sin(Yn). (5.4.10) 


In both cases, we must be careful in computing Yn because there are two possible values of 
Yn that satisfy Equation 5.4.7 or Equation 5.4.9. These angles Yn must give the correct an 
and b, using either Equation 5.4.5 or Equation 5.4.10. 


e Example 5.4.1 


The Fourier series for f(t) = e* over the interval —L < t < L is 


sinh(aL) : e^ aL(-1)* nrt 
f(t) = — L + 2sinh(aL) > ar naa Ol FT 
; L nr(-1)}  , (nat 
— 2sinh(aL) > an sin( y ) ; (5.4.11) 


Let us rewrite Equation 5.4.11 as a Fourier series with a phase angle. Regardless of whether 
we want the new series to contain cos(nat/L + pn) or sin(nat/L + pn), the amplitude A, 
or B,, is the same in both series: 


2sinh(aL) 


MET TT oen 
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If we want our Fourier series to read 


_ sinh(aL) . X cos(nzt/ L + Yn) 
je EUMD ae (5.4.13) 
then j 
Pn = tan”! (-2) = tan! (7), (5.4.14) 
Qn aL 


where Yn lies in the first quadrant if n is even and in the third quadrant if n is odd. This 
ensures that the sign from the (—1)" is correct. 
On the other hand, if we prefer 


. sinh(aL) sin(nzt/ L + Pn) 
f(t) = — z *2sinh(aL) ys cae) VENDU | (5.4.15) 


n=1 


" L 
Pn = tan"! (£) = — tan! (=) (5.4.16) 
bn nT 


where yn lies in the fourth quadrant if n is odd and in the second quadrant if n is even. 


then 


Problems 


Write the following Fourier series in both the cosine and sine phase angle form: 


T M | 5 Paler Unt 2. f(t) = : | zy = E d 
3. f(t) = -25 En sin(nt) 4. fi) = 5 D a 2 
an 2 next 2n — 1)? 


5.5 COMPLEX FOURIER SERIES 


So far in our discussion, we expressed Fourier series in terms of sines and cosines. We 
are now ready to re-express a Fourier series as a series of complex exponentials. There 
are two reasons for this. First, in certain engineering and scientific applications of Fourier 
series, the expansion of a function in terms of complex exponentials results in coefficients 
of considerable simplicity and clarity. Second, these complex Fourier series point the way 
to the development of the Fourier transform in the next chapter. 

We begin by introducing the variable w, = na/L, where n = 0,+1,+2,... Using 
Euler's formula we can replace the sine and cosine in the Fourier series by exponentials and 
find that 


ie x y (gest cms 


ao z An bni iw,t an bni —iwnt 
= EAS n — + — Aes 5.2 
; » (4 z) «(S33 (5.5.2) 


a (ennt — tent) (5.5.1) 
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If we define cn = i(an — ibn), then 


1 T+2L 1 T-2L 
m _ = ABS Mes —iwnt 
Cn = 5(Gn — ibn) = 2r f f (t)[cos(wnt) — isin(wnt)] dt 3r. f f(t)e dt. 
(5.5.3) 
Similarly, the complex conjugate of cn, c}, equals 
1 T42L : 
c, = $(as + ibn) = 3r f fOe dt. (5.5.4) 
To simplify Equation 5.5.2 we note that 
o (nr nm 
Wain = = Wns (5.5.5) 


which yields the result that 


1 T+2L 1 T+2L . 
C-n = x | fte -tdt = xz | f(thetntdt = et (5.5.6) 


so that we can write Equation 5.5.2 as 


ues ye gn aL of etal = ee ye eet te je innt (5.5.7) 
Letting n = —m in the second summation on the right side of Equation 5.5.7, 
oo —oo —1 —1 
5 ope ^S 5 Qué n= 5 Gu, ei mt = 5 Cnet, (5.5.8) 
n=1 m=—1 m--—oo n--—oo 


where we introduced m = n into the last summation in Equation 5.5.8. Therefore, 


Tex eint + 5 geo, (5.5.9) 


n=—Co 


On the other hand, 
ao 1 T+2L x 
2 7 OL f(t) dt = co = egent, (5.5.10) 


because wọ = 07 /L = 0. Thus, our final result is 


y due, (5.5.11) 


n=— oo 


where 


1 T+2L f 
us xl f (t)e 9 dt (5.5.12) 
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and n = 0,+1,+2,.... Note that even though c, is generally complex, the summation 
Equation 5.5. T always gives a real-valued function f (t). 

Just as we can represent the function f(t) graphically by a plot of t against f(t), we 
can plot c, as a function of n, commonly called the frequency spectrum. Because c, is 
generally complex, it is necessary to make two plots. Typically the plotted quantities are 
the amplitude spectra |cn| and the phase spectra yn, where v; is the phase of cn. However, 
we could just as well plot the real and imaginary parts of c,. Because n is an integer, these 
plots consist merely of a series of vertical lines representing the ordinates of the quantity 
[e| or Yn for each n. For this reason we refer to these plots as the line spectra. 

Because 2c, = an — ibn, the coefficients cn for an even function will be purely real; the 
coefficients c, for an odd function are purely imaginary. It is important to note that we 
lose the advantage of even and odd functions in the sense that we cannot just integrate over 
the interval 0 to L and then double the result. In the present case we have a line integral 
of a complex function along the real axis. 


e Example 5.5.1 


Let us find the complex Fourier series for 


1, 0ct«m, 
f(t) 2 n Lube (5.5.13) 
which has the periodicity f(t + 27) = f(t). 
With L =r and T = —7, wn = nt/L =n. Therefore, 
ei Tod Ler dt + : fo PME (5.5.14) 
Cn = on a Da € oO. 
ee, j 1 int | 
= — e" —-——e "n 5.5.15 
2n 2nri 0 ( ) 
=—— (1- ee MS s nmi ] 5 
2nm (ses: 2nm (c E (STO) 


if n Z 0. Because e”™* = cos(nz) +isin(nr) = (—1)" and e~"™ = cos(—n7) +isin(—n7) = 


(—1)", then 
i E 0, n even, 
Cn = E < (—1) ] = Là. n odd, (5.5.17) 
with = 
iOS yee. (5.5.18) 


In this particular problem we must treat the case n = 0 specially because Equation 
5.5.15 is undefined for n = 0. In that case, 


0 T 
nS x s 1)dt4 x] (dt = z-Co[*, + Ol <0. (5.5.19) 


Because cp = 0, we can write the expansion: 


f)--2 Quo cud (5.5.20) 
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Figure 5.5.1: Amplitude and phase spectra for the function, Equation 5.5.13. 


since we can write all odd integers as 2m — 1, where m = 0, +1 
MATLAB script 


max = 31; % total number of harmonics 

mid = (max*1)/2; % in the array, location of c.0 

for m = i:max; 
n =m - mid; % compute value of harmonic 

^4 compute complex Fourier coefficient c.n = (cnr,cni) 
if mod(n,2) == 0; cnr(m) = 0; cni(m) = 0; else; 
cnr(m) = 0; cni(m) = - 2/(pi*n); end; 

end 

nn-(i-mid):(max-mid); ^4 create indices for x-axis 

fzero-zeros(size(nn)); 4 create the zero line 

clf % clear any figures 

amplitude = sqrt(cnr.*cnr+cni.*cni) ; 

phase = atan2(cni,cnr); 

^ plot amplitude of cn 

subplot(2,1,1), stem(nn,amplitude, ’filled’) 

^4 label amplitude plot 

text (6,0.75, amplitude’ ,’ FontSize’ ,20) 

subplot (2,1,2), stem(nn,phase,’filled’) % plot phases of cn 

text(7,1,’phase’,’FontSize’,20) % label phase plot 

xlabel(’n’,’Fontsize’,20) ^ label x-axis, 


Using the 


we plot the amplitude and phase spectra for the function, Equation 5.5.13, as a function of 


n in Figure 5.5.1. 


e Example 5.5.2 


The concept of Fourier series can be generalized to multivariable functions. Consider 
the function f(x,y) defined over 0 < x < L and 0 < y < H. Taking y constant, we have 
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that P 
1 2 
en(y) = if f(z,y) dm, én = TS, (5.5.21) 
Ed L 
Similarly, holding €, constant, 
p 2 ; 2 
Cnm = al en(yje dy, — n, = (5.5.22) 


Therefore, the (complex) Fourier coefficient for the two-dimensional function f(x,y) is 


1 PP (€ ) 
= ee 5.23 


assuming that the integral exists. 
To recover f(x,y) given Cnm, we reverse the process of deriving Cnm. Starting with 


= XO eae, (5.5.24) 
we find that -" 
fm) X age: (5.5.25) 
Therefore, 
flu y)es >> s Gade. (5.5.26) 


n=— oo m -—-—oo 


Problems 


Find the complex Fourier series for the following functions. Then use MATLAB to plot the 
corresponding spectra. 


1. f = lt, —rxtzm 2. f(t) =e, 0<t<2 

3. ft =t, 0<t<2 4.f(t) 28, -n<t<r 
_ J0, —n/2«t«0 u E 

s. - [f O<t<n/2 6. f(t)=t, —-1<t<1 


5.6 THE USE OF FOURIER SERIES IN THE SOLUTION OF ORDINARY 
DIFFERENTIAL EQUATIONS 


An important application of Fourier series is the solution of ordinary differential equa- 
tions. Structural engineers especially use this technique because the occupants of buildings 
and bridges often subject these structures to forcings that are periodic in nature.!” 


17 Timoshenko, S. P., 1943: Theory of suspension bridges. Part II. J. Franklin Inst., 235, 327-349; Inglis, 
C. E., 1934: A Mathematical Treatise on Vibrations in Railway Bridges. Cambridge University Press, 203 
Pp. 
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e Example 5.6.1 


Let us find the general solution to the ordinary differential equation 
y" + 9y = f(t), (5.6.1) 
where the forcing is 
f(t)=|t,  —m<t<n,  f(t+27)= F(t). (5.6.2) 


This equation represents an oscillator forced by a driver whose displacement is the saw-tooth 
function. 

We begin by replacing the function f(t) by its Fourier series representation because 
the forcing function is periodic. The advantage of expressing f(t) as a Fourier series is its 
validity for any time t. The alternative would be to construct a solution over each interval 
nm « t <(n+1)z and then piece together the final solution assuming that the solution and 
its first derivative are continuous at each junction t = nz. Because the function is an even 
function, all of the sine terms vanish and the Fourier series is 


nm 4 cos[(2n — 1)t] 

t| = : 5.6.3 

it 2 m 2 (2n — 1)? ( ) 

Next, we note that the general solution consists of the complementary solution, which 
equals 

yu (t) = Acos(3t) + B sin(3t), (5.6.4) 


and the particular solution yj (t), which satisfies the differential equation 


" T 4 cos[(2n — 1)t] 
| = ‘ 6.5 


To determine this particular solution, we write Equation 5.6.5 as 


4 
= cos(t) 


4 4 
Yp 9p = = cos(3t) — —— cos(5t) — --.. (5.6.6) 


9 257 


By the method of undetermined coefficients, we guess the particular solution: 


Yp(t) = > + a1 cos(t) + by sin(t) + a» cos(3t) + bo sin(3t) + --- (5.6.7) 
yp(t) = $ao + 4 cos[(2n — 1)t] + bn sin[(2n — 1)t]. (5.6.8) 
Because = | 
yp (t) = M, - (2n — 1? (a, cos((2n — 1)t] + b, sin[(2n — 1)t]}, (5.6.9) 
3 — (2n — 1)? (a4 cos[(2n — 1)t] + bn sin[(2n — 1)t]) (5.6.10) 


+ 2ao + 9) “an cos[(2n — 1)t] + bn sin[(2n — 1)t] = 


n=1 n=1 


Cn- ' 
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or 
E — ; + > fi (2n — 1)?]a,, 4 mm D? \ cos|(2n — 1)t] 
+ »» [9 — (2n — 1)?]b, sin[(2n — 1)t] = 0. (5.6.11) 


Because Equation 5.6.11 must hold true for any time, each harmonic must vanish separately 


and 
T 4 


~9 "^7 «(n — 1219 — Qn - 1y]] 
and bn = 0. All of the coefficients an are finite except for n = 2, where ag becomes undefined. 
This coefficient is undefined because the harmonic cos(3£) in the forcing function resonates 
with the natural mode of the system. 

Let us review our analysis to date. We found that each harmonic in the forcing func- 
tion yields a corresponding harmonic in the particular solution, Equation 5.6.8. The only 
difficulty arises with the harmonic n — 2. Although our particular solution is not correct 
because it contains cos(3t), we suspect that if we remove that term then the remaining 
harmonic solutions are correct. The problem is linear, and difficulties with one harmonic 
term should not affect other harmonics. But how shall we deal with the cos(3t) term in the 
forcing function? Let us denote that particular solution by Y (t) and modify our particular 
solution as follows: 


(5.6.12) 


Yp(t) = $ao + a1 cos(t) + Y (t) + a3 cos(5t) +--+. (5.6.13) 


Substituting this solution into the differential equation and simplifying, everything cancels 
except 


4 
Y” +9Y = xS cos(3t). (5.6.14) 
The solution of this equation by the method of undetermined coefficients is 
2 
Y(t) = ———tsi : .6. 
(t) 57, ¢ sin(3t) (5.6.15) 


This term, called a secular term, is the most important one in the solution. While the other 
terms merely represent simple oscillatory motion, the term ¢ sin(3t) grows linearly with time 
and eventually becomes the dominant term in the series. Consequently, the general solution 
equals the complementary plus the particular solution, or 


y(t) = Acos(3t) + B sin(3t) + a = jc tsin( 3) : Y e Dame pa: 86:1) 
Mer 


e Example 5.6.2 


Let us redo the previous problem only using complex Fourier series. That is, let us find 
the general solution to the ordinary differential equation 


oo TP E 
cin 1)t 


2 
m d 
D OU ug 1-25 gc 


n=—Co 


(5.6.17) 
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From the method of undetermined coefficients we guess the particular solution for 
Equation 5.6.17 to be 


wlt) — co -- X c, 07 Dt. (5.6.18) 
'Then T 
yit) 2 M. -Qn- 1)?e, £/09n-0t, (5.6.19) 


Substituting Equation 5.6.18 and Equation 5.6.19 into Equation 5.6.17, 


(2 Jis i T 2 o9 ei(2n— 1)t 
sat Y B- n — 1y7e, d x 2.0 nc (5.6.20) 
Because Equation 5.6.20 must be true for any t, 
T 2 
071g NS a= Kn — [n — 1? — 9] ( ) 
Therefore, 
T P giQn-1)t , 
t) 2 4 Non oe. 5.6.22 
wl) = 1g * - oe (Qn — D2[8n - 13? - 9]^ ( ) 
However, there is a problem when n = —1 and n = 2. Therefore, we modify Equation 5.6.22 
to read 
Vt) E +e te 4 2 Y cid ent (5.6.23) 
2 T : m £4 (2n - D? [Qn — 1)? - 9 dE s 
nz-—1,2 


Introducing Equation 5.6.23 into Equation 5.6.17 and simplifying, 
1 1 


OERS D arzen 5.6.24 
a= c S E 7 ( ) 
The general solution is then 
, , tesit peit gg L giQn- 0t 
t) = Ac? + Be3# 4. 7. . (5.6.25 
WT AP RECTUS Sm E X p» On yen ig -g l ) 


nz-—1,2 


'The first two terms on the right side of Equation 5.6.25 represent the complementary solu- 
tion. Although this expansion is equivalent to Equation 5.6.16, we have all of the advantages 
of dealing with exponentials rather than sines and cosines. These advantages include ease 
of differentiation and integration, and writing the series in terms of amplitude and phase. 


e Example 5.6.3: Temperature within a spinning satellite 


In the design of artificial satellites, it is important to determine the temperature distri- 
bution on the spacecraft’s surface. An interesting special case is the temperature fluctuation 
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in the skin due to the spinning of the vehicle. If the craft is thin-walled so that there is 
no radial dependence, Hrycak!? showed that he could approximate the nondimensional 
temperature field at the equator of the rotating satellite by 


dT dT 3 nc F(n)+ 8/4 
T = .6.2 
am e( i) 4 lapa IBS) 
where 
167S 7B S NV (1 gp/A I^ 

— 442,2 = {ict T = | — an 5.6.27 
b=4r'r*f/a, c m (cm). (=) 148 x f ) 

cos(2rn), 0<n< "m 
F(n) = 0, sisi (5.6.28) 

cos(277), 2<n<l, 


a is the thermal diffusivity of the shell, f is the rate of spin, r is the radius of the spacecraft, 
S is the net direct solar heating, 8 is the ratio of the emissivity of the interior shell to the 
emissivity of the exterior surface, e is the overall emissivity of the exterior surface, y is 
the satellite’s skin conductance, and ø is the Stefan-Boltzmann constant. The independent 
variable 7 is the longitude along the equator with the effect of rotation subtracted out 
(21) = p—27 ft). The reference temperature Ta equals the temperature that the spacecraft 
would have if it spun with infinite angular speed so that the solar heating would be uniform 
around the craft. We nondimensionalized the temperature with respect to Ty. 
We begin by introducing the new variables 


3 TB 2m? r? f np? 
T = Ap = — 5.6.29 
* 4 1644r OT ap? °° 4478 ( ) 
and pĝ = c so that Equation 5.6.26 becomes 
D PT (5.6.30) 
dr? 0 E 0 0 " .6. 


Next, we expand F'(7) as a Fourier series because it is a periodic function of period 1. 
Because it is an even function, 


f(n) = $ao + eo cos(2nmm), (5.6.31) 
n=1 
where 
1 1/4 1 1 2 
ao = vid. cos(272x) dx + ip [n cos(272) dx = zi (5.6.32) 
1 1/4 J 1 1 " 1 
ài = vs | cos" (27) dx + 1 a cos“ (27a) dz = 5 (5.6.33) 


18 From Hrycak, P., 1963: Temperature distribution in a spinning spherical space vehicle. AIAA J., 1, 
96-99. Reprinted with permission of the American Institute of Aeronautics and Astronautics. 
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and 
an = — cos(27x) cos(2naa) dx + —— f cos(2rx) cos(2nmz) dx 
1/2 f 1/2 Jaj4 

<2 2-1) cos (^) 

|». m(n? — 1) 2/' 
if n > 2. Therefore, 

1 1 2 y 
f(n) =- z + 5608 (277) — - xs g CS (4n). 


From the method of undetermined coefficients, the particular solution is 


oo 


yp) = 


n=l 


which yields 


tao + ay cos(277) + bı sin(2rn) + Xan cos(4nrn) + bən sin(4n7n), 


(5.6.34) 


(5.6.35) 


(5.6.36) 


(5.6.37) 


Yp(n) = —27a; sin(27n) + 215, cos(2n1) + x [—4n7a2, sin(4nz1) + 4n», cos(4nm1)], 


n=l 
(5.6.38) 
and 
yp (n) = —4r° [ay cos(2) + bi sin(27n)] z 16n?z? [azn cos(4n7n) + ben sin(4nrn)]. 
(5.6.39) 
Substituting into Equation 5.6.30, 
1 A A 
3/040 > — ( Ara, + Arpovobi — pear — 2) cos(277) 
+ (—4n?b; — Anpovoai — pobi) sin(277) 
= 2Ao(—1)” 
+ 2. [1612024 + 8nmpovobon Potan | IS EA cos(4n77) 
+ 5 (—16n7 17 bon — BNT PoVol2n — pban) sin(4nrn) = 0. (5.6.40) 
n=1 
To satisfy Equation 5.6.40 for any 7, we set 
2A 
ag = ——S, 5.6.41) 
T 0G 
— (41? + p£)a1 + 4npovobi = — 5.6.42) 
4T povoa; + (4r? + pg)bi = 0, 5.6.43) 
2Ao(—1)" 
(16n?z? + pg)ao, — 8nmpovoba, = o(—1) 5.6.44) 


m(4n? — 1)’ 
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--- EXACT SOLUTION CONDUCTION , 
DISREGARDED 


=! THROUGHOUT 


0 OI! O2 O3 O4 O5 O6 07 OB 09 10 
7 


Figure 5.6.1: Temperature distribution along the equator of a spinning spherical satellite. (From Hrycak, 
P., 1963: Temperature distribution in a spinning spherical space vehicle. AJAA J., 1, 97. ©1963 AIAA, 
reprinted with permission.) 


and 
8n PoVodan + (16n?2? + pa)ban = 0, 5.6.45) 
or —— 
4 A 
[1622922 + (4n? + p2)?Ja1 = —— 5.6.46) 
[16m pave + (4n? + p3)?]b, = 27 0ovo Ao, 5.6.47) 
2Ao(—1)"(16n?7? + pd) 
[64n? n” pave + (16n?z? + pb) Jaan = a(n? —1) a 5.6.48) 
s 16(—1)" A 
[64n?«? pave + (16n?z? + pe)? bon = Cem 2. (5.6.49) 
"M 
Substituting for ao, a1, b1, dan, and bən, the particular solution is 
uie Ao (4T? + p2) Ao cos(277) 27 povo Ao sin(277) 
P npa  2|(4r? + p2)? 16z?pQvi] (Am? + pe)? + 16r? p22 
_ 2Ao S (—1)" (16n?z? + pg) cos(2n71) 
mJ c (n? — 1)[64n? r? pov} + (16n?7? + p2)?] 
z (—1)”nsin(2nrn) 
— 16 A , 5.6.50 
Poo (2, (4n? — 1)[64n?z? p2v2 + (16n?z? + p2)?| ( ) 


n=1 


Figure 5.6.1 is from Hrycak’s paper and shows the variation of the nondimensional 
temperature as a function of 7 for the spinning rate vg. The other parameters are typical 
of a satellite with aluminum skin and fully covered with glass-protected solar cells. As 
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a check on the solution, we show the temperature field (the dashed line) of a nonrotating 
satellite where we neglect the effects of conduction and only radiation occurs. The difference 
between the vo = 0 solid and dashed lines arises primarily due to the linearization of the 
nonlinear radiation boundary condition during the derivation of the governing equations. 


Problems 


Solve the following ordinary differential equations by Fourier series if the forcing is given 
by the periodic function 
1, 0«ct«m, 


mia m<t<2n, 


and f(t) = f(t + 27): 
ly” — y = f(t), 2. y" +y = f(t), 3. y" — 3y' + 2y = f(t). 


Solve the following ordinary differential equations by complex Fourier series if the forcing is 
given by the periodic function 


ft) =|,  -"stzm, 
and f(t) = f(t + 27): 
4. y" —y = f(t), 5. y" + 4y = f(t). 
6. An object radiating into its nocturnal surrounding has a temperature y(t) governed by 


the equation!® 


d oo 
^ Tay Áo + 2 A, cos(nwt) + Bn sin(nwt), 

where the constant a is the heat loss coefficient and the Fourier series describes the temporal 
variation of the atmospheric air temperature and the effective sky temperature. If y(0) = To, 


find y(t). 
7. The equation that governs the charge q on the capacitor of an LRC electrical circuit is 
q" + 2aq' + w?q =w E, 


where a = R/(2L), w? = 1/(LC), R denotes resistance, C denotes capacitance, L denotes 
the inductance, and E is the electromotive force driving the circuit. If E is given by 


oo 
E= 5 pne ot, 
n=—00 
find q(t). 
8. Use Fourier series to find the particular solution” of the ordinary differential equation 


" " _ 2 1, 0 « x « A/A, 
y" (a) + k’y(2) = evd i MA < x <r/2, 


, 


19 See Sodha, M. S., 1982: Transient radiative cooling. Solar Energy, 28, 541. 


?0 See Chabert, P., J. L. Raimbault, J. M. Rax, and M. A. Lieberman, 2004: Self-consistent nonlinear 
transmission line model of standing wave effects in a capacitive discharge. Phys. Plasmas, 11, 1775-1785. 


Fourier Series 225 


where k, Vr, and A are constants. Extend the forcing function as an even function into the 
interval (—A/2, 0). 


5.7 FINITE FOURIER SERIES 


In many applications we must construct a Fourier series from values given by data or 
a graph. Unlike the situation with analytic formulas where we have an infinite number of 
data points and, consequently, an infinite number of terms in the Fourier series, the Fourier 
series contains a finite number of sines and cosines where the number of coefficients equals 
the number of data points. 

Assuming that these series are useful, the next question is how do we find the Fourier 
coefficients? We could compute them by numerically integrating Equation 5.1.6. However, 
the results would suffer from the truncation errors that afflict all numerical schemes. On 
the other hand, we can avoid this problem if we again employ the orthogonality properties 
of sines and cosines, now in their discrete form. Just as in the case of conventional Fourier 
series, we can use these properties to derive formulas for computing the Fourier coefficients. 
These results will be exact except for roundoff errors. 

We start by deriving some preliminary results. Let us define £m = mP/(2N). Then, 
if k is an integer, 


2N-1 . 2N-1 ; 2N—1 THEN a 0 1 

2rikEm kmri " Ir 7V, r1, 
yoo 7S )- Dew) ee 6.73) 
m=0 m=0 m=0 2N, r= 1, 


because r?N = exp(2rki) = 1 if r Æ 1. If r = 1, then the sum consists of 2N terms, each 
of which equals one. The condition r = 1 corresponds to k = 0,+2N,+4N,.... Taking the 
real and imaginary part of Equation 5.7.1, 


2N-1 
Qnkam\ — f 0, k £0,+2N,+4N,..., 
» cos( P Jon k =0,+2N,+4N,..., (6:7:2) 
and bun 
A .f2nknzq 
5 sin( P )=0 (5.7.3) 
m=0 
for all k. 


Consider now the following sum: 


2N-1 : 2N-1 
f 21km 2njzm V 1 2n(k + j)Em 2n(k — j)Em 
J cos( P ) cos( P ) 5 tes E + cos —— 


m=0 m=0 
(5.7.4) 
0, |k — j| and |k ++m| 40,2N,4N,..., 
= N |k — j| Or |k + m| 4 0,2N,4N,...,(5.7.5) 


; |k — j| and |k + m| =0,2N,4N,.... 

Let us simplify the right side of Equation 5.7.5 by restricting ourselves to k+ 7 lying between 
0 to 2N. This is permissible because of the periodic nature of this equation. If k + j = 0, 
k= 7=0;ifk+j7 =2N,k=j7 — N. h either case, k— j = 0 and the right side of Equation 
5.7.5 equals 2N. Consider now the case k Z j. Then k+ j Z 0 or 2N and k — j £0 or 2N. 
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The right side of this equaton must equal 0. Finally, if k = j # 0 or N, then k + j 40 or 
2N but k — j = 0 and the right side of this equation equals N. In summary, 


2N-1 : 0 kzj 
2nkz, 2mjXq : : 
J zi 2 )«s( 2 ) = N, k—jz0,N (5.7.6) 
2N, k=j=0,N. 


m=0 


In a similar manner, 


2N-1 
2mkza N . ( 20j9m 
1 cos( 2 ) sin( A ) =0 (5.7.7) 


for all k and 7 and 


2N-1 0 kzj 
2 k m . 2 m ! . 
5 sin( 7) sin( > ) = N, k=j40,N, (5.7.8) 
0, k=j=0,N. 


m=0 


Armed with these equations we are ready to find the coefficients A, and B, of the 
finite Fourier series, 


Bi Na 27k ork A 2N 
f(x) = i +5 | cos (7) + Bk sin( =) | = cos( = z), (5.7.9) 
k=1 


where we have 2N data points and now define P as the period of the function. 
To find Aj we proceed as before and multiply Equation 5.7.9 by cos(27ja /P) (j may 
take on values from 0 to N) and sum from 0 to 2N — 1. At the point £ = £m, 


2N-1 . 2N-1 : 
2 A 2 
5 f(m) cos( rs.) = >: cos (“Fem 
N-1 2N-1 : 
2nk 2 
+ A; 23 cos (“Frm cos( Fem 


An RA 2nN 2nj 
4 aa cos( ^ 2 cos ( “em ) . (5.7.10) 


If 7 4 0 or N, then the first summation on the right side vanishes by Equation 5.7.2, the 
third by Equation 5.7.8, and the fourth by Equation 5.7.6. The second summation does not 
vanish if k — j and equals N. Similar considerations lead to the formulas for the calculation 
of Ax and By: 


ti 27k 
tSr Flom)eos( p m) 00 E2012... N, (5.7.11) 
and bbs 
1 '— . (nk 
Bu 5 fs )sin( EEan) $ k=1,2,...,N—1. (5.7.12) 
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Table 5.7.1: The Depth of Water in the Harbor at Buffalo, NY (Minus the Low-Water 
Datum of 568.8 ft) on the 15*® Day of Each Month During 1977 


mo n depth mo n depth mo n depth 
Jan 1 1.61 May 5 3.16 Sep 9 2.42 
Feb 2 1.57 Jun 6 2.95 Oct 10 2.95 
Mar 3 2.01 Jul 7 3.10 Nov 11 2.74 
Apr 4 2.68 Aug 8 2.90 Dec 12 2.63 


If there are 2N +1 data points and f(xo) = f(x2n), then Equation 5.7.11 and Equation 
5.7.12 are still valid and we need only consider the first 2N points. If f(xo) Æ f(ven), we can 
still use our formulas if we require that the endpoints have the value of [f (zo) + f (zaw)]/2. 
In this case the formulas for the coefficients A; and Bj, are 


N-1 
1 [f(zo) + f(zav) |^ 2nk 
T Y. Flam ae N 7.13 
Ak N 5 + 2. f (244) cos p* (5 ) 
where k = 0,1,2,..., N, and 
2N-1 
1 .f(2nk 
B, = x 5 Fl2m)sin( 2 2 ] (5.7.14) 


where k = 1,2,...,N — 1. 

It is important to note that 2N data points yield 2N Fourier coefficients A; and Br. 
Consequently our sampling frequency will always limit the amount of information, whether 
in the form of data points or Fourier coefficients. It might be argued that from the Fourier 
series representation of f(t) we could find the value of f(t) for any given t, which is more 
than we can do with the data alone. This is not true. Although we can calculate f(t) at any 
t using the finite Fourier series, the values may or may not be correct since the constraint on 
the finite Fourier series is that the series must fit the data in a least-squared sense. Despite 
the limitations imposed by only having a finite number of Fourier coefficients, the Fourier 
analysis of finite data sets yields valuable physical insights into the processes governing 
many physical systems. 


e Example 5.7.1: Water depth at Buffalo, NY 


Each entry?! in Table 5.7.1 gives the observed depth of water at Buffalo, NY (minus the 
low-water datum of 568.6 ft) on the 15*^ of the corresponding month during 1977. Assuming 
that the water level is a periodic function of 1 year, and that we took the observations at 
equal intervals, let us construct a finite Fourier series from these data. This corresponds to 
computing the Fourier coefficients Ao, A1,..., Ac, Bi,..., Bs, which give the mean level and 
harmonic fluctuations of the depth of water, the harmonics having the periods 12 months, 
6 months, 4 months, and so forth. 


21 National Ocean Survey, 1977: Great Lakes Water Level, 1977, Daily and Monthly Average Water 
Surface Elevations. National Oceanic and Atmospheric Administration. 
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In this problem, P equals 12 months, N = P/2 = 6 mo, and zm = mP/(2N) = 
m(12 mo)/12 mo = m. That is, there should be a data point for each month. From 
Equation 5.7.11 and Equation 5.7.12, 


11 
1 k 
Ak — 6 2. fm) cos( 755) D k= 0, 1, 2, 3, 4, 5, 6, (5.7.15) 
and 
B Ly Jii 5). Rano as (5.7.16) 
E m) SM| —— ]; = 4, 4,9,4,9. rs 
E — du 6 


Substituting the data into these equations yields 


Ao = twice the mean level = +5.120 ft 
EU — harmonic component with a period of 12 mo = —0.566 ft 
Bı = harmonic component with a period of 12 mo = —0.128 ft 
A» — harmonic component with a period of 6 mo = —0.177 ft 
Bə = harmonic component with a period of 6 mo = —0.372 ft 
A3 = harmonic component with a period of 4 mo = —0.110 ft 
Bs = harmonic component with a period of 4 mo = —0.123 ft 
A4 — harmonic component with a period of 3 mo = +0.025 ft 
B4 — harmonic component with a period of 3 mo = +0.052 ft 
As = harmonic component with a period of 2.4 mo = —0.079 ft 
Bs = harmonic component with a period of 2.4 mo = —0.131 ft 
Ag — harmonic component with a period of 2 mo — —0.107 ft 


Figure 5.7.1 is a plot of our results using Equation 5.7.9. Note that when we include 
all of the harmonic terms, the finite Fourier series fits the data points exactly. The values 
given by the series at points between the data points may be right or they may not. To 
illustrate this, we also plotted the values for the first of each month. Sometimes the values 
given by the Fourier series and these intermediate data points are quite different. 

Let us now examine our results in terms of various physical processes. In the long 
run the depth of water in the harbor at Buffalo, NY depends upon the three-way balance 
between precipitation, evaporation, and inflow-outflow of any rivers. Because the inflow and 
outflow of the rivers depends strongly upon precipitation, and evaporation is of secondary 
importance, the water level should correlate with the precipitation rate. It is well known 
that more precipitation falls during the warmer months rather than the colder months. 
The large amplitude of the Fourier coefficient A; and B1, corresponding to the annual cycle 
(k — 1), reflects this. 

Another important term in the harmonic analysis corresponds to the semiannual cycle 
(k — 2). During the winter months around Lake Ontario, precipitation falls as snow. 
Therefore, the inflow from rivers is greatly reduced. When spring comes, the snow and ice 
melt and a jump in the water level occurs. Because the second harmonic gives periodic 
variations associated with seasonal variations, this harmonic is absolutely necessary if we 
want to get the correct answer while the higher harmonics do not represent any specific 
physical process. 


e Example 5.7.2: Numerical computation of Fourier coefficients 


At the beginning of this chapter, we showed how you could compute the Fourier coeffi- 
cients ao, an, and bn from Equation 5.1.6 given a function f(t). All of this assumed that you 
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mean plus first harmonic 


water level 


mean plus first two harmonics 


water level 


mean plus first three harmonics 


water level 


mean plus all of the harmonics 
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Figure 5.7.1: Partial sums of the finite Fourier series for the depth of water in the harbor of Buffalo, NY 
during 1977. Circles indicate observations on the 15th 


of the month; crosses are observations on the first. 
could carry out the integrations. What do you do if you cannot perform the integrations? 
'The obvious solution is perform it numerically. In this section we showed that the best 
approximation to Equation 5.1.6 is given by Equation 5.7.11 and Equation 5.7.12. In the 
case when we have f(t) this is still true but we may choose N as large as necessary to obtain 
the desired number of Fourier coefficients. 

To illustrate this we have redone Example 5.1.1 and plotted the exact (analytic) and 
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Figure 5.7.2: The computation of Fourier coefficients using a finite Fourier series when f(t) is given by 
Equation 5.1.8. The circles give an and bn as computed from Equation 5.1.9, Equation 5.1.10, and Equation 
5.1.11. The crosses give the corresponding Fourier coefficients given by the finite Fourier series with N = 15. 


numerically computed Fourier coefficients in Figure 5.7.2. This figure was created using the 
MATLAB script 


clear; 

N = 15, M = 2*N; dt = 2*pi/M; % number of points in interval 
^ create time points assuming x(t) = x(t+period) 
[-pi:dt:pi-dt] ; 


ct 
ll 


f = zeros(size(t)); % initialize function f(t) 
for k = 1:length(t) ^ construct function f(t) 
if t(k) < 0; f(k) = 0; else f(k) = t(k); end; end; 
^ 
^4 compute Fourier coefficients using fast Fourier transform 
^ 
fourier = fft(f) / N; 
a0 comp = real(fourier(1)); sign = 1; 


for n = 2:N; 
an comp(n-1) = - sign * real(fourier(n)); 
bn comp(n-1) = sign * imag(fourier(n)); 
sign - - sign; 

end 


^ 

^4 plot comparisons 

^ 

NN = linspace(0,N-1,N); 
exact coeff(1) = pi/2; 
numer .coeff(1) = a 0.comp; 
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for n = 1:N-1; 
exact. coeff(n*1) 
numer.coeff (n*1) 

end; 

subplot (2,1,1),plot(NN,exact_coeff,’o’ ,NN,numer_coeff,’kx’) 

ylabel(’a_n’,’Fontsize’ ,20) 

clear exact_coeff numer_coeff 

NN = linspace(1,N-1,N-1); 

for n = 1:N-1; 
exact coeff(n) = -(-1)^n/n; numer .coeff(n) = bon comp(n); 

end; 

Subplot(2,1,2), plot(NN,exact coeff,'o?,NN,numer coeff,?kx?) 

xlabel(’n’,’Fontsize’,20); ylabel(’b_n’,’Fontsize’ ,20); 


((-1)^n-1) / (pi*(2*n-1)^2); 
amn_comp(n) ; 


It shows that a relative few data points can yield quite reasonable answers. 

Let us examine this script a little closer. One of the first things that you will note is 
that there is no explicit reference to Equation 5.7.11 and Equation 5.7.12. How did we get 
the correct answer? 

Although we could have coded Equation 5.7.11 and Equation 5.7.12, no one does that 
any more. In the 1960s, J. W. Cooley and J. W. Tukey?? devised an incredibly clever 
method of performing these calculations. This method, commonly called a fast Fourier 
transform or FFT, is so popular that all computational packages contain it as an intrinsic 
function and MATLAB is no exception, calling it fft. This is what has been used here. 

Although we now have an fft to compute the coefficients, this routine does not directly 
give the coefficients a, and bn but rather some mysterious (complex) number that is related 
to an + ibn. This is a common problem in using a package's FFT rather than your own 
and why the script divides by N and we keep changing the sign. The best method for 
discovering how to extract the coefficients an and bp is to test it with a dataset created by 
a simple, finite series such as 


f(x) = 20 + cos(t) + 3sin(t) + 6 cos(2t) — 20sin(2t) — 10 cos(3t) — 30sin(3t). (5.7.17) 


If the code is correct, it must give back the coefficient in Equation 5.7.17 to within round-off. 
Otherwise, something is wrong. 

Finally, most FFTs assume that the dataset will start repeating after the final data 
point. Therefore, when reading in the dataset, the point corresponding to x = L must be 
excluded. 


e Example 5.7.3: Aliasing 


In the previous example, we could only resolve phenomena with a period of 2 months 
or greater although we had data for each of the 12 months. This is an example of Nyquist's 
sampling criteria:?? At least two samples are required to resolve the highest frequency in a 
periodically sampled record. 

Figure 5.7.3 will help explain this phenomenon. In case (a) we have quite a few data 
points over one cycle. Consequently our picture, constructed from data, is fairly good. In 


22 Cooley, J. W., and J. W. Tukey, 1965: An algorithm for machine calculation of complex Fourier series. 
Math. Comput., 19, 297-301. 


23 Nyquist, H., 1928: Certain topics in telegraph transmission theory. AIEE Trans., 47, 617-644. 
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Figure 5.7.3: The effect of sampling in the representation of periodic functions. 
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Figure 5.7.4: The sea elevation at the mouth of the Chesapeake Bay from its average depth as a function 
of time after 1 July 1985. 


case (b), we took only samples at the ridges and troughs of the wave. Although our picture 
of the real phenomenon is poor, at least we know that there is a wave. From this picture 
we see that even if we are lucky enough to take our observations at the ridges and troughs 
of a wave, we need at least two data points per cycle (one for the ridge, the other for the 
trough) to resolve the highest-frequency wave. 

In case (c) we have made a big mistake. We have taken a wave of frequency N Hz and 
misrepresented it as a wave of frequency N/2 Hz. This misrepresentation of a high-frequency 
wave by a lower-frequency wave is called aliasing. It arises because we are sampling a 
continuous signal at equal intervals. By comparing cases (b) and (c), we see that there is 
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Figure 5.7.5: The amplitude of the Fourier coefficients for the sea elevation at the Chesapeake Bay bridge 
and tunnel (top) and Baltimore harbor (bottom) as a function of period. 


a cutoff between aliased and nonaliased frequencies. This frequency is called the Nyquist 
or folding frequency. It corresponds to the highest frequency resolved by our finite Fourier 
analysis. 


Because most periodic functions require an infinite number of harmonics for their rep- 
resentation, aliasing of signals is a common problem. Thus the question is not "can I avoid 
aliasing?” but “can I live with it?" Quite often, we can construct our experiments to say 
yes. An example where aliasing is unavoidable occurs in a Western at the movies when 
we see the rapidly rotating spokes of the stagecoach’s wheel. A movie is a sampling of 
continuous motion where we present the data as a succession of pictures. Consequently, a 
film aliases the high rate of revolution of the stagecoach’s wheel in such a manner so that 
it appears to be stationary or rotating very slowly. 
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Figure 5.7.6: Same as Figure 5.7.4 but with the tides removed. 


e Example 5.7.4: Spectrum of the Chesapeake Bay 


For our final example, we perform a Fourier analysis of hourly sea-level measurements 
taken at the mouth of the Chesapeake Bay during the 2000 days from 9 April 1985 to 29 
June 1990. Figure 5.7.4 shows 200 days of this record, starting from 1 July 1985. As this 
figure shows, the measurements contain a wide range of oscillations. In particular, note the 
large peak near day 90 that corresponds to the passage of Hurricane Gloria during the early 
hours of 27 September 1985. 

Utilizing the entire 2000 days, we plotted the amplitude of the Fourier coefficients as 
a function of period in Figure 5.7.5. We see a general rise of the amplitude as the period 
increases. Especially noteworthy are the sharp peaks near periods of 12 and 24 hours. The 
largest peak is at 12.417 hours and corresponds to the semidiurnal tide. Thus, our Fourier 
analysis shows that the dominant oscillations at the mouth of the Chesapeake Bay are the 
tides. A similar situation occurs in Baltimore harbor. Furthermore, with this spectral 
information we could predict high and low tides very accurately. 

Although the tides are of great interest to some, they are a nuisance to others because 
they mask other physical processes that might be occurring. For that reason we would like 
to remove them from the tidal gauge history and see what is left. One way would be to 
zero out the Fourier coefficients corresponding to the tidal components and then plot the 
resulting Fourier series. Another method is to replace each hourly report with an average 
of hourly reports that occurred 24 hours ahead of and behind a particular report. We 
construct this average in such a manner that waves with periods of the tides sum to zero.?* 
Such a filter is a popular method for eliminating unwanted waves from a record. Filters 
play an important role in the analysis of data. We plotted the filtered sea level data in 
Figure 5.7.6. Note that summertime (0-50 days) produces little variation in the sea level 
compared to wintertime (100-150 days) when intense coastal storms occur. 


24 See Godin, G., 1972: The Analysis of Tides. University of Toronto Press, Section 2.1. 
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Figure 5.7.7: The orography of the earth and its spectrum in meters along three latitude belts using a 
topography dataset with a resolution of 1.25? longitude. 


Problems 
Find the finite Fourier series for the following pieces of data: 
1. f(0) 2 0, f(1) 2 1, f(2) = 2, f(3) = 3, and N = 2. 
2. f(0) — 1, f(1) 2 1, f(2) = -1, f(3) = —1, and N = 2. 
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Project: Spectrum of the Earth's Orography 


Table 5.7.2 gives the orographic height of the earth’s surface used in an atmospheric 
general circulation model (GCM) at a resolution of 2.5? longitude along the latitude belts 
of 28°S, 36°N, and 66°N. In this project you will find the spectrum of this orographic field 
along the various latitude belts. 


Step 1: Write a MATLAB script that reads in the data and find A, and B, and then 
construct the amplitude spectra for this data. 


Step 2: Construct several spectra by using every data point, every other data point, etc. 
How do the magnitudes of the Fourier coefficient change? You might like to read about 
leakage from a book on harmonic analysis.?? 


Step 3: Compare and contrast the spectra from the various latitude belts. How do the 
magnitudes of the Fourier coefficients decrease with n? Why are there these differences? 


Step 4: You may have noted that some of the heights are negative, even in the middle of 
the ocean! Take the original data (for any latitude belt) and zero out all of the negative 
heights. Find the spectra for this new data set. How have the spectra changed? Is there a 
reason why the negative heights were introduced? 


Further Readings 


Carslaw, H. S., 1950: An Introduction to the Theory of Fourier’s Series and Integrals. 
Dover, 368 pp. A classic treatment of the Fourier technique. 


Tolstov, Georgi P., 1976: Fourier Series. Dover, 336 pp. This book covers the basic theory 
of Fourier series and its use in mathematical physics. 


?5 For example, Bloomfield, P., 1976: Fourier Analysis of Time Series: An Introduction. John Wiley & 
Sons, 258 pp. 
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Table 5.7.2: Orographic Heights (in m) Times the Gravitational Acceleration Constant 


(g = 9.81 m/s?) along Three Latitude Belts 


Longitude 


—180.0 
—177.5 
—175.0 
—172.5 
—170.0 
—167.5 
—165.0 
—162.5 
—160.0 
—157.5 
—155.0 
—152.5 
—150.0 
—14T.5 
—145.0 
—142.5 
—140.0 
—137.5 
—135.0 
—132.5 
—130.0 
—12T.5 
—125.0 
—122.5 
—120.0 
—117.5 
—115.0 
—112.5 
—110.0 
—107.5 
—105.0 
—102.5 
—100.0 
—97.5 
—95.0 
—92.5 
—90.0 
—87.5 
—85.0 
15.0 
17.5 
20.0 
22.5 
25.0 


28°S 


l = l l l l l l = l 


POT eo EO sse Pocos SR S a E E Oa perc ECL EE Sd 


l 
PEO 


36°N 


Code so Oe eoa EI uon ue ee er biduo 


gx 
bo 
CN 


66°N 


2532. 
1665. 
1432. 
1213. 

501. 

367. 

963. 
1814. 
2562. 
3150. 
4008. 
4980. 
6011. 
6273. 
5928. 
6509. 
7865. 
7752. 
6817. 
6272. 
5582. 
4412. 
3206. 
2653. 
2702. 
3062. 
3344. 
3444. 
3262. 
3001. 
2931. 
2633. 
1933. 
1473. 
1689. 
2318. 
2285. 
1561. 
1199. 
5263. 
5646. 
3672. 
1628. 

889. 


Longitude 


—82.5 
—80.0 


2164. 
4728. 
5347. 
2667. 
1213. 
1612. 
1744. 
1153. 
838. 
35538. 
31985. 
23246. 
17363. 
14315. 
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Table 5.7.2, contd.: Orographic Heights (in m) Times the Gravitational Acceleration 
Constant (g — 9.81 m/s?) along Three Latitude Belts 


Longitude 28°S 36°N 66°N Longitude 28°S 36°N 66°N 


27.5 14583. 3377. 1366. 110.0 —17. 120639. 4674. 
30.0 11351. 7682. 1857. 112.5 302. 10543. 4435. 
32.5 3370. 9663. 1534. 115.0 1874. 4967. 3646. 
35.0 15. 10197. 993. 117.5 4005. 1119. 2655. 
37.5 49. | 10792. 863. 120.0 4989. 696. 2065. 
40.0 —31. 11322. 156. 122.5 4887. 475. 1583. 
42.5 20. 13321. 620. 125.0 4445. 1631. 3072. 
45.0 —17. 15414. 626. 127.5 4362. 2933. 7290. 
47.5 —19. 12873. 836. 130.0 4368. 1329. 8541. 
50.0 —18. 6114. 1029. 132.5 3485. 88. 7078. 
52.5 6. 2962. 946. 135.0 1921. 598. 7322. 
55.0 —2. 4913. 828. 137.5 670. 1983. 9445. 
57.5 3. 6600. 1247. 140.0 666. 2511. | 10692. 
60.0 —3. 4885. 2091. 142.5 1275. 866. 9280. 
62.5 2. 3380. 2276. 145.0 1865. 13. 8372. 
65.0 —1. 5842. 1870. 147.5 2452. 11. 6624. 
67.5 2. 12106. 1215. 150.0 3160. —4. 3617. 
70.0 0. 23032. 680. 152.5 2676. 1. 2717. 
72.5 2. 35376. 531. 155.0 697. 0. 3474. 
79.0 —1. 36415. 539. 157.5 —67. —3. 4337. 
77.5 1. 26544. 579. 160.0 25. 3. 4824. 
80.0 0. 19363. 554. 162.5 —12. =I, 5525. 
82.5 1. 17915. 632. 165.0 10. 4. 6323. 
85.0 —2. 22260. 791. 167.5 —5. —2. 5899. 
87.5 —1. 30442. 1455. 170.0 0. 1. 4330. 
90.0 —3. 33601. 3194. 172.5 0. —4. 3338 
92.5 —1. 30873. 4878. 175.0 4. 3. 3408. 
95.0 0. 31865. 5903. 177.5 3 -1 3407 


Chapter 6 
The Sturm-Liouville Problem 


In the next three chapters we will be solving partial differential equations using the 
technique of separation of variables. This technique requires that we expand a piece-wise 
continuous function f(x) as a linear sum of eigenfunctions, much as we used sines and 
cosines to re-express f(x) in a Fourier series. The purpose of this chapter is to explain and 
illustrate these eigenfunction expansions. 


6.1 EIGENVALUES AND EIGENFUNCTIONS 


Repeatedly, in the next three chapters on partial differential equations, we will solve 
the following second-order linear differential equation: 


3: LM + lae(x) -Ar(zm)y =0, amb, (6.1.1) 


dx 


together with the boundary conditions: 


ay(a) + By'(a)=0 and — yy(b) + dy'(b) = 0. (6.1.2) 


In Equation 6.1.1, p(x), q(x), and r(x) are real functions of x; A is a parameter; and 
p(x) and r(x) are functions that are continuous and positive on the interval a < x < b. 
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(B 


By the time that Charles-Francois Sturm (1803-1855) met Joseph Liouville in the early 1830s, he 
had already gained fame for his work on the compression of fluids and his celebrated theorem on the 
number of real roots of a polynomial. An eminent teacher, Sturm spent most of his career teaching 
at various Parisian colleges. (Portrait courtesy of the Archives de l'Académie des sciences, Paris.) 


Taken together, Equation 6.1.1 and Equation 6.1.2 constitute a regular Sturm-Liouville 
problem, named after the French mathematicians Sturm and Liouville! who first studied 
these equations in the 1830s. In the case when p(x) or r(x) vanishes at one of the endpoints 
of the interval [a,b] or when the interval is of infinite length, the problem becomes a singular 
Sturm-Liouville problem. 

Consider now the solutions to the regular Sturm-Liouville problem. Clearly there is the 
trivial solution y = 0 for all 4. However, nontrivial solutions exist only if A takes on specific 
values; these values are called characteristic values or eigenvalues. The corresponding 
nontrivial solutions are called the characteristic! functions or eigenfunctions. In particular, 
we have the following theorems. 


Theorem: For a regular Sturm-Liouville problem with p(x) > 0, all of the eigenvalues are 
real if p(x), q(x), and r(x) are real functions and the eigenfunctions are differentiable and 


continuous. 


Proof: Let y(x) = u(x) + iv(z) be an eigenfunction corresponding to an eigenvalue À = 


1 For the complete history as well as the relevant papers, see Lützen, J., 1984: Sturm and Liouville's 
work on ordinary linear differential equations. The emergence of Sturm-Liouville theory. Arch. Hist. Exact 
Sci., 29, 309-376. 
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Although educated as an engineer, Joseph Liouville (1809-1882) would devote his life to teaching 
pure and applied mathematics in the leading Parisian institutions of higher education. Today he 
is most famous for founding and editing for almost 40 years the Journal de Liouville. (Portrait 
courtesy of the Archives de l'Académie des sciences, Paris.) 


Ay + 4A;, where Ar, A; are real numbers and u(x), v(x) are real functions of x. Substituting 
into the Sturm-Liouville equation yields 


{p(x) [ul (x) + iv’ (ax)]} + lal) + (Ar + 3X;)r()][u(2c) + iv(x)] = 0. (6.1.3) 
Separating the real and imaginary parts gives 
[p(x)u' (x) + [g(x) + A,]u(z) — Agr (x)v(x) = 0, (6.1.4) 
and 
[p(z)v' (£) + (g(x) + A, ]o(zc) + Agr(x)u(x) = 0. (6.1.5) 


If we multiply Equation 6.1.4 by v and Equation 6.1.5 by u and subtract the results, we 
find that 


u(x)[p(x)v' (x)|’ — v(a)[p(z)u' (a) + Agr (a) [u? (£) + v?(x)] = 0. (6.1.6) 
The derivative terms in Equation 6.1.6 can be rewritten so that it becomes 
2. pew ejua) - pe (eoe HAr) ey - 0. (617) 


Integrating from a to b, we find that 


b 


b 
-x f r(a)[u?(x) + v?(z)] dz = (p(z)[u(z)v' (x) — v(z)u (z)]H, - (6.1.8) 


From the boundary conditions, Equation 6.1.2, 
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a[u(a) + iv(a)] + Blu’ (a) + iv'(a)] = 0, (6.1.9) 


and 


*y[u(b) + iv(0)] + ó[u' (b) + iv' (b)] = 0. (6.1.10) 
Separating the real and imaginary parts yields 
au(a) + Bu'(a)=0, and av(a)+ Bv'(a) — 0, (6.1.11) 


and 
yu(b) + 6u’(b) — 0, and yvu(b) + óv'(b) = 0. (6.1.12) 


Both « and 8 cannot be zero; otherwise, there would be no boundary condition at x = a. 
Similar considerations hold for y and ô. Therefore, 


u(a)v'(a) — v(a)v(a) 20, and u(b)v'(b) — u'(b)v(b) = 0, (6.1.13) 
if we treat a, 6, y, and 6 as unknowns in a system of homogeneous equations, Equation 


6.1.11 and Equation 6.1.12, and require that the corresponding determinants equal zero. 
Applying Equation 6.1.13 to the right side of Equation 6.1.8, we obtain 


b 
a | r(x)[u?(x) + v?(a)] da = 0. (6.1.14) 


Because r(x) > 0, the integral is positive and A; = 0. Since A; = 0, A is purely real. This 
implies that the eigenvalues are real. 


If there is only one independent eigenfunction for each eigenvalue, that eigenvalue is 
simple. When more than one eigenfunction belongs to a single eigenvalue, the problem is 
degenerate. 


Theorem: The regular Sturm-Liouville problem has infinitely many real and simple eigen- 
values An, n = 0,1,2,..., which can be arranged in a monotonically increasing sequence 
ào < Ay < Aa < +++ such that lim, ,55 Àn = oo. Every eigenfunction yn(x) associated 
with the corresponding eigenvalue A, has exactly n zeros in the interval (a,b). For each 
eigenvalue there exists only one eigenfunction (up to a multiplicative constant). 


The proof is beyond the scope of this book but may be found in more advanced treatises.” 


In the following examples we illustrate how to find these real eigenvalues and their 
corresponding eigenfunctions. 


e Example 6.1.1 
Let us find the eigenvalues and eigenfunctions of 


y" + Ay — 0, (6.1.15) 


? See, for example, Birkhoff, G., and G.-C. Rota, 1989: Ordinary Differential Equations. John Wiley & 
Sons, Chapters 10 and 11; Sagan, H., 1961: Boundary and Eigenvalue Problems in Mathematical Physics. 
John Wiley & Sons, Chapter 5. 
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Figure 6.1.1: Graphical solution of tan(zr) = v. 
subject to the boundary conditions 
y(0) — 0, and y(s) — y' (v) = 0. (6.1.16) 


Our first task is to check to see whether the problem is indeed a regular Sturm-Liouville 
problem. A comparison between Equation 6.1.1 and Equation 6.1.15 shows that they are 
the same if p(x) = 1, q(x) = 0, and r(x) = 1. Similarly, the boundary conditions, Equation 
6.1.16, are identical to Equation 6.1.2 if œ = y = 1, ô = —1, 8 = 0, a = 0, and b= m. 

Because the form of the solution to Equation 6.1.15 depends on A, we consider three 
cases: \ negative, positive, or equal to zero. The general solution? of the differential equation 
is 


y(x) = Acosh(mz)+ Bsinh(mx), if A«90, (6.1.17) 
y(z) =C+Dz, if rA=0, (6.1.18) 

and 
y(x) = E cos(kx) + Fsin(kx), if A0, (6.1.19) 


where for convenience \ = —m? < 0 in Equation 6.1.17 and A = k? > 0 in Equation 6.1.19. 
Both k and m are real and positive by these definitions. 

Turning to the condition that y(0) = 0, we find that A = C = E = 0. The other 
boundary condition y(t) — y(t) = 0 gives 


B|sinh(mz) — m cosh(mz)] = 0, (6.1.20) 
D — 0, (6.1.21) 

and 
F[sin(kr) — k cos(kr)] = 0. (6.1.22) 


If we graph sinh(mz) — m cosh(mr) for all positive m, this quantity is always negative. 
Consequently, B = 0. However, in Equation 6.1.22, a nontrivial solution (i.e., F # 0) occurs 
if 

Fcos(km)[tan(km) — k| =0, or tan(km) =k. (6.1.23) 


? In many differential equations courses, the solution to y” — m2y = 0, m > 0 is written y(z) = c1e™? + 


c2e ""*. However, we can rewrite this solution as y(x) = (c1+c2) i (e™* +e—™*)+(c1—c2) i (e * —e 


A cosh(mz) + Bsinh(mz), where cosh(mz) = (e"* + e~™*)/2 and sinh(mz) = (e™* — e~™*)/2. The 
advantage of using these hyperbolic functions over exponentials is the simplification that occurs when we 
substitute the hyperbolic functions into the boundary conditions. 


cma) = 
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k] = 1.29011 ky = 2.37305 


k 3= 3.40918 k4= 4.42032 


5 
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Figure 6.1.2: The first four eigenfunctions sin(knx) corresponding to the eigenvalue problem tan(k7) = k. 


In summary, we found nontrivial solutions only when A, = k2 > 0, where kn is the nth 
root of the transcendental equation, Equation 6.1.23. We can find the roots either graph- 
ically or through the use of a numerical algorithm. Figure 6.1.1 illustrates the graphical 
solution to the problem. We exclude the root k = 0 because A must be greater than zero. 

Let us now find the corresponding eigenfunctions. Because A= B=C=D=E=0, 
we are left with y(x) = Fsin(kx). Consequently, the eigenfunction, traditionally written 
without the arbitrary amplitude constant, is 


Yn(x) = sin(kn 2), (6.1.24) 


because k must equal kn. Figure 6.1.2 shows the first four eigenfunctions. 


e Example 6.1.2 
For our second example let us solve the Sturm-Liouville problem, 
y" + Ay — 0, (6.1.25) 
with the boundary conditions 
y(0) — y/(0) 20, and y(m)- y'(v) ^ 0. (6.1.26) 
Once again the three possible solutions to Equation 6.1.25 are 
y(x) = Acosh(mz)4- Bsinh(mz), if A=—m? <0, (6.1.27) 


y(x)-—C-Dz, if A=), (6.1.28) 


^ Sosov and Theodosiou [Sosov, Y., and C. E. Theodosiou, 2002: On the complete solution of the 


Sturm-Liouville problem (d? X/dz?) + A? X = 0 over a closed interval. J. Math. Phys. (Woodbury, NY), 
43, 2831-2843] have analyzed this problem with the general boundary conditions, Equation 6.1.2. 
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and 
y(x) = E cos(kz) + Fsin(kz), if A=k?>0. (6.1.29) 


Let us first check and see if there are any nontrivial solutions for A < 0. Two simulta- 
neous equations result from the substitution of Equation 6.1.27 into Equation 6.1.26: 


A — mB - 0, (6.1.30) 


nd 
i [cosh(mr) — msinh(mr)]A + [sinh(mr) — m cosh(mr)|B = 0. (6.1.31) 


The elimination of A between the two equations yields 
sinh(mr)(1 — m?)B = 0. (6.1.32) 
If Equation 6.1.27 is a nontrivial solution, then B 4 0, and 
sinh(m7) = 0, or i = 1, (6.1.33) 


The condition sinh(mz) = 0 cannot hold because it implies m = A = 0, which contradicts 
the assumption used in deriving Equation 6.1.27 that A < 0. On the other hand, m? = 1 is 
quite acceptable. It corresponds to the eigenvalue A = —1 and the eigenfunction is 


yo = cosh(x) + sinh(z) = e”, (6.1.34) 
because it satisfies the differential equation 
Vo — yo = 0, (6.1.35) 
and the boundary conditions 


wo(0)— u5(0) — 0, — and ^ yo(m)- y(r) — 0. (6.1.36) 


An alternative method of finding m, which is quite popular because of its use in more 
difficult problems, follows from viewing Equation 6.1.30 and Equation 6.1.31 as a system 
of homogeneous linear equations, where A and B are the unknowns. It is well known? that 
for Equation 6.1.30 and Equation 6.1.31 to have a nontrivial solution (i.e., A 4 0 and/or 
B #0) the determinant of the coefficients must vanish: 


1 -m 
cosh(mm) — msinh(mr) sinh(mr) — mcosh(mm) | — 0. (6.1.37) 


Expanding the determinant, 
sinh(mr)(1 — m?) = 0, (6.1.38) 


which leads directly to Equation 6.1.33. 
We consider next the case of A = 0. Substituting Equation 6.1.28 into Equation 6.1.26, 
we find that 
C—-D=0, and C+ Dr—-D=0. (6.1.39) 


5 See Chapter 3. 
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Figure 6.1.3: The first four eigenfunctions for the Sturm-Liouville problem, Equation 6.1.25 and Equation 
6.1.26. 


This set of simultaneous equations yields C = D = 0 and we have only trivial solutions for 
A = 0. 
Finally, we examine the case when A > 0. Substituting Equation 6.1.29 into Equation 
6.1.26, we obtain 
E —kF — 0, (6.1.40) 


and 
[cos(km) + ksin(k7)|E + [sin(ka) — kcos(kz)|F = 0. (6.1.41) 


The elimination of E from Equation 6.1.40 and Equation 6.1.41 gives 
F(1 +k?) sin(kz) = 0. (6.1.42) 
If Equation 6.1.29 is nontrivial, F Z 0, and 
k? = —1, or sin(km) = 0. (6.1.43) 
The condition k? = —1 violates the assumption that k is real, which follows from the fact 


that A = k? > 0. On the other hand, we can satisfy sin(k7) = 0 if k = 1,2,3,...; a negative 
k yields the same A. Consequently we have the additional eigenvalues An = n?. 
Let us now find the corresponding eigenfunctions. Because E = kF, y(x) = F sin(kz)- 


Fk cos(kr) from Equation 6.1.29. Thus, the eigenfunctions for A > 0 are 
yn (x) = sin(nx) + n cos(nx). (6.1.44) 


Figure 6.1.3 illustrates some of the eigenfunctions given by Equation 6.1.34 and Equation 
6.1.44. 


e Example 6.1.3 
Consider now the Sturm-Liouville problem 


y" + Ay — 0, (6.1.45) 
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with 
y(t) 2 y(-7), and y'(m)- y'(-m). (6.1.46) 
'This is not a regular Sturm-Liouville problem because the boundary conditions are periodic 


and do not conform to the canonical boundary condition, Equation 6.1.2. 
The general solution to Equation 6.1.45 is 


y(x) = Acosh(mz) + Bsinh(mz), if A=—m? <0, (6.1.47) 
y(r)-C-Dz, if rA=0, (6.1.48) 

and 
y(x) = E cos(kx) + Fsin(kz), if A—Kk?20. (6.1.49) 


Substituting these solutions into the boundary condition, Equation 6.1.46, 


A cosh(m) + B sinh(mz) = Acosh(—mo) + B sinh(—m), 6.1.50) 
C 4H Dx =C — Dr, 6.1.51) 

and 
E cos(kz) + F sin(kr) = E cos(—kr) + F sin(—kz), 6.1.52) 

Or 

Bsinh(mz)—0, D=0, and Fsin(km) — 0, 6.1.53) 
because cosh(—mm) = cosh(mm), sinh(—mz) = —sinh(mm), cos(—k7) = cos(km), and 
sin(—k7) = —sin(k7). Because m must be positive, sinh(mr) cannot equal zero and B = 0. 
On the other hand, if sin(kx) = 0 or k = n, n = 1,2,3,..., we have a nontrivial solution 


for positive À and A, = n?. Note that we still have A, C, E, and F as free constants. 
From the boundary condition, Equation 6.1.46, 


Asinh(mz) = Asinh(—mm), (6.1.54) 


and 


—Esin(kz) + F cos(kr) = —E sin(—kr) + F cos(—kr). (6.1.55) 


The solution yo(x) — C identically satisfies the boundary condition, Equation 6.1.46, for 
all C. Because m and sinh(mz) must be positive, A = 0. From Equation 6.1.53, we once 
again have sin(kz) = 0, and k = n. Consequently, the eigenfunction solutions to Equation 
6.1.45 and Equation 6.1.46 are 


Xo = 0, yo(x) = 1, (6.1.56) 
and 
1002 _  sin(nz), 
An — n*, yn(x) = eee (6.1.57) 


and we have a degenerate set of eigenfunctions to the Sturm-Liouville problem, Equation 
6.1.45, with the periodic boundary condition, Equation 6.1.46. 
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Problems 
Find the eigenvalues and eigenfunctions for each of the following: 


1. y" Ay —0, $y'(0)0—0, y(L)=0 


3. y" +y —0, y(0)+y'(0)=0, y(m)t y'(1) 20 


4. y" Ay —-0, y/(0)=0, y(7)-v(n)-0 
5. y? + ry —0, y(0—$9"(0 —0, y(D)—-y"(L) =0 


Find an equation from which you could find A and give the form of the eigenfunction for 
each of the following: 


6. y" +Ay=0, y(0)+y(0)=0, y(1)=0 


for each of the following boundary conditions: (a) u(1) = u(e) = 0, (b) u(1) = w'(e) = 0, 
and (c) v'(1) = w'(e) = 0. 


Find the eigenvalues and eigenfunctions of the following Sturm-Liouville problems: 


€ gy” + 2ay'+rAy=0, y(1)=y(e)=0, 1<r<e 
13. 
f(y) tary =0, y(1)=y(e")=0, 1<0<e" 
14. 
E (<u) +4v=0, wn-we)-9 10s 
15. 


y" E ry = 0, y” (0) y" (0) y" (1) y (1) 0, 0 <“< 1 
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6.2 ORTHOGONALITY OF EIGENFUNCTIONS 


In the previous section we saw how nontrivial solutions to the regular Sturm-Liouville 
problem consist of eigenvalues and eigenfunctions. The most important property of eigen- 
functions is orthogonality. 


Theorem: Let the functions p(x), q(x), and r(x) of the regular Sturm-Liouville problem, 
Equation 6.1.1 and Equation 6.1.2, be real and continuous on the interval [a,b]. If y(x) 
and ya (x) are continuously differentiable eigenfunctions corresponding to the distinct eigen- 
values X, and Am, respectively, then y,(x) and y,,(x) satisfy the orthogonality condition: 


b 
f r(z)ys (z)um (x) dx = 0, (6.2.1) 


if An # Am. When Equation 6.2.1 is satisfied, the eigenfunctions y,(r) and ym(a) are 
said to be orthogonal to each other with respect to the weight function r(x). The term 
orthogonality appears to be borrowed from linear algebra where a similar relationship holds 
between two perpendicular or orthogonal vectors. 


Proof: Let yn(x) and ym(x) denote the eigenfunctions associated with two different eigen- 
values Ap and Àm. Then 


£ peoe] + [g(£) + An? (2)|yn(x) = 0, (6.2.2) 
L (ay | + (g(x) + Amr (2) |¥m (x) = 9, (6.2.3) 


and both solutions satisfy the boundary conditions. Let us multiply the first differential 
equation by ym; the second by yn. Next, we subtract these two equations and move the 
terms containing YnYm to the right side. The resulting equation is 


d 
Yn dx 


a) He] masc) = On — Adrants (624) 


Integrating Equation 6.2.4 from a to b yields 


[oo (ay He Ym LOI = (An - An) f remm dx. | (6.2.5) 


We can simplify the left side of Equation 6.2.5 by integrating by parts to give 


i IE ic PS | — m [poe] ae 
b 


“Gia pe ae = i p(x) [yun — yhymlde. (6.2.6) 


a 


The second integral equals zero since the integrand vanishes identically. Because yn (a) and 
Ym(a) satisfy the boundary condition at x = a, 


ay (a) + By, (a) -— 0, (6.2.7) 
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and 
og (a) + BY; (a) = 0. (6.2.8) 


These two equations are simultaneous equations in a and 8. Hence, the determinant of the 
equations must be zero: 


y, (a)ym(a) — Yin(@)Yn(a) = 0. (6.2.9) 
Similarly, at the other end, 
yn (b)um (b) — Yin (b)ys (b) = 0. (6.2.10) 


Consequently, the right side of Equation 6.2.6 vanishes and Equation 6.2.5 reduces to Equa- 
tion 6.2.1. 


e Example 6.2.1 


Let us verify the orthogonality condition for the eigenfunctions that we found in Ex- 
ample 6.1.1. 
Because r(x) = 1, a = 0, b = m, and yp(x) = sin(k,x), we find that 


b T 
ip r(z)ysym dx = T sin(k,x) sin(k,a) dx 6.2.11) 
a 0 
=} | {cos[(kn — km)x] — cos|(kn + km)ax]} dx 6.2.12) 
_ sin[(kn — km)a] sin|(kn + km)z] 
TRE qo a 6.2.13) 
. sin[(kn —km)t] | sin[(k, + ka )n] 
= Dh UE 2(ks + Fm) 6.2.14) 
os sin(k,7) cos(kmm) — cos(k,7) sin(km) 
2(kn ~ km) 
sin(k,7) cos(kmT) + cos(ky7) sin(km7) 
EEA (6.2.15) 
kn cos(k,m) COS(km™) — km COS(knT) COS(KmT™) 
y 2(kn — km) 
kn COS(Kin 7) COS(Kim 7) + km COS(Kn m) cos(kmr) 
2k, 4 Fm) (6.2.16) 
_ (kn — km) cos(k,m) cos(km) 
B 2(kn — km) 
(kn + km) CoS(kn™) COS(Km™) — 
Oke tk) =0. (6.2.17) 


We used the relationships kn = tan(k,7), and km = tan(k,,7) to simplify Equation 6.2.15. 
Note, however, that if n = m, 


ii 7 in(2kn 
f sin(k,a) sin(k,a) da = 4 T [1 —cos(2knx)] dx = ; Sale x 1Ir— cos? (k,a)] > 0, 
0 0 


4kn 7 
(6.2.18) 
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because sin(2A) = 2sin(A) cos( A), and kn = tan(k,7). That is, any eigenfunction cannot 
be orthogonal to itself. 


In closing, we note that had we defined the eigenfunction in our example as 


sin(k,z) 


n\t) = 6.2.19 
unn) Jin — cos? (knr)]/2 eam) 
rather than y, (x) = sin(k,x), the orthogonality condition would read 
7 |. [0, mn, 
f Yn(2)Ym(x) dx = { a: (6.2.20) 


This process of normalizing an eigenfunction so that the orthogonality condition becomes 


0, m zn, 


1 (6.2.21) 


, m=n, 


b 
[ rmt yas = | 
a 
generates orthonormal eigenfunctions. We will see the convenience of doing this in the next 
section. 


Problems 


1. The Sturm-Liouville problem y" -- Ay = 0, y(0) = y(L) = 0 has the eigenfunction solution 
Ynlx) = sin(nzx/L). By direct integration, verify the orthogonality condition, Equation 
6.2.1. 


2. The Sturm-Liouville problem y" + Ay = 0, y'(0) = y'(D) = 0 has the eigenfunction 
solutions yo(x) = 1 and y, (x) = cos(nzx/L). By direct integration, verify the orthogonality 
condition, Equation 6.2.1. 


3. The Sturm-Liouville problem y" + Ay = 0, y(0) = y'(L) = 0 has the eigenfunction 
solution y,(r) = sin[(2n — 1)rz/(2L). By direct integration, verify the orthogonality 
condition, Equation 6.2.1. 


4. The Sturm-Liouville problem y" + Ay = 0, y'(0) = y(L) = 0 has the eigenfunction 
solution y,(x) = cos|(2n — 1)raz/(2L)]. By direct integration, verify the orthogonality 
condition, Equation 6.2.1. 


6.3 EXPANSION IN SERIES OF EIGENFUNCTIONS 


In calculus we learned that under certain conditions we could represent a function 
f(x) by a linear and infinite sum of polynomials (x — xo)”. In this section we show that an 
analogous procedure exists for representing a piece-wise continuous function by a linear sum 
of eigenfunctions. These eigenfunction expansions wil be used in the next three chapters 
to solve partial differential equations. 

Let the function f(x) be defined in the interval a < x < b. We wish to re-express f(x) 
in terms of the eigenfunctions y,(x) given by a regular Sturm-Liouville problem. Assuming 
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6 


that the function f(x) can be represented by a uniformly convergent series,? we write 


F(a) = X cuya (z). (6.3.1) 


n=1 


The orthogonality relation, Equation 6.2.1, gives us the method for computing the coeffi- 
cients cn. First we multiply both sides of Equation 6.3.1 by r(x)y; (x), where m is a fixed 
integer, and then integrate from a to b. Because this series is uniformly convergent and 
Un (x) is continuous, we can integrate the series term by term, or 


b oo b 
f r(x) f(x)ym(x) dz = 5 cn [ r(z)yn(zx)ym(x) dz. (6.3.2) 


=1 


The orthogonality relationship states that all of the terms on the right side of Equation 
6.3.2 must disappear except the one for which n = m. Thus, we are left with 


b 


b 
f v(2)f Gy (2) dz = cm if (om (22) (2) dz (6.3.3) 


or 


(6.3.4) 


if we replace m by n in Equation 6.3.3. 

Usually, both integrals in Equation 6.3.4 are evaluated by direct integration. In the 
case when the evaluation of the denominator is very difficult, Lockshin" has shown that the 
denominator of Equation 6.3.4 always equals 


i dy ð o? 
[row @ae n) [BR uae 


b 


(6.3.5) 


a 
for a regular Sturm-Liouville problem with eigenfunction solution y, where p(x), q(x), and 
r(x) are continuously differentiable on the interval |a, b]. 

The series, Equation 6.3.1, with the coefficients found by Equation 6.3.4, is a generalized 
Fourier series of the function f(x) with respect to the eigenfunction y,(x). It is called a 
generalized Fourier series because we generalized the procedure of re-expressing a function 
f(x) by sines and cosines into one involving solutions to regular Sturm-Liouville problems. 
Note that if we had used an orthonormal set of eigenfunctions, then the denominator of 


6 If S, (zx) = PO ug(z), S(x) = limn Sn(x), and 0 < |Sn(x) — S(x)| < e for all n > M > 0, the 


series aa uj (x) is uniformly convergent if M is dependent on e alone and not zx. 


T Lockshin, J. L, 2001: Explicit closed-form expression for eigenfunction norms. Appl. Math. Lett., 14, 
553-555. 
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Equation 6.3.4 would equal one and we reduce our work by half. The coefficients cn are the 
Fourier coefficients. 

One of the most remarkable facts about generalized Fourier series is their applicability 

even when the function has a finite number of bounded discontinuities in the range [a,b]. 
We may formally express this fact by the following theorem: 
Theorem: If both f(x) and f'(x) are piece-wise continuous in a < x < b, then f(x) can 
be expanded in a uniformly convergent Fourier series, Equation 6.3.1, whose coefficients Cy 
are given by Equation 6.3.4. It converges to [f(at) + f(x~)|/2 at any point x in the open 
interval a < x < b. 

The proof is beyond the scope of this book but can be found in more advanced treatises.? 
If we are willing to include stronger constraints, we can make even stronger statements about 
convergence. For example,? if we require that f(x) be a continuous function with a piece- 
wise continuous first derivative, then the eigenfunction expansion, Equation 6.3.1, converges 
to f(x) uniformly and absolutely in [a,b] if f(x) satisfies the same boundary conditions as 
does y, (x). 


In the case when f(x) is discontinuous, we are not merely rewriting f(x) in a new form. 
We are actually choosing the coefficients cn so that the eigenfunction expansion fits f(a) in 
the “least squares" sense that 


| "n 


Consequently we should expect peculiar things, such as spurious oscillations, to occur in 
the neighborhood of the discontinuity. These are Gibbs phenomena,'° the same phenomena 
discovered with Fourier series. See Section 5.2. 


2 


f(z) -— ` CnYn(z) 


n=1 


dx = 0. (6.3.6) 


e Example 6.3.1 


To illustrate the concept of an eigenfunction expansion, let us find the expansion for 
f(x) = x over the interval 0 < x < m using the solution to the regular Sturm-Liouville 
problem of 

y" +Ay — 0, y(0) = y(r) ^ 0. (6.3.7) 


'This problem arises when we solve the wave or heat equation by separation of variables in 
the next two chapters. 

Because the eigenfunctions are y,(x) = sin(nx), n = 1,2,3,..., r(x) = 1, a = 0, and 
b = n, Equation 6.3.4 yields 


u Jo c sin(nz) dz E —z cos(nz)/n + sin(nz)/n? |^ i 7 p" 
"m 7 sing?(nz)dr /2 — sin(2nz)/(4n)|; = 4, Cos(nm) E 1)”. 


(6.3.8) 


8 For example, Titchmarsh, E. C., 1962: Eigenfunction Expansions Associated with Second-Order Dif- 
ferential Equations. Part 1. Oxford University Press, pp. 12-16. 


9 Tolstov, G. P., 1962: Fourier Series. Dover Publishers, p. 255. 


10 Apparently first discussed by Weyl, H., 1910: Die Gibbs'sche Erscheinung in der Theorie der Sturm- 
Liouvilleschen Reihen. Rend. Circ. Mat. Palermo, 29, 321-323. 
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Equation 6.3.1 then gives 
sin(nx). (6.3.9) 


--xct 

This particular example is in fact an example of a half-range sine expansion. 
Finally we must state the values of x for which Equation 6.3.9 is valid. At x = 7 the 
series converges to zero while f(a) = «v. At x = 0 both the series and the function converge 
to zero. Hence this series expansion is valid for 0 € z < 7. 


e Example 6.3.2 


For our second example let us find the expansion for f(r) — x over the interval 0 € 
x < T using the solution to the regular Sturm-Liouville problem of 


y" + Ay — 0, y(0) = y(t) — y (x) ^ 0. (6.3.10) 


We will encounter this problem when we solve the heat equation with radiative boundary 
conditions by separation of variables. 

Because r(x) = 1, a = 0, b = 7, and the eigenfunctions are y,(r) = sin(k,x), where 
kn = tan(k,7), Equation 6.3.4 yields 


- Jo c sin(k,a) dx = Jo wsin(k,z)dz ^ — 2sin(k,z)/k — 2x cos(k,a) / ks |^ 


Jo sin? (kna)dz — ifj [1 —cos(2knx)| dæ - z — sin(2k,2)/(2kn)|5 
(6.3.11) 


m sin(knm)/k2 — 2mcos(k,n)/k, ^ 2(1— v) cos(kam)/ ky 


7 — sin(2k,z)/(2k,) x T — cos? (knr) , (6.3.12) 


where we used the property that sin(k,7) = kn cos(k,7). Equation 6.3.1 then gives 


(knr 
f(z)-2(1—m ps E ES km sin(k,z). (6.3.13) 


To illustrate the use of Equation 6.3.5, we note that 


ET m. PNEU 
y(x) = sin( VA), 5r A cos( V/A x), 2 m cos( V/A x), (6.3.14) 
and E E j 
y _ y _ _@ ae 
mor Bu XX cos( VA a) 7 sin( V Ax). (6.3.15) 
Therefore, 
i 2 T 2 : 1 x u 
r(z)y, (a) dx = $= cos" (kng) — sin(k,z) | —— cos(k,z) — = sin(kz) (6.3.16) 
^ 2 2k, 2 1 
dl a " m cos? (knT) 
= E Ts sin(kn x) son) p go ae, (6.3.17) 


Note that we set A = A, = k2 after taking the derivatives with respect to A. 
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Problems 


1. The Sturm-Liouville problem y" + Ay = 0, y(0) = y(L) = 0 has the eigenfunction 
solution y,(x) = sin(nzz/L). Find the eigenfunction expansion for f(x) = x using this 
eigenfunction. 


2. The Sturm-Liouville problem y" + Ay = 0, y'(0) = y'(D) = 0 has the eigenfunction 
solutions yo(z) = 1, and y,(z) = cos(nzc/L). Find the eigenfunction expansion for f(x) = 
x using these eigenfunctions. 


3. The Sturm-Liouville problem y" + Ay = 0, y(0) = y'(L) = 0 has the eigenfunction 
solution yp (x) = sin[((2n — 1)rz/(2L)]. Find the eigenfunction expansion for f(x) = x using 
this eigenfunction. 


4. The Sturm-Liouville problem y" + Ay = 0, y'(0) = y(L) = 0 has the eigenfunction 
solution y,(x) = cos[(2n — 1)yraz/(2L)]. Find the eigenfunction expansion for f(x) = x 
using this eigenfunction. 


5. Consider the eigenvalue problem 
y" +(A—a?)y « 0, 0<a<1, 
with the boundary conditions 


y'(0)+ay(0)=0 and y/(1)+ay(1) =0. 


Step 1: Show that this is a regular Sturm-Liouville problem. 


—ax 


Step 2: Show that the eigenvalues and eigenfunctions are Ag = 0, yo(x) = e and 


An — a? +n? 77, y, (x) = asin(nmz) — nr cos(nnz). 
where n — 1,2,3,.... 
Step 3: Given a function f(x), show that we can expand it as follows: 


f(z)- Coe * + p» C, [asin(nzx) — nr cos(nzz)] , 


n=1 


where 


1 
_ 92a Cy = 2 ak dz 
(1-6) Cy = 20 f fiy de, 


(a? + n?m^)C, = 2 f f(x) [asin(nzz) — nr cos(n7ax)] dx. 


6. Consider the eigenvalue problem 


nt 


y” + Ay" — 0, Dee T. 


with the boundary conditions y(0) — y'(0) 2 y(1) 2 y'(1) 2 0. Prove the following points: 
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Step 1: Show that the eigenfunctions are 


1 — cos(k,) 


yn (x) cos(knx) + i, sine) 


[sin(k,z) — knz], 


where k,, denotes the nth root of 


2 — 2cos(k) — ksin(k) = sin(k/2)[sin(k/2) — (k/2) cos(k/2)] = 0. 


Step 2: Show that there are two classes of eigenfunctions: 
Kn = 2nm, Yn(x) = 1 — cos(2nnz), 


and 


tan(&,/2) = Kn/2, yn(x) = 1 — cos(kn z) + eta Dn) — Kn]. 


Kn 


Step 3: Show that the orthogonality condition for this problem is 


1 
f yi (ayla(a)de=0, n#m, 


where y; (x) and y, (x) are two distinct eigenfunction solutions of this problem. Hint: Follow 
the proof in Section 6.2 and integrate repeatedly by parts to eliminate higher derivative 
terms. 


Step 4: Show that we can construct an eigenfunction expansion for an arbitrary function 


f(x) via 
f(z)= >" Cryn(z),  0«s«1 
n=1 


provided 
_ So fw) AOL 


GC: : 
Jo ly c)? da: 


What are the condition(s) on f(x)? 
6.4 A SINGULAR STURM-LIOUVILLE PROBLEM: LEGENDRE'S EQUATION 


In the previous sections we used solutions to a regular Sturm-Liouville problem in the 
eigenfunction expansion of the function f(x). The fundamental reason why we could form 
such an expansion was the orthogonality condition, Equation 6.2.1. This crucial property 
allowed us to solve for the Fourier coefficient c,, given by Equation 6.3.4. 

In the next few chapters, when we solve partial differential equations in cylindrical 
and spherical coordinates, we will find that f(x) must be expanded in terms of eigenfunc- 
tions from singular Sturm-Liouville problems. Is this permissible? How do we compute the 
Fourier coefficients in this case? The final two sections of this chapter deal with these ques- 
tions by examining the two most frequently encountered singular Sturm-Liouville problems, 
those involving Legendre's and Bessel's equations. 
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Born into an affluent family, Adrien-Marie Legendre’s (1752-1833) modest family 
fortune was sufficient to allow him to devote his life to research in celestial mechanics, 
number theory, and the theory of elliptic functions. In July 1784 he read before the 
Académie des sciences his Recherches sur la figure des planètes. It is in this paper 
that Legendre polynomials first appeared. (Portrait courtesy of the Archives de 
l'Académie des sciences, Paris.) 


We begin by determining the orthogonality condition for singular Sturm-Liouville prob- 
lems. Returning to the beginning portions of Section 6.2, we combine Equation 6.2.5 and 
Equation 6.2.6 to obtain 


b 
(An — Am) J r(z)yato dz —[p(5)/, (b)yn (5) — p(b)y (bu (b) 
— p(a)y'n(a)yn(a) + p(a)y/, (2) ()] (6.4.1) 


From Equation 6.4.1 the right side vanishes and we preserve orthogonality if y, (x) is finite 
and p(z)y/,(x) tends to zero at both endpoints. This is not the only choice but let us see 
where it leads. 

Consider now Legendre's equation: 


(esu) 


F n(n + 1)y — 0, (6.4.2) 


Or 


ja x?) z] - n(n 4- 1)y 0, (6.4.3) 
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where we set a = —1, b = 1, A = n(n + 1), p(x) = 1 — z?, q(x) = 0, and r(x) = 1. This 
equation arises in the solution of partial differential equations involving spherical geometry. 
Because p(—1) = p(1) = 0, we are faced with a singular Sturm-Liouville problem. Before 
we can determine if any of its solutions can be used in an eigenfunction expansion, we must 
find them. 

Equation 6.4.2 does not have a simple general solution. [If n = 0, then y(x) = 1 is a 
solution.] Consequently we try to solve it with the power series: 


- M Aa, (6.4.4) 
k=0 
eo dp. (6.4.5) 
k= 
and 
- M k(k — 1)Auz* 2. (6.4.6) 
k=0 
Substituting into Equation 6.4.2, 
So k(k— 1)A 257? + SO [n(n +1) - 2k — k(k — 1)] Akz* = 0, (6.4.7) 


> 
ll 
o 


k=0 


which equals 


[n(n + 1) — k(k + 1)] Axa = 0. (6.4.8) 


E 


» m(m — 1)Aí x"? + 


m=2 


> 
ll 
o 


If we define k = m — 2 in the first summation, then 


0 k=0 


Me 


> 
ll 


Because Equation 6.4.9 must be true for any x, each power of x must vanish separately. It 
then follows that 
(k + 2)(k + 1) Ange = [k(k + 1) — n(n + D]As, (6.4.10) 
or 
[k(k +1) — n(n + 1)] 
(k 4- 1)(k +2) 
where k = 0,1,2,.... Note that we still have the two arbitrary constants Ag and A, that 


are necessary for the general solution of Equation 6.4.2. 
The first few terms of the solution associated with Ag are 


Apso = Ax, (6.4.11) 


us (m) <1 — PED 2, nn n+ Din *3) 4 
i 21 | ii 
n(n — 2)(n — 4)(n + 1)(n +3)(n +5) 6 


sêt, (6.4.12) 


6! 
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Table 6.4.1: The First Ten Legendre Polynomials 


P(x) = (523 — 32) 
P4(x) = &(352* — 302? + 3) 
P(x) = &(6325 — 7023 + 152) 
Ps(x) = i (23129 — 31524 + 1052? — 5) 


Pr(z) = L(429r' — 69325 + 3152? — 35x 
16 
Ps(z) = 415 (643525 — 1201229 + 69302 — 1260? + 35) 
Po(x) = 435 (121552? — 2574027 + 18018x — 46202? + 3152) 


Pio(a) = i5 (4618921? — 1093952 + 9009029 — 3003024 + 34652? — 63) 


while the first few terms associated with the A; coefficient are 


v(x) — 2 (n — 1)(n + 2) 3 " (n — 1)(n — 3)(n t 2)(n +4) 5 


3! 5! 


ge (6.4.13) 


If n is an even positive integer (including n — 0), then the series, Equation 6.4.12, terminates 
with the term involving x": The solution is a polynomial of degree n. Similarly, if n is an 
odd integer, the series, Equation 6.4.13, terminates with the term involving z". Otherwise, 
for n noninteger the expressions are infinite series. 

For reasons that will become apparent, we restrict ourselves to positive integers m. 
Actually, this includes all possible integers because the negative integer —n — 1 has the 
same Legendre's equation and solution as the positive integer n. These polynomials are 
Legendre polynomials! and we may compute them by the power series: 


uu 2n — 2k)! ag 
Piast Ds — mc mi" "n (6.4.14) 
k=0 


where m = n/2, or m = (n — 1)/2, depending upon which is an integer. We chose to use 
Equation 6.4.14 over Equation 6.4.12 or Equation 6.4.13 because Equation 6.4.14 has the 
advantage that P,,(1) = 1. Table 6.4.1 gives the first ten Legendre polynomials. 

The other solution, the infinite series, is the Legendre function of the second kind, 
Qn(x). Figure 6.4.1 illustrates the first four Legendre polynomials P,,(x) while Figure 6.4.2 


11 Legendre, A. M., 1785: Sur l'attraction des sphéroides homogénes. Mém. math. phys. présentés à 
l'Acad. sci. pars divers savants, 10, 411—434. The best reference on Legendre polynomials is Hobson, E. 
W., 1965: The Theory of Spherical and Ellipsoidal Harmonics. Chelsea Publishing Co., 500 pp. 
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Figure 6.4.1: The first four Legendre functions of the first kind. 


gives the first four Legendre functions of the second kind Q;(x). From this figure we see 
that Qn(x) becomes infinite at the points x = +1. As shown earlier, this is important 
because we are only interested in solutions to Legendre's equation that are finite over the 
interval [-1, 1]. On the other hand, in problems where we exclude the points r = +1, 
Legendre functions of the second kind will appear in the general solution.” 

In the case that n is not an integer, we can construct a solution!’ that remains finite 
at x = 1 but not at x = —1. Furthermore, we can construct a solution that is finite at 
x = —1 but not at x = 1. Because our solutions must be finite at both endpoints so that 
we can use them in an eigenfunction expansion, we must reject these solutions from further 
consideration and are left only with Legendre polynomials. From now on, we will only 
consider the properties and uses of these polynomials. 

Although we have the series, Equation 6.4.14, to compute P,(x), there are several 
alternative methods. We obtain the first method, known as Rodrigues’s formula,!^ by 
writing Equation 6.4.14 in the form 


1c n!  (2n — 2k)! 
P(x) = 1)* fece Al 
(2) = zanl » "xin - Ei (n — 25) 7 pom) 
1 d | n! 
- 1 k . 2n—2k A. 
2"! dz” 2 k(n — k)! eat) 
The last summation is the binomial expansion of (x? — 1)” so that 
1 d" 
P,,(x) = mee ee e. .4. 
(2) = gis 0) (6.4.17) 


1? See Smythe, W. R., 1950: Static and Dynamic Electricity. McGraw-Hill, Section 5.215, for an example. 


13 See Carrier, G. F., M. Krook, and C. E. Pearson, 1966: Functions of the Complex Variable: Theory 
and Technique. McGraw-Hill, pp. 212-213. 


14 Rodrigues, O., 1816: Mémoire sur l'attraction des sphéroides. Correspond. l'École Polytech., 3, 
361—385. 
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4.0 -0.5 0.0 0.5 1.0 
X 


Figure 6.4.2: The first four Legendre functions of the second kind. 


Another method for computing P, (x) involves the use of recurrence formulas. The first 
step in finding these formulas is to establish the fact that 


(1 +h? — 2h) 17? = P(x) + hPi(x) +h? Po(x) +. (6.4.18) 


The function (1 + h? — 2zh)-V/? is the generating function for P,(x). We obtain the 
expansion via the formal binomial expansion 


(1-- Ah? — 2zh) 1/7? =1 + l(2xh — h?) + 131 (23h — h?)?2 4... (6.4.19) 


13 
222! 


Upon expanding the terms contained in 2x — h? and grouping like powers of h, 


(1 +R? — 2zh) 1/7? — 1-- zh + (322 — D)? +. (6.4.20) 


A direct comparison between the coefficients of each power of h and the Legendre polynomial 
P,,(«) completes the demonstration. Note that these results hold only if |x| and |h| < 1. 

Next we define W(a,h) = (1 + h? — 2xh)-!/?. A quick check shows that W(x, h) 
satisfies the first-order partial differential equation 


oW " 
Oh 
The substitution of Equation 6.4.18 into Equation 6.4.21 yields 


(1 — 2xh +h’) (h — z)W =0. (6.4.21) 


(1— 2xh + h?) Y nP, (zx)?! + (h — x) Y P,(x)h” = 0. (6.4.22) 
n=0 


n=0 


Setting the coefficients of h” equal to zero, we find that 


(n + 1)Pr4i(@) — 2nz P, (x) + (n — 1) P4 1(x) + Pa-1 (£) — x P5(x) = 0, (6.4.23) 


Or 


(n + 1)Pr4i(@) — (2n + 1) P, (x) + nPp-1(x) = 0 (6.4.24) 


with n = 1,2,3,.... 
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Similarly, the first-order partial differential equation 


(1— 2xh + ud -hW =0 (6.4.25) 
leads to " " 
(1— 2xh +h?) V ^ P (z)h" — M P, (z)h"*! = 0, (6.4.26) 
n=0 n=0 
which implies 
Pa (2) — 2z P, (x) + P 1(z) — Pa(x) = 0. (6.4.27) 


Differentiating Equation 6.4.24, we first eliminate P5 ,(x) and then P,,,(r) from the 
resulting equations and Equation 6.4.27. This gives two further recurrence relationships: 


PuaOc)-seB)-in41P,)-20, n20,,2,..., (6.4.28) 
and 
xP’ (£) — P.4(2)—nP,(@)=0, n21,2,3,... (6.4.29) 


Adding Equation 6.4.28 and Equation 6.4.29, we obtain the more symmetric formula 


Phai(x) — Py_i(z) = (2n +1)P,(z), n = 1,2,3,.... (6.4.30) 


n 


Given any two of the polynomials P a41 (x£), P,(x), and P,-1(x), Equation 6.4.24 or Equation 
6.4.30 yields the third. 

Having determined several methods for finding the Legendre polynomial P,,(x), we now 
turn to the actual orthogonality condition.!° Consider the integral 


m f. VERRE a -—xpo He: (6.4.31) 
= [ +hP,(x) AP) 

x [Po(z) + £Pi(x) +--+ +4" P,(x) +++] dz (6.4.32) 

B 3 Y uid ra Py (2) Py (2) dz. (6.4.33) 


n=0 m=0 


On the other hand, if a = (1 + h?)/2h, and b = (1 + t?)/2t, the integral J is 


: dx 
J= 6.4.34 
k V1 +R? — 2zh V1 +e -—2rt ( ) 
[ = =. mtu), (6.4.35) 
E = x A. 
2Vht J-1 /a—z V/b—z VhtJ1 Ja—r+vb—-« 
1 D 1 1 E) 
-——ml =+ Vb- = . 6.4.36 
Ja n(va x£ + a| Jia (rt ( ) 


15 See Symons, B., 1982: Legendre polynomials and their orthogonality. Math. Gaz., 66, 152-154. 
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Some Useful Relationships Involving Legendre Polynomials 


Rodrigues’s formula 


Recurrence formulas 
(n + 1)Pr4i(x) — (2n  D)z P, (x) +nPp_-i(x) = 0, mS, A 


Py (z) — P, 4(z) = (2n + DP, (2), n —1,2,3,... 


Orthogonality condition 


" 0, m £n, 
J P,(x)Pm (x) dz = 2 
—1 — 
Daan 2 


But a+1 = (1+h?+2h)/2h = (1+h)?/2h, and a—1=(1—h)?/2h. After a little algebra, 


1 1+ Vht 2 | 1 1 
J= In = Vht + =a (ht) ARES +- 6.4.37 
Vht (Xx) Vht 3V( ) 3 ) ( ) 
ht n Ane” 
Zo ; IE (6.4.38) 


As we noted earlier, the coefficient of h”t™ in this series is f = P,(«)Pm(x) dz. If we match 
the powers of h”t™, the orthogonality condition is 


0, m mn, 


Thi — mn. 


y P, (x) PA (x) dz = { (6.4.39) 


Jan 
2n+1? 


With the orthogonality condition, Equation 6.4.39, we are ready to show that we can 
represent a function f(a), which is piece-wise differentiable in the interval (—1,1), by the 
series: 


f(a) = Y AmPm(z),  —l<a#<1. (6.4.40) 


m=0 


To find Am we multiply both sides of Equation 6.4.40 by P,,(a) and integrate from —1 to 1: 


J f(2)Pa (2) de = V. Am / P, (x) Pym (x) dz. (6.4.41) 
-1 m=0 -1 
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All of the terms on the right side vanish except for n = m because of the orthogonality 
condition, Equation 6.4.39. Consequently, the coefficient A, is 


An | Pi) dx = T f(x)Pa(x)dz, (6.4.42) 
or 
ds mi : I l F(a) P, (2) da. (6.4.43) 


In the special case when f(x) and its first n derivatives are continuous throughout the 
interval (—1,1), we may use Rodrigues’ formula to evaluate 


[5 f(x x)dz = Su TT f(x (x? ax To Dr dts c Le a pope (x) dx 
(6.4.44) 


by integrating by parts n times. Consequently, 


d. A] / (1 — a2)" f™ (x) dz. (6.4.45) 


A particularly useful result follows from Equation 6.4.45 if f(a) is a polynomial of degree k. 
Because all derivatives of f(x) of order n vanish identically when n > k, An =0 if n > k. 
Consequently, any polynomial of degree k can be expressed as a linear combination of the 
first k +1 Legendre polynomials |P (x), ..., P, (z)]. Another way of viewing this result is to 
recognize that any polynomial of degree k is an expansion in powers of z. When we expand 
in Legendre polynomials we are merely regrouping these powers of x into new groups that 
can be identified as Po(x), Pı (x), Po(x),..., P(x). 


e Example 6.4.1 
Let us use Rodrigues’ formula to compute P»(x). From Equation 6.4.17 with n = 2, 


1 d 1d 1 
maga 1)?] = ze 22? 1) = 5327 1). (6.4.46) 


P, (x) = 


e Example 6.4.2 
Let us compute P3(a) from a recurrence relation. From Equation 6.4.24 with n = 2, 
3P3(x) — 5x P(x) + 2P\(x) = 0. (6.4.47) 


But P2(x) = (32? — 1)/2, and P,(x) = x, so that 


3P3(x) = 5aP2(x) — 2P, (x) = 52[(3a? — 1)/2] — 2a = 322? — 3a, (6.4.48) 


The Sturm-Liouville Problem 265 


Or 
P3(x) = (bz? — 32)/2. (6.4.49) 
e Example 6.4.3 
We want to show that 
1 
i, P.(x) dx = 0, n> 0. (6.4.50) 
-1 
From Equation 6.4.30, 
1 1 
(2n 4- » | Py (a) dx = / [Pi 41(z) ^ P,_j(x)| dx (6.4.51) 
-1 -1 
= P.ua(z) — Phila) (6.4.52) 


c wW)ePB.-—PBaciüsBxC1)50, (6453) 


because P,(1) = 1 and P,(—1) = (—1)". 


e Example 6.4.4 


Let us express f(x) = x? in terms of Legendre polynomials. The results from Equation 
6.4.45 mean that we need only worry about P(x), P(x), and P2(x): 


Substituting for the Legendre polynomials, 
a” = Ao + Aiz + 1AS (32? — 1), (6.4.55) 


and 
Ao—i, A120, and A5-$. (6.4.56) 


e Example 6.4.5 


Let us find the expansion in Legendre polynomials of the function: 


0, —]«mc«90, 
f(z) — { Fo Wersi (6.4.57) 


We could have done this expansion as a Fourier series but in the solution of partial differ- 
ential equations on a sphere we must make the expansion in Legendre polynomials. 
In this problem, we find that 


1 
dots Ti P, (x) dz. (6.4.58) 
0 


266 
Therefore, 
1 
Ag = Ti l] dz = i. EU 
0 
1 
A-i[ z(32?—1)dr 20, and A= 
0 
so that 


f(x) = $Po(z) + {Pi (2) - Pale) + S Ps(x) +. 
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(6.4.61) 


Figure 6.4.3 illustrates the expansion, Equation 6.4.61, where we used only the first four 
terms. It was created using the MATLAB script 


clear; 


x 
f 


= zeros(size(x)); % initialize function f(x) 
for k = 1:length(x) % construct function f(x) 
if x(k) < 0; f(k) = 0; else f(k) = 1; end; 


end 


^ initialize Fourier-Legendre series with zeros 


flegendre - zeros(size(x)); 


^ 


a(1) 
a(4) 


read in Fourier coefficients 
1/2; a(2) » 3/4; a(3) 
-7/16; a(5) = 0; a(6) 


0; 
11/32; 


clf % clear any figures 
for n = 1:6 


^ 


compute Legendre polynomial 

N = n-1; P = legendre(N,x); 

compute Fourier-Legendre series 
flegendre = flegendre + a(n) * P(1,:); 


create plot of truncated Fourier-Legendre series 


with n terms 


if n==1 subplot(2,2,1), plot(x,flegendre,x,f,’--’); 
legend(’one term’,’f(x)’); legend boxoff; end 

if n==2 subplot(2,2,2), plot(x,flegendre,x,f,’--’); 
legend(’two terms’,’f(x)’); legend boxoff; end 

if n==4 subplot(2,2,3), plot(x,flegendre,x,f,’--’); 
legend(?four terms’,’f(x)’); legend boxoff; 


xlabel(’x’,’Fontsize’,20); end 


if n--6 subplot(2,2,4), plot(x,flegendre,x,f,’--’); 
legend(?six terms’,’f(x)’); legend boxoff; 


xlabel(’x’,’Fontsize’,20); end 
axis([-1 1 -0.5 1.5]) 


end 


[-1:0.01:1]; 4 create x points in plot 


As we add each additional term in the orthogonal expansion, the expansion fits f(x) better 
in the “least squares” sense of Equation 6.3.5. The spurious oscillations arise from trying 
to represent a discontinuous function by four continuous, oscillatory functions. Even if 
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Figure 6.4.3: Representation of the function f(x) = 1 for 0 < x < 1 and 0 for —1 < x < 0 by various 
partial summations of its Legendre polynomial expansion. The dashed lines denote the exact function. 


we add additional terms, the spurious oscillations persist, although located nearer to the 
discontinuity. This is another example of Gibbs phenomena.!9 See Section 5.2. 


e Example 6.4.6: Iterative solution of the radiative transfer equation 


One of the fundamental equations of astrophysics is the integro-differential equation 
that describes radiative transfer (the propagation of energy by radiative, rather than con- 
ductive or convective, processes) in a gas. 

Consider a gas that varies in only one spatial direction and that we divide into infinites- 
imally thin slabs. As radiation enters a slab, it is absorbed and scattered. If we assume 
that all of the radiation undergoes isotropic scattering, the radiative transfer equation is 


1 

js -I- af I dy, (6.4.62) 
dt E" 

where J is the intensity of the radiation, is the optical depth (a measure of the absorbing 

power of the gas and related to the distance that you travel within the gas), u = cos(0), 

and @ is the angle at which radiation enters the slab. In this example, we show how the 

Fourier-Legendre expansion!” 


I(r.p) = 9 Inl) Pau) (6.4.63) 
n=0 


may be used to solve Equation 6.4.62. Here [,,(7) is the Fourier coefficient in the Fourier- 
Legendre expansion involving the Legendre polynomial P, (y). 
We begin by substituting Equation 6.4.63 into Equation 6.4.62, 


x [(n + 2e i nPa—ı(u)] a = Y LP.) - Io, (6.4.64) 
n=0 ne 


16 Weyl, H., 1910: Die Gibbs'sche Erscheinung in der Theorie der Kugelfunktionen. Rend. Circ. Mat. 
Palermo, 29, 308-321. 


17 See Chandrasekhar, S., 1944: On the radiative equilibrium of a stellar atmosphere. Astrophys. J., 99, 
180-190. 
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where we used Equation 6.4.24 to eliminate LP, (u). Note that only the Jo(7) term remains 
after integrating because of the orthogonality condition: 


1 1 
J i P004 f. PPa) du - 0. (6.4.65) 
-1 -1 
if n > 0. Equating the coefficients of the various Legendre polynomials, 


n dlni n+1 dln4ı 


Wal sm RES de 05 6-268) 
for n = 1,2,... and 
dli 
—— zc. 6.4.6 
dr ? ( 7) 


Thus, the solution for I; is J} = constant = 3£/4, where F is the net integrated flux and 
an observable quantity. 


For n — 1, 
dlo 2dh 3F 
pL Se PER EFT 4.68 
dr 5dr ^| 4 ace) 
Therefore, 
In+ $1; — $Fr +A. (6.4.69) 


The next differential equation arises from n = 2 and equals 


2dh 3d _ 


Because J; is a constant and we only retain Jo, J1, and Jz in the simplest approximation, 
we neglect dI3/dr and Iz = 0. Thus, the simplest approximate solution is 


loj—i$Fr-A, Lh -—$ÍF, and I,-0. (6.4.71) 


To complete our approximate solution, we must evaluate A. If we are dealing with a 
stellar atmosphere where we assume no external radiation incident on the star, [(0,) = 0 
for —1 € u < 0. Therefore, 


1 oo 1 9 
J 1p) = Yrs f PaP d= LL e. (6472) 
—1 mco —1 2n + 1 
Taking the limit 7 — 0 and using the boundary condition, 
2 1 oo 1 
men) = f I(0, 1) P, (u) du = Limo | Pp) Pm (p) du. (6.4.73) 


Thus, we must satisfy, in principle, an infinite set of equations. For example, for n = 0, 1, 
and 2, 


2Io(0) = Io(0) + $15 (0) — $13(0) + 4515(0) +-->, (6.4.74) 
21,(0) = $1o(0) + 47 (0) + 222(0) — 440) +>, (6.4.75) 


and 


2I5(0) = (0) + $12(0) + $13(0) — 33515(0) +. (6.4.76) 
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Using Jı (0) = 3F/4, 


51o(0) + (0) — 3515(0) +--- = AF, 6.4.77) 

$1o(0) + $12(0) — 40) +- = FF, 6.4.78) 
and 

214(0) — 413(0) + GJs(0) + +++ = FF. 6.4.79) 


Of the two possible equations, Equation 6.4.77 or Equation 6.4.78, Chandrasekhar chose 
Equation 6.4.78 from physical considerations. Thus, to first approximation, the solution is 


I(u,r) = 3F (r+) + {Fut 6.4.80) 
Better approximations can be obtained by including more terms; the interested reader is 
referred to the original article. In the early 1950s, Wang and Guth!? improved the proce- 
dure for finding the successive approximations and formulating the approximate boundary 
conditions. 


Problems 


Find the first three nonvanishing coefficients in the Legendre polynomial expansion for the 
following functions: 


1/029, Jal < e€, 


0, —-1«r«90, E 
he c 2-9 ec bel <1 
3. f(x) =e, |z| « 1. 4. f(z) =2°, |z| « 1. 

—1], -1«z«0, | .j-14 -1«z«0, 
? se)={ 7 0 « x « 1. 6. (2-1; 0«zr«l. 


'Then use MATLAB to illustrate various partial sums of the Fourier-Legendre series. 
7. Use Rodrigues’ formula to show that P4(z) = (35x4 — 302? + 3). 


8. Given Ps(x) = $8355 — Vr’ + 4x and P;(x) from Problem 7, use the recurrence formula 
8 8 8 
for P,44(x) to find Ps(x). 


9. Show that (a) P,(1) = 1, (b) P,(-1) = (—1)", (c) P»441(0) = 0, and (d) Pon(0) = 
(—1)" (2n)!/ (2?"^nIn!). 


10. Prove that 


J P (t) dt = Sy Pla) —Pati(z)],  n>0. 


18 Wang, M. C., and E. Guth, 1951: On the theory of multiple scattering, particularly of charged particles. 
Phys. Rev., Ser. 2, 84, 1092-1111. 
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11. Given!’ 
ToT -2 f cos[(n + 1)v] -2f sin[(n + 3)«] "m 
v/2|[cos(z) — cos(0 \/2[cos(8) —cos(z)] ' 


show that the following generalized Fourier series holds: 


Ti E : = Pil cos()] cos [(n+ $)t], O<t<O<7, 
cos(t) — 2 cos(0 


if we use the eigenfunction y,(r) = cos |(n + 3) z], 0 < £ < v, r(x) = 1 and H(-) is 
Heaviside's step function, and 


7 TUE Am EN 5 P„[cos(0)] sin [(n + i) t] , 0<0<tK<z, 
n=0 


if we use the eigenfunction y,(x) = sin|[(n+4)a], 0 < x < m, r(x) = 1 and H(-) is 
Heaviside’s step function. 


12. The series given in Problem 11 are also expansions in Legendre polynomials. In that 
light, show that 

*  Palcos(0)] sin(@) ag sin[(n + 5) t] 
o V/2cos(0) — 2cos(t) nai 


and 
7  P4[cos(0)] sin(0) n cos (n + 3) t] 


+ y/2cos(t) — 2cos(0) ndi 


where 0 «t « m. 


13. (a) Use the generating function, Equation 6.4.18, to show that 


ja] « 1, 1 « |t]. 


eS to PAs 
V1— tae +P 2» 
(b) Use the results from part (a) to show that 


: -= V3 CMM), Jal <1. 


cosh(u) — a 


Hint: 
v2 
vcosh(u) -z vell — 2x +e TH 


19 Hobson, E. W., 1965: The Theory of Spherical and Ellipsoidal Harmonics. Chelsea Publishing Co., 
pp. 26-27. 
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14. The generating function, Equation 6.4.18, actually holds?? for |h| < 1 if || < 1. Using 
this relationship, show that 


1 
Y GT den 
2(1— 2) 


n=0 
and 
neg rt 1 i. — : 


Use these relationships to show that 
2n tl c. P, 1 
Y TERED Pala) ~ Y A = 
22 n+1 n+1 (1 —«x)/2 


if |z| < 1. 


" E , 
(1 = 2)/2 ' 


6.5 ANOTHER SINGULAR STURM-LIOUVILLE PROBLEM: BESSEL'S EQUATION 


In the previous section we discussed the solutions to Legendre's equation, especially 
with regard to their use in orthogonal expansions. In this section we consider another classic 
equation, Bessel's equation?! 


wy! + ay! + (Wa? — n?)y — 0, (S 


d ( dy 2 n? u 
E (e2) + (» z— -) y — 0. (6.5.2) 


Once again, our ultimate goal is the use of its solutions in orthogonal expansions. These 
orthogonal expansions, in turn, are used in the solution of partial differential equations in 
cylindrical coordinates. 

A quick check of Bessel's equation shows that it conforms to the canonical form of the 
Sturm-Liouville problem: p(x) = z, q(x) = —n?/z, r(x) = x, and \ = 42. Restricting 
our attention to the interval [0, L], the Sturm-Liouville problem involving Equation 6.5.2 is 
singular because p(0) = 0. From Equation 6.4.1 in the previous section, the eigenfunctions 
to a singular Sturm-Liouville problem will still be orthogonal over the interval [0, L] if (1) 
y(x) is finite and zy'(x) is zero at x = 0, and (2) y(x) satisfies the homogeneous boundary 
condition, Equation 6.1.2, at x = L. Consequently, we only seek solutions that satisfy these 
conditions. 

We cannot write down the solution to Bessel's equation in a simple closed form; as in 
the case with Legendre's equation, we must find the solution by power series. Because we 
intend to make the expansion about z — 0 and this point is a regular singular point, we must 


20 [pid., p. 28. 


21 Bessel, F. W., 1824: Untersuchung des Teils der planetarischen Störungen, welcher aus der Bewegung 
der Sonne entsteht. Abh. d. K. Akad. Wiss. Berlin, 1-52. See Dutka, J., 1995: On the early history of 
Bessel functions. Arch. Hist. Exact Sci., A9, 105-134. The classic reference on Bessel functions is Watson, 
G. N., 1966: A Treatise om the Theory of Bessel Functions. Cambridge University Press, 804 pp. 
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It was Friedrich Wilhelm Bessel's (1784-1846) apprenticeship to the famous mercantile firm of 
Kulenkamp that ignited his interest in mathematics and astronomy. As the founder of the German 
school of practical astronomy, Bessel discovered his functions while studying the problem of planetary 
motion. Bessel functions arose as coefficients in one of the series that described the gravitational 
interaction between the sun and two other planets in elliptic orbit. (Portrait courtesy of Photo 
AKG, London, with permission.) 


use the method of Frobenius, where n is an integer.?? Moreover, because the quantity n? 
appears in Equation 6.5.2, we may take n to be nonnegative without any loss of generality. 


To simplify matters, we first find the solution when u = 1; the solution for u Z 1 
follows by substituting ux for x. Consequently, we seek solutions of the form 


n=) Bem (6.5.3) 
k=0 
=) (2k + s) Bpa?" t-t, (6.5.4) 
k=0 
and 
y” (x) = XC (2k + s)(2k + s — 1) Bua? 1977, (6.5.5) 


> 
ll 
o 


22 This case is much simpler than for arbitrary n. See Hildebrand, F. B., 1962: Advanced Calculus for 
Applications. Prentice-Hall, Section 4.8. 
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where we formally assume that we can interchange the order of differentiation and summa- 
tion. The substitution of Equation 6.5.3 and Equation 6.5.5 into Equation 6.5.1 with u = 1 
yields 


Y (2k+8) (2k -- s — 1) Bya?**5 +) ks) Buz?** 123 Bpr?" t+ — n? 5 Bya?**5 = 0, 
k=0 k=0 k=0 k=0 
(6.5.6) 
or 
Y QE +s)? — n?]Bya?* + V^ pa??? = 0. (6.5.7) 
k=0 k=0 


If we explicitly separate the k = 0 term from the other terms in the first summation in 
Equation 6.5.7, 


(s? — n?) Bo + V ^ [Qm + s)? — n?]Bma?™ +X ^ Bya?*t? = 0. (6.5.8) 
mæl k=0 


We now change the dummy integer in the first summation of Equation 6.5.8 by letting 
m = k + 1 so that 


(s? — n?) Bo + M (2k + s +2)? — n?] Ba + B )a?*?? = 0. (6.5.9) 
k=0 


Because Equation 6.5.9 must be true for all z, each power of x must vanish identically. This 
yields s = +n, and 


[(2k + s + 2)? —n7]Bp11 + Bk = 0. (6.5.10) 


Since the difference of the larger indicial root from the lower root equals the integer 2n, we 
are only guaranteed a power series solution of the form given by Equation 6.5.3 for s = n. If 
we use this indicial root and the recurrence formula, Equation 6.5.10, this solution, known 
as the Bessel function of the first kind of order n and denoted by J;, (x), is 


jy ee (6.5.11) 


! 
E k(n 4- k)! 


To find the second general solution to Bessel's equation, the one corresponding to 
s — —n, the most economical method?? is to express it in terms of partial derivatives of 
Ja (x) with respect to its order n: 


Yn(x) = M ( REY E (6.5.12) 


Upon substituting the power series representation, Equation 6.5.11, into Equation 6.5.12, 


n—1 


2 1 (n— k—1) say2k-n 
Y, (x) = 7J.(2) (2/2) - — » T (5) 
OF T nt+2k 
2» a = [V(k 4- 1) - v(k +n +1), (6.5.13) 


?3 See Watson, G. N., 1966: A Treatise on the Theory of Bessel Functions. Cambridge University Press, 
Section 3.5, for the derivation. 
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Figure 6.5.1: The first four Bessel functions of the first kind over 0 € x < 8. 


where 


1 1 
Ce eat ae e eee (6.5.14) 


w(1) = —y, and y is Euler’s constant (0.5772157). In the case of n = 0, the first sum in 
Equation 6.5.13 disappears. This function Y, (z) is Neumann’s Bessel function of the second 
kind of order n. Consequently, the general solution to Equation 6.5.1 is 


y(x) = AJs(uz) + BY. (ua). (6.5.15) 
Figure 6.5.1 illustrates the functions Jo(x), J1(x), Jo(x), and Ja(x) while Figure 6.5.2 gives 


Yo(z), Yi(a), Yo(x), and Y3(z). 
An alternative solution to Equation 6.5.1 is 


y(x) = CH (a) + DH® (a), (6.5.16) 
where 
HY (£) = JIn(x) + i¥n (£), (6.5.17) 
and 
HP (x) = Jn(x) — i¥n(a). (6.5.18) 


These functions HP (a), H? (x) are referred to as Bessel functions of the third kind or 
Hankel functions, after the German mathematician Hermann Hankel (1839-1873). The 
advantage of Hankel functions over the conventional Bessel function is most clearly seen in 
their asymptotic expansions: 


2, 
Ho P" .7  ji(z—nm/2—m/4) 6.5.19 
n 08 t ; (6.5.19) 
and 
2 : 
He) A DAE —i(z-nn/2—m/4). 6.5.20 
9o) s 3. e (6.5.20) 
for |z| — oc. 


An equation that is very similar to Equation 6.5.1 is 


dy 


dy 
2 
de 


T r2 (n? + z2)y — 0. (6.5.21) 
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Figure 6.5.2: The first four Bessel functions of the second kind over 0 € x < 8. 


It arises in the solution of partial differential equations in cylindrical coordinates. If we 
substitute iz = t (where i = /—1) into Equation 6.5.21, it becomes Bessel’s equation: 


dy d 
y EU 4 (P — n2)y — 0. (6.5.22) 


2 
: dt? dt 


Consequently, we may immediately write the solution to Equation 6.5.21 as 
y(x) = cı Jn (ix) + c2Yn (ix), (6.5.23) 
if n is an integer. Traditionally the solution to Equation 6.5.21 has been written 


y(x) = es (z) + e Ks (£) (6.5.24) 
rather than in terms of J,, (iz) and Y, (iz), where 


Do (z/2 )2k+n 
lxi 2er (6.5.25) 


and " 
K(x) = iin [Js (iz) + iY, (ix)] . (6.5.26) 


The function In (x) is the modified Bessel function of the first kind, of order n, while Kn (x) 
is the modified Bessel function of the second kind, of order n. Figure 6.5.3 illustrates Io(x), 
I(x), D(x), and I3(z) while in Figure 6.5.3 Ko(x), Ki1(x), Ko(x), and K3(a) are graphed. 
Note that K,,(x) has no real zeros while J„(x) equals zero only at x = 0 for n > 1. 

As our derivation suggests, modified Bessel functions are related to ordinary Bessel 
functions via complex variables. In particular, J4(iz) = i" I,(z), and I,(tz) = i" Ja (z) for 
z complex. 

Although we found solutions to Bessel's equation, Equation 6.5.1, as well as Equation 
6.5.21, can we use any of them in an eigenfunction expansion? From Figures 6.5.1—-6.5.4 we 
see that J;,(z) and L, (x) remain finite at z = 0 while Y, (x) and Kn (x) do not. Furthermore, 
the products xJ} (x) and zI} (x) tend to zero at x = 0. Thus, both J,(x) and L,(x) satisfy 
the first requirement of an eigenfunction for a Fourier-Bessel expansion. 

What about the second condition, that the eigenfunction must satisfy the homogeneous 
boundary condition, Equation 6.1.2, at x = L? From Figure 6.5.3 we see that I,(x) can 
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Figure 6.5.3: The first four modified Bessel functions of the first kind over 0 € x < 3. 


never satisfy this condition, while from Figure 6.5.1, J,(z) can. For that reason, we discard 
I,(«) from further consideration and continue our analysis only with Jn (x). 

Before we can derive the expressions for a Fourier-Bessel expansion, we need to find 
how J,(x) is related to J4,,1(x) and J4 1(x). Assuming that n is a positive integer, we 
multiply the series, Equation 6.5.11, by x^ and then differentiate with respect to x. This 
gives 


pou o NS(CD Qn 2E? pe (ODE a 
"i Ail >, QMPERn-EE 2. Ha-irgr 79 
(6.5.27) 
d n T, 
dg E. Jn] = a Jain) (6.5.28) 


for n = 1,2,3,.... Similarly, multiplying Equation 6.5.11 by z^", we find that 


d —n —n 
T [27^ J,(9)] = 27” Jn (2) (6.5.29) 
for n = 0,1,2,3,.... If we now carry out the differentiation on Equation 6.5.28 and Equation 


6.5.29 and divide by the factors z^", we have that 
JI (2) + EA = Jn—i(x), (6.5.30) 


and 
Ji (a) — Lala) zu ort (6.5.31) 
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Figure 6.5.4: The first four modified Bessel functions of the second kind over 0 € x < 3. 


Equation 6.3.30 and Equation 6.3.31 immediately yield the recurrence relationships 


Jaa (a) + Ina a) = Z (a) (6.5.32) 
and 
Paes Aw (6.5.33) 


for n = 1,2,3,.... For n = 0, we replace Equation 6.5.33 by Jj(x) = —Ji(x). Many of the 
most useful recurrence formulas are summarized in Table 6.5.1 for Bessel functions and in 
Table 6.5.2 for Hankel functions. 

Let us now construct a Fourier-Bessel series. The exact form of the expansion depends 
upon the boundary condition at r = L. There are three possible cases. One of them is 
y(L) = 0 and results in the condition that J,(u,£) = 0. Another condition is y'(L) = 0 
and gives J! (uj L) = 0. Finally, if hy(L) + y'(D) = 0, then hJn(ukL) + wed (py L) = 0. In 
all of these cases, the eigenfunction expansion is the same, namely 


f(x) = M5 AxJs (us), (6.5.34) 
k=1 


where juz is the kth positive solution of either Jn(uk L) = 0, J) (uy D) = 0, or hJn(ukL) + 
pu Jo (HL) = 0. 
We now need a mechanism for computing Az. We begin by multiplying Equation 6.5.34 
by tJn(Mmx) dz and integrate from 0 to L. This yields 
L 
0 


oo L 
Daf Jala) Ilma) dz = f Lf (2) In(Umx) da. (6.5.35) 
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Table 6.5.1: Some Useful Relationships Involving Bessel Functions of Integer Order 


2 
In—1(2) + Inga (Zz) = —Ja(2); n=1,2,3,... 
Jn-1(2)— Jn41(2) HI), n = 1,2,3,...; 6) ==) 
AEE ast (z) —1,2,3 
dz dnl) = 2" Jn), n =1,2,3,... 
i "Ja(z)| = -27"In41(2) = 0,1,2,3 
di n Ve = -Z n+1\4), n —U,1,2,9,... 
2n 
In-1(2) — Ingi(z) = —In(z), ete 8 ee 
z 
In-1(2) + In41(2) = 202), n= 1,2,3,...; He) ed) 
2n 
Ky_-1(2) — Kn41(2) = —-— Ky (2), qaid 2 9. 
z 
K,-1(2) + Kn41(2) = -2K; (2), n= 1, 2,3, ee Ko(z) m —Ki(z) 


Jd tae m = ermm UE] 


Toler = QUT 


mmi _ ,—mmmi A .cos(mnm) 
Ky, (ze) =e Ky(z) BE I, (2) 
In(z) = eni? J, (ze™/?), —r < arg(z) € 7/2 
Ia) eng J, ge omm 7/2 < arg(z) < T 


From the general orthogonality condition, Equation 6.2.1, 
L 
| LI n(x) In(bmx) dx = 0, (6.5.36) 
0 
if k A m. Equation 6.5.35 then simplifies to 


L L 
x 2 x t = x x x H be «9. 
An f I(t) d | f(x), (ugs) da, (6.5.37) 


or 


L 
Ay = x | z f (a) In(uRx) dx, (6.5.38) 


where 


L 
a= f zJ2 (uya) da, (6.5.39) 
0 
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Table 6.5.2: Some Useful Recurrence Relations for Hankel Functions 


d d 
n rr(p) suom (p) = " (p) = (p) 
ES E HX ()] x” Hy (x£), n L2,..53: |n (x)| Hi (x) 
d —n rr(p) —n pr») 
as E HY (x)| = —4 "HY, (2), n —0,1,2,3,... 


| 2n 


HP (x) + HO? (a) H(z),  n-2128.. 


dH? 
HP (x) — H®) (2) = gate () n=1,2,3,... 
L 


and k replaces m in Equation 6.5.37. 
The factor Cp depends upon the nature of the boundary conditions at x = L. In all 
cases we start from Bessel’s equation 


2 
[x J} (ux) + (is 2 =) In (pax) = 0. (6.5.40) 
If we multiply both sides of Equation 6.5.40 by 2zJ; (uz), the resulting equation is 


(ida? — n?) [2] = — eZ Qe (6.5.41) 


An integration of Equation 6.5.41 from 0 to L, followed by the subsequent use of integration 
by parts, results in 


L L 
L 
(i42? — n?) J2(une)| 5 — 22 J zJ3(uxz) dz — — [oJ ue | - (6.5.42) 
0 


Because J;,(0) = 0 for n > 0, Jo(0) = 1 and zJ}, (x) = 0 at x = 0, the contribution from the 
lower limits vanishes. Thus, 


L 
1 
Ck = J xJ? (ugr) dx = TA Ga — n?)J2(uy L) + Pr D|, (6.5.43) 
0 k 


Because h 
Jp (ukt) = PELO) — Bi In+1 (HEX) (6.5.44) 


from Equation 6.5.31, Ck becomes 


Cr = 11722 a (uel), (6.5.45) 
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if Js (uj D) = 0. Otherwise, if J}, (uk L) = 0, then 


2I2. n2 
= Fe Jh). (6.5.46) 


Ck 
2u 


Finally, 


Jo Cobos (6.5.47) 


if iJ, (uL) = —hJs (uL). 

All of the preceding results must be slightly modified when n = 0 and the boundary 
condition is Jo(uy L) = 0 or ukJı(ukL) = 0. This modification results from the additional 
eigenvalue uo = 0 being present and we must add the extra term Ao to the expansion. For 
this case the series reads 


f(x) = Ao + 3 5 ArJo(uee), (6.5.48) 
k=1 


where the equation for finding Apo is 


9 [fL 
Ao = = 6.5.49 
IERES (6.5.49) 
and Equation 6.5.38 and Equation 6.5.46 with n = 0 give the remaining coefficients. 
e Example 6.5.1 


Starting with Bessel's equation, we show that the solution to 


1-2 APOO, 
af yg (eu + x) y=0 (6.5.50) 
x x 

is 

y(x) = Az" J, (bx^) + Bz^Y,(bz*), (6.5.51) 
provided that bx* > 0 so that Y, (bx^) exists. 
'The general solution to 
dn | dm 
2 2.542 

= 5.52 

is 
n = AJ4(£) + BY, (£). (6.5.53) 


If we now let n = y(z)/x^ and € = bx*, then 


d drd xie d 
= = 6.5.54 
dE dé dx bc da’ ood 
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2 2—2c 2 —1 1—2c 
d c d (c — 1)z d 6.5.55) 
dé? 2e dx? bc? dx 
dy 1 dy a 
= 6.5.56 
dx ( ) zadr g” ) 
and g 2 p ( ) 
y ) 1 d^y 2a dy a(l+a 
= 6.5.57 
dx? ( xe dx? xt! dg gate ) 


Substituting Equation 6.5.54 and Equation 6.5.57 into Equation 6.5.52 and simplifying 
yields the desired result. 


e Example 6.5.2 


Let us find?* the general solution to the nonhomogeneous differential equation 


dy 1dy 
| bye 6.5.58 
dr? rdr á (r), ( ) 
where k is a real parameter. 
The homogeneous solution is 
yg (r) = Cy Ip(kr) + C» Ko(kr). (6.5.59) 


Using variation of parameters, we assume that the particular solution can be written 


yp(r) = A(r)o(kr) + B(r)Ko(kr), (6.5.60) 
where (kr) 
/ _ 0 Ko( (kr) Io( (kr) Ko kr 
As | —S(r) kKi(kr) os KI i KK? (kr) |? 165.61) 
and (kr) 
/ Io( (kr) Io( (kr) Ko(kr 
Sc | KI "E E KI a kKi (ker) |: One) 
Expanding the determinants, we find 
A'(r) = S(r) Ko(kr)/ {k [Io(kr) Ko(kr) — I6(kr) Ko(kr)}} (6.5.63) 
and 
B'(r) = —S(r)Io(kr)/ {k [lo(kr) Ko (kr) — Ig(kr)Ko(kr)]) . (6.5.64) 
Evaluating the Wronskian?? for modified Bessel functions, 
Ip(z)Ko(2) — Io(z)Ko(z) = 1/2 (6.5.65) 
A'(r) = —rS(r)Ko(kr) and B'(r) = rS(r)Io(kr) (6.5.66) 


24 See Hassan, M. H. A., 1988: Ion distribution functions during ion cyclotron resonance heating at the 
fundamental frequency. Phys. Fluids, 31, 596—599. 


?5 Watson, op. cit., p. 80, Formula 19. 
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Integrating Equation 6.5.66, 

A(r)2— J xS (x)Ko(kx) dz, and B(r) = f xS (x)Io(kx) dz. (6.5.67) 
Consequently, the general solution is the sum of the particular and homogeneous solution, 


T: 


y(r) = Cilo(kr) + C9 Ko(kr) — Io(kr) i xS (x)Ko(kx) dx + Koler) f xS (x)lHo(kz) dx. 


(6.5.68) 
e Example 6.5.3 
Let us show that 
xr? Je (x) = (n? —n— x°) Ja (£) + tIn 41 (a). (6.5.69) 
From Equation 6.5.31, 
Ji (a) =" Iy(x) — Inga(2), (6.5.70) 
x 
Ji (a) = — J (x) + 2S 6.5.71 
nt) = z2 n(x) T T a(z) ngal); ( ses. ) 
and 
‘i n n [n n+1 
Ha) = -Fle + = [7 Jnl) — Inea(@)] - JA) - — mac) (6.8.72) 
after using Equation 6.5.30 and Equation 6.5.31. Simplifying, 
2 
— J; 
J! (x) = (* ao 1) In (x) + Jasle) (6.5.73) 
x E 
After multiplying Equation 6.5.73 by z?, we obtain Equation 6.5.69. 
e Example 6.5.4 
Let us show that 
a? Jo(x) dx = a? Ja(a) — 2af Ja(a). (6.5.74) 


0 


We begin by integrating Equation 6.5.74 by parts. If u = z?, and dv = x3 Jo(x) dz, 
then 


T a? Jo (x) dx = a?Js(x)|; — 2 z^ J3(x) dz, (6.5.75) 
0 0 


because d[x? J4(x)]/dx = x? Jo(x) by Equation 6.5.28. Finally, 


Jr x Jo(x) dx = a?Ja(a) — 2x4 Ja (ar) |, = a? J3(a) — 2a*J4(a), (6.5.76) 


since z^Ja(z) = d[x*J4(x)]/dx by Equation 6.5.28. 
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e Example 6.5.5 


Let us expand f(r) = z, 0 < a « 1, in the series 


f(x) = M5 Agi (usa), (6.5.77) 


k=1 


where up denotes the kth zero of Jı (u). From Equation 6.5.38 and Equation 6.5.46, 


9 1 
Ay = zo | xr? J x) dz. 6.5.78 
k Piun) Jo (uz) ( ) 
However, from Equation 6.5.28, 
d 
dz [a7 Jo(x)] = 27 J, (x), (6.5.79) 
t 


if n = 2. Therefore, Equation 6.5.78 becomes 


2x? J. "n 2 
A= Jal) = (6.5.80) 
Hy J3 (uk) 0 px Jo (Lk) 
and the resulting expansion is 
e 
gagy UNE — genet (6.5.81) 


£— HkJ2luk) 


Figure 6.5.5 shows the Fourier-Bessel expansion of f(x) — x in truncated form when we 
only include one, two, three, and four terms. It was created using the MATLAB script 


clear; 
x = [0:0.01:1]; 4 create x points in plot 
f = x; % construct function f(x) 
^4 initialize Fourier-Bessel series 
fbessel = zeros(size(x)); 
^4 read in the first four zeros of J_1(mu) = 0 
mu(1) 3.83171; mu(2) = 7.01559; 
mu(3) 10.17347; mu(4) = 13.32369; 
clf % clear any figures 
for n = 1:4 
4 Fourier coefficient 
factor - 2 / (mu(n) * besselj(2,mu(n))); 
% compute Fourier-Bessel series 
fbessel - fbessel * factor * besselj(1,mu(n)*x); 
^4 create plot of truncated Fourier-Bessel series 
^ with n terms 
subplot(2,2,n), plot(x,fbessel,x,f,?--?) 
axis([O 1 -0.25 1.25]) 
if n == 1 legend(’1 term’,’f(x)’); legend boxoff; 
else legend([num2str(n) ? terms?],?f(x)?); legend boxoff; 
end 
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—— 2terms 
T= £09) - 


= 


0 0.5 1 0 0.5 1 


Figure 6.5.5: The Fourier-Bessel series representation, Equation 6.5.81, for f(x) =x, 0 < z « 1, when we 
truncate the series so that it includes only the first, first two, first three, and first four terms. 


if n > 2 xlabel(’x’,’Fontsize’,20); end 
end 


e Example 6.5.6 


Let us expand the function f(x) = z?, 0 < x < 1, in the series 
f(x) = 5 Ay Jo (uii), (6.5.82) 
k=1 
where jz, denotes the kth positive zero of Jo(u). From Equation 6.5.38 and Equation 6.5.46, 
2 d 
Ak = m z^ Jo(ui az) dz. (6.5.83) 
If we let t = ix, the integration, Equation 6.5.83, becomes 
2 PR 
Ay = a t? Jo(t) dt. (6.5.84) 
We now let u = t? and dv = tJo(t) dt so that integration by parts results in 


2 Hk 2 Hk 

Es c ERG es 2 | t? J,(t) a| He Lin) - 2 | t? J, (t) a, 
ui Jiu) | fo 0 ui Ji (uk) r 0 

(6.5.85) 

because v = tJ4(t) from Equation 6.5.28. If we integrate by parts once more, we find that 


BE ER 
Ji(uk) | be IN 


k 


Ax (6.5.86) 


2 3 2 | 
= 55S J- — Quiz d: = 


However, from Equation 6.5.32 with n = 1, 


Si (we) = uk [J2(uk) + Jo(ux)] ; (6.5.87) 
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Figure 6.5.6: The Fourier-Bessel series representation, Equation 6.5.90, for f(x) = 2327, 0 « v « 1, when 
we truncate the series so that it includes only the first, first two, first three, and first four terms. 


Or 


Aus) = zn. (6.5.88) 


because Jo(uk) = 0. Therefore, 


P 2(uy — 4) Ji (ue) 


6.5.89 
URI? (Me) ( ) 


and = 
>> (ui, — 4) Jo(un2) 


; 0«zr«1. 6.5.90 
TT, 


Figure 6.5.6 shows the representation of x? by the Fourier-Bessel series, Equation 6.5.90 
when we truncate it so that it includes only one, two, three, or four terms. As we add each 
additional term in the orthogonal expansion, the expansion fits f(x) better in the “least 
squares" sense of Equation 6.3.5. 


Problems 


1. Show from the series solution that 


A [Jo(ka)] = -k (ka). 


From the recurrence formulas, show the following relations: 


2. 3J! (v) = Ja(z) — Jo(z) 3. Jo(x) = I(x) — Jf 


~ 


Jo (x) = ^) + (3 = i) Jo (x) 
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5. 

h) 1 Jf) 2 Je) _ 2, A) 

Jls) x Jle) s A(x) x Jom) 
6: 48 8 24 
T. 

Jairo nei 2072 aua) eo e ata) 
8. 7 
Ja (a) = (= — 1) Ji (x) a — Jo(x) 

9. 


AJ, (x) = In—2(x) — 2Jn (x) + Jn2(x) 


10. Show that the maximum and minimum values of J;(x) occur when 


nJ, (a) nJ, (a) 


, X : 
Jn tal) Jac) 


and Jn-1(£) = Jn41 (x). 


qp = 


Show that 
11. d 
Ps [7?J3(2z)] = =£ J3 (21) + 2x? Jo (2x) 

x 

12. 
— [x Jo (a?)] = Jo(z?) — 2x? J, (a?) 
13. 
f 262 dx = $23Js (32) + C 

14. 

fa) dx = —h4~7Jn(2x) + C 
15. 

J ias dx = Jo(x) + xIn(x)Jı (£) + C 

16. 


a 2 k 
| tJo(kx) dz = en) 
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1T. 
f z(1— a”) Jo(ka) dx = alk) — pa Jo) 


18. 


n x? Jo(ka) dz = k3 Ji(Kk) I zz Jolk) 


19. Show that 
L3 E 0 € x « 1, 


where ju, is the kth positive root of Jo(u) = 0. Then use MATLAB to 
partial sums of the Fourier-Bessel series. 


20. Show that 


1-2?  && Jolmer) 
8 » us Jui) 


where up is the kth positive root of Jo(u) = 0. Then use MATLAB to 
partial sums of the Fourier-Bessel series. 


21. Show that 
Ay — r? = —16 Jil hun) 


0x x x2, 
he < ug Jo (Que) 2uk)' 


where up is the kth positive root of J;(24) = 0. Then use MATLAB to 
partial sums of the Fourier-Bessel series. 


22. Show that 


= u? — 8) Ji (ua) 
=2 O0<a<l 
exc ui Jo (uu) 


where ju, is the kth positive root of Jı(u) = 0. Then use MATLAB to 
partial sums of the Fourier-Bessel series. 


23. Show that 


pay Hanlon) Auer) 0czczl, 
nm J? (Hx) 


where up is the kth positive root of Ji (u) = 0. Then use MATLAB to 
partial sums of the Fourier-Bessel series. 


24. Show that 


1-24 ogy d MD. 0czczl, 
ei 2 J1 (ux) 


where up is the kth positive root of Jo(u) = 0. Then use MATLAB to 
partial sums of the Fourier-Bessel series. 


illustrate 


illustrate 


illustrate 


illustrate 


illustrate 


illustrate 
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various 


various 


various 


various 


various 


various 
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25. Show that 


S Jo(um/L) 
1 = 2aL 0<a<L 
i > (ui, + œL?) Jolu) ues 


where px is the kth positive root of wJi(w) = aLJo(u). Then use MATLAB to illustrate 
various partial sums of the Fourier-Bessel series. 


26. Using the relationship?? 


° 8J, (aa) J! (Ba) — aa Jy (Ba) J! 
[i Talaria) rar = EAA) = np 
show that 
Jo(ba) — Jo(ba) 202 = Jo (tnx) 
Jo (ba) ^ a DET 2 —52)J (upa) 0<2<a, 


where up is the kth positive root of Jo(ua) = 0 and b is a constant. 


27. Given the definite integral?” 


show that 


H(t-s) _ 95> sin (ust) Jon) Vern pure 


Mu IZ (He) 
where up is the kth positive root of Jo(js) = 0 and H(-) is Heaviside's step function. 


28. Using the same definite integral from the previous problem, show?? that 


H(a — na/ b) Jo(pu a [b 
(a — x) E oltnt/b) gen ch, 


Unde (un) 


where a < b, Un is the nth positive root of Jj(u) = —Jı(u) = 0, and H(-) is Heaviside’s 
step function. 


29. Given the definite integral?’ 


a sin(av? +c?) avb + c?) 
Jol bv a? — x? | dz = ; 0 « b, 
| cos(cx) o( a?-—g ) g= E ce 


26 Watson, op. cit., Section 5.11, Equation 8. 
27 Gradshteyn, I. S., and I. M. Ryzhik, 1965: Table of Integrals, Series, and Products. Academic Press, 
Section 6.567, Formula 1 with v = 0 and u = —1/2. 


28 See Wei, X. X., 2000: Finite solid circular cylinders subjected to arbitrary surface load. Part II- 
Application to double-punch test. Int. J. Solids Struct., 37, 5733-5744. 


?9 Gradshteyn and Ryzhik, op. cit., Section 6.677, Formula 6. 
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show that 


cosh(bv/t? — x?) Hü- 2) 9 2 sin(t B p) Jo(uim) 
————— z)-— 
ve — 2 o E vuU Jua) 


where 0 < x < a, uk is the kth positive root of Jo(ua) = 0, H(-) is Heaviside's step function, 
and 6 is a constant. 


30. Using the integral definition of the Bessel function?? for Jı (z): 


2 f' tsin(zt) 
Lic.) BEST 
i AF 


dt, 0 « z, 


show that 


x T nmt . [NTE 


where H(-) is Heaviside’s step function. Hint: Treat this as a Fourier half-range sine 
expansion. 


31. Show that 


d(x Mec ^ Jolurb/a)J o(uer/a) 


a? i J? (ux) 


O0<a,b<a, 


where up is the kth positive root of Jo(u) = 0 and 6(-) is the Dirac delta function. 


32. Show that 


Jo(ukz/a) 
a a Pl) 0<2 <a, 


where up is the kth positive root of Jo(js) = 0 and ó(-) is the Dirac delta function. 


6.6 FINITE ELEMENT METHOD 


In Section 1.7 we showed how to solve ordinary differential equations using finite differ- 
ences. Here we introduce a popular alternative, the finite element method, and will use it 
to solve the Sturm-Liouville problem. One advantage of this approach is that we can focus 
on the details of the numerical scheme. 

The finite element method breaks the global solution domain into a number of simply 
shaped subdomains, called elements. The global solution is then constructed by assembling 
the results from all of the elements. A particular strength of this method is that the 
elements do not have to be the same size; this allows us to have more resolution in regions 
where the solution is rapidly changing and fewer elements where the solution changes slowly. 
Overall the solution of the ordinary differential equation is given by a succession of piecewise 
continuous functions. 


30 Gradshteyn and Ryzhik, Ibid., Section 3.753, Formula 5. 
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Consider the Sturm-Liouville problem 


= [ora + q(x)y(x) — Ar(x)y(x) = 0, a<xx< b, (6.6.1) 

with 
y(a) ^ 0, or X p(a)y'(a) = 0, (6.6.2) 

nd 
i y(b) = 0, or  p(b)y'(b) = 0. (6.6.3) 


Our formulation of the finite element approximation to the exact solution is called the 
Galerkin weighted residual approach. This is not the only possible way of formulating the 
finite element equations but it is similar to the eigenfunction expansions that we highlighted 
in this chapter. Our approach consists of two steps: First we assume that y(x) can be 
expressed over a particular element by 


y(z) = X we;(z), (6.6.4) 


where o;(x) is the jth approrimation or shape function, and J is the total number of 
elements. 

Let us define the residue 
d 
dx 


R(x) = 


(ay SE] + ew) ut. (6.6.5) 


We now require that for each element along the segment Qe = (£n, £n+1), 
| R(x)y;(x) dx = 0, a hs eee A (6.6.6) 
Qe 


The points rz, and x,+, are known as nodes. Substituting Equation 6.6.5 into Equation 
6.6.6, 


2 dy l P RAT 
n { da: LE w] + q(z)y(z) — voco) vila) dx = 0. (6.6.7) 
Because 
i E Lr d qi(x) dz = -p(2) Zila) a + [woes dz, (6.6.8) 
then 
[p BRE eol- ae = vlay Hoses]. 6.6.9) 


Upon using Equation 6.6.4 to eliminate y(x) and reversing the order of summation and 
integration, our second step in the finite element method involves solving for y; via 


EC de) E) dx 4 rd a(x) pi(x)p; (x) dx 


n 


j=l 
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or using matrix notation 


Ky —AMy = b, (6.6.11) 
where 
Kı Kye Kj Mii Miz Mj 
Ko, Ko ... Ks» Mz, Mo. ... Maz 
K= : : f ; , M= : : : . (6.6.12) 
Kj Kye Kj; Mj Mm M;; 
by yi 
b2 y2 
b= "E EE (6.6.13) 
b; YJ 
i= P(zn-1)V (n1)9i(2n41) — p(tn)y' (xn)vei(zs), (6.6.14) 
Tul dy;(x) dp; (x cun 
Kij «f p(x) s ) n ) s 7 q(x)pi(z)o;(x) dz, (6.6.15) 
and 
Zn4l 
Mij =| r(z)oi(x)o;(x) da. (6.6.16) 


Why do we prefer to use Equation 6.6.10 rather than Equation 6.6.7? There are 
two reasons. First, it offers a convenient method for introducing the specified boundary 
conditions, Equation 6.6.2 and Equation 6.6.3. Second, it has lowered the highest-order 
derivatives from a second to a first derivative. This yields the significant benefit that q;(x) 
must only be continuous but not necessarily a continuous slope at the nodes. 

An important question is how we will evaluate the integrals in Equation 6.6.15 and 
Equation 6.6.16. Because p(x), g(x), and r(x) are known, we could substitute these quanti- 
ties along with y;(x) and y;(x) into Equation 6.6.15 and Equation 6.6.16 and perform the 
integration, presumably numerically. In a similar vein, we could develop curve fits for p(x), 
q(x), and r(x) and again perform the integrations. However, we simply use their values at 
the midpoint between the nodes, Zn = (£n+1 + z4)/2, because p(x), q(x) and r(x) usually 
vary slowly over the interval (£n, 2541). 

At this point we will specify J. The simplest case is J = 2 and we have the linear 


element: 


Xo — Tı T2 — T 


gız) = and plx) = ; (6.6.17) 
Ta — Zi X2 —231 


where x; and z» are local nodal points located at the end of the element. It directly follows 


that 3 3 
y v1 p2 Y2 — Yı 
= t = ; 6.6.18 
da dx a dx Y2 £2 — Tı ( ) 


In other words, dy/dx equals the slope of the straight line connecting the nodes. Similarly, 


T y(x) da = $ (ya + y1) (a — 21) (6.6.19) 


1 


and we simply have the trapezoidal rule. 
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Table 6.6.1: The System Topology for 4 Finite-Element Segmentations When a Linear 
Interpolation Is Used 


Node Numbers 


Element Local Global 
1 1 1 
2 2 
2 1 2 
2 3 
3 1 3 
2 4 
4 1 4 
2 5 


Substituting c1(r) and y2(x) into Equation 6.6.15 and Equation 6.6.16 and carrying 
out the integration, we obtain 


Te Te L Xe Te L 
Kı = P(e) + alze) Q K= plz) + alae) » Ka =K, and Kz = Kı, 
L 3 L 6 
(6.6.20) 
with L = z9 — xı and x, = (£1 + z3)/2. Similarly, 
c)L c)L 
Mii = a. = Moə2, and Miz = ree) = M. (6.6.21) 


Finally, because q1(x4) = 0, yi(#2) = 1, q»(z1) = 1, and gs(z5) = 0, bı = —p(zi)y'(21) 
and b» = p(x2)Yy2 (x2). 

Having obtained the finite element representation for nodes 1 and 2, we would like to 
extend these results to an arbitrary number of additional nodes. This is done by setting up 
a look-up table that relates the global nodal points to the local ones. For example, suppose 
we would like 5 nodes between a and b with z = x, £2, £3, £4, and x5. Then Table 6.6.1 
illustrates our look-up table. 

Having developed the spatial layout, we are now ready to assemble the matrix for the 
entire interval (a, b). For clarity we will give the intermediate steps. Taking the first element 
into account, 


KY KY 000 MË MS 0 0 0 
KS) KY 000 MO MS) 0 0 0 
K-2| 0 0 000|. M=] 0 0 00 0]; (6.6.22) 
0 0 000 0 0 000 
0 0 000 0 0 000 
—p(x1)y' (21) yı 
p(x2)y' (x2) yo 
b= 0 , and y=] 0 (6.6.23) 
0 0 
0 0 
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Here we have added a subscript (1) to Kij and Mj; to denote that value for the first element 
should be used in computing p(zc«), q(z«), r(z.), and L. Consequently, when we introduce 
the second element, K, M, and y become 


KY KY 0 0 0 
1 1 2 2 
K=| ọ Ko) KƏ 0 op (6.6.24) 
0 0 0 00 
0 0 0 00 
MY MẸ 0 00 
1 1 2 2 
My) Mà +My? My, 0 0 
M= 0 M M9 o ofj: (6.6.25) 
0 0 0 00 
0 0 0 00 
—p(zi)y' (21) yı 
0 Y2 
b=] p(zs)y'(wz) |; and y=] ys |. (6.6.26) 
0 0 
0 0 


Note that the numbering for y; corresponds to ith global node number. Continuing with 
this process of adding additional elements to the system matrix, we finally have 


KO KY 0 0 0 
1 1 2 2 
R30 ke KË + KË K® 0 |, (6.6.27) 
3 3 4 4 
0 0 0 KË KY 
MX MQ 0 0 0 
MP Qr — Mp 0 0 
M-| 0 Me M + MO M o s (6.6.28) 
0 0 MẸ My) + Mi) Mj 
0 0 0 Mi) MO 
where ; 
—p(azi)y' (21) yı 
0 Y2 
b= 0 , and y=] y|. (6.6.29) 
0 VA 
p(zs)y' (zs) Us 


Let us examine the b vector more closely. In the final form of the finite element formu- 
lation, b has non-zero values only at the end points; the contributions from intermediate 
nodal points vanish because p(z)y'(x) is continuous within the interval (a,b). Furthermore, 
if y'(a) = y'(b) = 0 from the boundary conditions, then the b vector becomes the zero 
vector. On the other hand, if y(a) = y(b) = 0, then y; = ys = 0 and the eigenvalue problem 
involves a 3 x 3 matrix with the unknowns y2, ys, and y4. Similarly, if y'(a) = y(b) = 0, 
then we have a 4 x 4 matrix with the unknowns yi, yo, ys, y4, and ys = 0. Finally, if 
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Table 6.6.2: The Lowest Eigenvalue for Equation 6.6.30, Which Is Solved Using a Finite 
Element Method 


L À 
0.250 10.6745 
0.100 10.2335 
0.050 10.1717 
0.020 10.1544 
0.010 10.1520 
0.002 10.1512 


y(a) = y'(b) = 0, we again have a 4 x 4 matrix involving yı = 0 and the unknowns y2, ys, 


ya, and ys. 
To illustrate this scheme, consider the Sturm-Liouville problem 


y'-(A-a?)y-0, O<zr<l (6.6.30) 


with y(0) = y(1) = 0. Here p(x) = 1, g(x) = z?, and r(z) = 1. 
The MATLAB begins with the choice of the number of elements, N. Once that is done, 
L immediately follows because L = 1/(N-1). We will also need to have the value of x at 
the node points x(n) = L*(n-1) where n = 1:N. 
With these preliminaries out of the way, we begin by setting up the matrices K and 
M given by Equation 6.6.27 and Equation 6.6.28. The corresponding MATLAB code is 
for i - 1:N-1 
xc = 0.5*(x(i) + x(i*1)); 
p= 1; q = x-c*x_c; r = 1; 
K11 = p/L + q*L/3; K22 = K 11; 
K12 = -p/L + q«L/6; K21 = K 12; 
M11 = r*L/3; M22 = M.11; 
M.12 = r*L/6; M_21 = M.12; 


KK(C i, i ) = KKC i , i) + K_11; 
KK( i ,i*1) = KKC i ,it1) + K_12; 
KK(it1, i ) = KK(i+1, i) + K_21; 
KK(it1,i+1) = KK(i+1,i+1) + K_22; 
MMC i, i) = MMC i ,i) + M411; 
MMC i ,i*1) = MMC i ,it1) + M_12; 
MM(i*1, i ) = MM(i+1, i ) + M21; 
MM(it1,i+1) = MM(i*i,i*1) + M_22; 
end 


Note that the arrays KK and MM have already been defined as N x N arrays with all of their 
elements set to zero. 
Finally, because y; and yw are zero, we must extract that portion of K and M for 

which ym 4 0. This is done as follows: 
for j = 1:N-2 
for i = 1:N-2 

AG,j = KK(it1,j+1); 

B(i,j) = MM(it1,j+1); 
end; end 
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02 04 06 08 1 

x 
Figure 6.6.1: Numerical solution of y" -- 2y' +y = 2z 4-3sin(z) with y’(0) = —2 and y'(1) = 3 using finite 
elements with Az — 0.1. The crosses indicate the exact solution. 


Finally, the eigenvalues are found by eig(A,B). If the corresponding eigenfunction is 
desired, then the corresponding eigenfunction gives y; for 7 = 2,3,..., N —1 using Equation 
6.6.4. Table 6.6.2 illustrates how the lowest eigenvalue for Equation 6.6.30 improves in 
accuracy as the number of nodes is increased. 


Project: Finite Element Solution of Boundary-Value Problems 


In addition to solving the Sturm-Liouville problem, finite element methods can be used 
to solve the standard boundary-value problem: 


da Ld +q(r)y=f(x), O<#<1, 


where we specify y(0) or y'(0) at zx = 0 and y(1) or y'(1) at x = 1. Although you could 
create your MATLAB code to solve this problem, MATLAB code has already been developed 
and is available online.*! The purpose of this project is for you to become comfortable using 
this scheme. 


Step 1: Using the method of undetermined coefficients, solve the boundary-value problem 
y” +2y' *ty-—2r-3sin(x)  y'(0)=-2, y(1)=3. 
Show that 


y(x) = 2x — 4 — 3 cos(x) + 1 [3sin(1) — 2] e! * + 1 [Bsin(1) — 2 — 8/e] xe! *. 


Step 2: Show that the ordinary differential equation can be written as 


d d 
m (ot) + ey = 2xe?* + 3e?* sin(x), y(0 2-2, y/(1)=3. 


Step 3: Find the numerical solution of the boundary-value problem using finite elements. 
Figure 6.6.1 illustrates the solution. 


31 For example, http://people.sc.fsu.edu/^burkardt/m src/femid/femid.html 
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SOLUTION 


Chapter 7 
The Wave Equation 


In this chapter we will study problems associated with the equation 


u u 
uu OU 7.0.1 
0m ^ Aa?’ Tp 
where u = u(x,t), x and t are the two independent variables, and c is a constant. This 
equation, called the wave equation, serves as the prototype for a wider class of hyperbolic 
equations 


Oru Oru o?u ðu Qu 
a(x, t) zzz + b(z, t) dxdt ] c(z, t) EIZ EE rtu Ox’ a) , (7.0.2) 


where b? > 4ac. It arises in the study of many important physical problems involving 
wave propagation, such as the transverse vibrations of an elastic string and the longitudinal 
vibrations or torsional oscillations of a rod. 


7.1 THE VIBRATING STRING 


The motion of a string of length L and constant density p (mass per unit length) is 
a simple example of a physical system described by the wave equation. See Figure 7.1.1. 
Assuming that the equilibrium position of the string and the interval [0, L] along the x-axis 
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u(x,t)! 


0 


Figure 7.1.1: The vibrating string. 


coincide, the equation of motion that describes the vertical displacement u(x, t) of the string 
follows by considering a short piece whose ends are at x and «+ Ax and applying Newton’s 
second law. 

If we assume that the string is perfectly flexible and offers no resistance to bending, 
Figure 7.1.1 shows the forces on an element of the string. Applying Newton’s second law 
in the x-direction, the sum of forces equals 


—T (x) cos(o1) + T(x + Ax) cos(az), (7.1.1) 


where T(x) denotes the tensile force. If we assume that a point on the string moves only 
in the vertical direction, the sum of forces in Equation 7.1.1 equals zero and the horizontal 
component of tension is constant: 


—T (x) cos(o4) + T(x + Ax) cos(a3) = 0, (7.1.2) 


and 
T(x) cos(o1) = T(x + Ax) cos(az) = T, a constant. (7.1.3) 


If gravity is the only external force, Newton’s law in the vertical direction gives 


82 

—T (x) sin(o4) + T(z + Az) sin(ag) — mg = mo (7.1.4) 

where wz; is the acceleration. Because 

T T 
T(x) = —.—., and T(r+Az)= , 7.1.5 
(2) cos(a1) si ATAN cos(@2) ( ) 
then 
Oru 

—T tan(a,) + T tan(a2) — pgAx = PATT: (7.1.6) 


The quantities tan(aı) and tan(a2) equal the slope of the string at x and x + Az, respec- 
tively; that is, 


t Aat 
dme ud mej OA (7.1.7) 
Ox Ox 
Substituting Equation 7.1.7 into Equation 7.1.6, 
Ou(a + Az,t) u(x,t) 0?u 
T = pz | — ; 7.1. 
Ox Ox PaT jog Ng Vo) 
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After dividing through by Az, we have a difference quotient on the left: 


T [Ou(r-Az,t) Ou(z,t)|  [(90?u 
Az Ba 5: |m yore) 


In the limit as Ax — 0, this difference quotient becomes a partial derivative with respect 
to z, leaving Newton's second law in the form 


^u u 
Taz = Pag + pg, (7.1.10) 


Or 
u 18u g 
= 7.1.11 
Ox? e ot? i e?’ ( ) 
where c? = T/p. Because ut is generally much larger than g, we can neglect the last term, 
giving the equation of the vibrating string as 


2 2 
PL RAN (7.1.12) 
Or? cO 

Equation 7.1.12 is the one-dimensional wave equation. 

As a second example! we derive the threadline equation, which describes how a thread 
composed of yarn vibrates as we draw it between two eyelets spaced a distance L apart. 
We assume that the tension in the thread is constant, the vibrations are small, the thread 
is perfectly flexible, the effects of gravity and air drag are negligible, and the mass of the 
thread per unit length is constant. Unlike the vibrating string between two fixed ends, we 
draw the threadline through the eyelets at a speed V so that a segment of thread experiences 
motion in both the x and y directions as it vibrates about its equilibrium position. The 
eyelets may move in the vertical direction. 

From Newton's second law, 


d dy\ | 
5 (mG) = 5 forces, (7.1.13) 


where m is the mass of the thread. But 


= + . (7.1.14) 


Because dz/dt = V, 


= = ae 7.1.1 
d m Oe” ats) 
mE d d o Ó o o Ó o 
UY | V, y y y 
3i (n a) = a" (3 | va) + va (3 + væ]. (7.1.16) 
Because both m and V are constant, it follows that 
d dy\ _ O7y 0?y l 20°y 
5 (mo) =ma5 "Vai + mV DE (7.1.17) 


1 See Swope, R. D., and W. F. Ames, 1963: Vibrations of a moving threadline. J. Franklin Inst., 275, 
36-55. 
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The sum of the forces again equals 


0?y 

so that the threadline equation is 

0?y Ory oy 28y 

T3 zAr= mg 1 2mV 3 8i FE mV ag (7.1.19) 

m 2 2 2 

Ory Ory 2 JIT \ Oy 

— +2 ———j—-— 7.1.20 

om * V ond b (v p ) Ox? 0, ( ) 


where p is the density of the thread. Although Equation 7.1.20 is not the classic wave 
equation given in Equation 7.1.12, it is an example of a hyperbolic equation. As we shall see, 
the solutions to hyperbolic equations share the same behavior, namely, wave-like motion. 


7.2 INITIAL CONDITIONS: CAUCHY PROBLEM 


Any mathematical model of a physical process must include not only the governing 
differential equation but also any conditions that are imposed on the solution. For exam- 
ple, in time-dependent problems the solution must conform to the initial condition of the 
modeled process. Finding those solutions that satisfy the initial conditions (initial data) is 
called the Cauchy problem. 

In the case of partial differential equations with second-order derivatives in time, such 
as the wave equation, we correctly pose the Cauchy boundary condition if we specify the 
value of the solution u(x, to) = f(t) and its time derivative u(x, to) = g(t) at some initial 
time to, usually taken to be to = 0. The functions f(t) and g(t) are called the Cauchy data. 
We require two conditions involving time because the differential equation has two time 
derivatives. 

In addition to the initial conditions, we must specify boundary conditions in the spa- 
tial direction. For example, we may require that the end of the string be fixed. In the 
next chapter, we discuss boundary conditions in greater depth. However, one boundary 
condition that is uniquely associated with the wave equation on an open domain is the 
radiation condition. It requires that the waves radiate off to infinity and remain finite as 
they propagate there. 

In summary, Cauchy boundary conditions, along with the appropriate spatial boundary 
conditions, uniquely determine the solution to the wave equation; any additional information 
is extraneous. Having developed the differential equation and initial conditions necessary to 
solve the wave equation, let us now turn to the actual methods used to solve this equation. 


7.3 SEPARATION OF VARIABLES 


Separation of variables is the most popular method for solving the wave equation. 
Despite its current widespread use, its initial application to the vibrating string problem 
was controversial because of the use of a half-range Fourier sine series to represent the initial 
conditions. On one side, Daniel Bernoulli claimed (in 1775) that he could represent any 
general initial condition with this technique. To d'Alembert and Euler, however, the half- 
range Fourier sine series, with its period of 2L, could not possibly represent any arbitrary 
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function. However, by 1807 Bernoulli was proven correct by the use of separation of 
variables in the heat conduction problem and it rapidly grew in acceptance.? In the following 
examples we show how to apply this method. 

Separation of variables consists of four distinct steps that convert a second-order par- 
tial differential equation into two ordinary differential equations. First, we assume that 
the solution equals the product X(x)T(t). Direct substitution into the partial differential 
equation and boundary conditions yields two ordinary differential equations and the corre- 
sponding boundary conditions. Step two involves solving a boundary-value problem of the 
Sturm-Liouville type. In step three we find the corresponding time dependence. Finally we 
construct the complete solution as a sum of all product solutions. Upon applying the initial 
conditions, we have an eigenfunction expansion and must compute the Fourier coefficients. 
The substitution of these coefficients into the summation yields the complete solution. 


e Example 7.3.1 


Let us solve the wave equation for the special case when we clamp the string at z — 0 
and « = L. Mathematically, we find the solution to the wave equation 


Dres Dae. 0<t, (7.3.1) 
which satisfies the initial conditions 
u(x,0) = f(x), 24.0) —g(xr, O0<a<L, (7.3.2) 
and the boundary conditions 
u(0,t) = u(L,t)=0, 0<t. (7.3.3) 


For the present, we leave the Cauchy data quite arbitrary. 
We begin by assuming that the solution u(x,t) equals the product X(x)T(t). (Here T 
no longer denotes tension.) Because 


"L 
op = X(z)T" (t), (7.3.4) 
and ga 
PE = X"(x)T(t), (7.3.5) 
the wave equation becomes 
ex" T=T" Xx, (7.3.6) 
or Xx" T" 
cn e 7.3.7 
X eT ( ) 


2 See Hobson, E. W., 1957: The Theory of Functions of a Real Variable and the Theory of Fourier’s 
Series, Vol. 2. Dover Publishers, Sections 312-314. 


3 Lützen, J., 1984: Sturm and Liouville’s work on ordinary linear differential equations. The emergence 
of Sturm-Liouville theory. Arch. Hist. Exact Sci., 29, 317. 
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after dividing through by c? X (z)T(t). Because the left side of Equation 7.3.7 depends only 
on x and the right side depends only on t, both sides must equal a constant. We write this 
separation constant — A and separate Equation 7.3.7 into two ordinary differential equations: 


T" E AT — 0, 0 « t, (7.3.8) 


and 
X" E AX — 0, 0c z«L. (7.3.9) 


We now rewrite the boundary conditions in terms of X (x) by noting that the boundary 


conditions become 
u(0,t) = X(0)T(t) = 0, (7.3.10) 


and 
u(L,t) = X(L)T(t) =0 (7.3.11) 


for 0 < t. If we were to choose T(t) = 0, then we would have a trivial solution for u(x, t). 


Consequently, 
X(0) = X(L) = 0. (7.3.12) 
This concludes the first step. 
In the second step we consider three possible values for A: A < 0, A = 0, and A > 0. 


Turning first to \ < 0, we set \ = —m? so that square roots of A will not appear later on 
and m is real. The general solution of Equation 7.3.9 is 


X(x) = Acosh(mz) + B sinh(mz). (7.3.13) 


Because X(0) = 0, A = 0. On the other hand, X(L) = Bsinh(mL) = 0. The function 
sinh(mL) does not equal to zero since mL # 0 (recall m > 0). Thus, B = 0 and we have 
trivial solutions for a positive separation constant. 

If \ = 0, the general solution now becomes 


X(x) 2 C + Dz. (7.3.14) 


The condition X(0) = 0 yields C = 0 while X(L) = 0 yields DE = 0 or D = 0. Hence, we 
have a trivial solution for the A = 0 separation constant. 
If \ = k? > 0, the general solution to Equation 7.3.9 is 


X(x) = Ecos(kx) + F sin(kz). (7.3.15) 


The condition X(0) = 0 results in E = 0. On the other hand, X(L) = Fsin(kL) = 0. 
If we wish to avoid a trivial solution in this case (F Z 0), sin(kL) = 0, or kn = nm/L, 
and A, = n?n? /L?. The z-dependence equals X,(r) = F,sin(nzz/L). We added the n 
subscript to k and A to indicate that these quantities depend on n. This concludes the 
second step. 

Turning to Equation 7.3.8 for the third step, the solution to the T(t) equation is 


Tn (t) = Gn cos(k,ct) + Hn sin(k,ct), (7.3.16) 


where G,, and H,, are arbitrary constants. For each n = 1,2,3,..., a particular solution 
that satisfies the wave equation and prescribed boundary conditions is 


t 
us (zx, t) = F, sin( 2) fen cos( 27%) TH, sin( )} (7.3.17) 
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Or 


t 
us (z,t) = sin(77) |^ cos( 272) t B, sin( 27 y (7.3.18) 


where An = F,,G, and B, = F,,H,. This concludes the third step. 

An equivalent method of finding the product solution is to treat Equation 7.3.9 along 
with X(0) = X(L) = 0 as a Sturm-Liouville problem. In this method we obtain the 
spatial dependence by solving the Sturm-Liouville problem and finding the corresponding 
eigenvalues Àn and eigenfunctions. Next we solve for T,,(t). Finally we form the product 
solution un (x,t) by multiplying the eigenfunction times the temporal dependence. 

For any choice of A, and B,, Equation 7.3.18 is a solution of the partial differential 
equation, Equation 7.3.1, also satisfying the boundary conditions, Equation 7.3.3. There- 
fore, any linear combination of u, (x, t) also satisfies the partial differential equation and the 
boundary conditions. In making this linear combination we need no new constants because 
An and B,, are still arbitrary. We have, then, 


dd ys (272) | cos( 27%) PE; x (S5 (7.3.19) 


Our method of using particular solutions to build up the general solution illustrates 
the powerful principle of linear superposition, which is applicable to any linear system. 
This principle states that if u; and uz are any solutions of a linear homogeneous partial 
differential equation in any region, then u = cu4 + cou» is also a solution of that equation 
in that region, where cı and c9 are any constants. We can generalize this to an infinite 
sum. It is extremely important because it allows us to construct general solutions to partial 
differential equations from particular solutions to the same problem. 

Our fourth and final task remains to determine A, and B,. At t — 0, 


u(a,0) = P» An sin(777) = f(z), (7.3.20) 
and M" 
u(x, 0) = Z MEB, sin( 2) = g(z). (7.3.21) 


Both of these series are Fourier half-range sine expansions over the interval (0, L). Applying 
the results from Section 5.3, 


A -2 f jesin( 2) a (7.3.22) 
^— rj, z)sin| -7 x, 3. 
and " 
NTC 2 NTL 
TTE p ET inf 79 qe, 7.3.23 
L B a g(x) sin( L ) dz (7.3.23) 
Or L 
2 NTE 
n = — in| —— : 7.3.24 
B er d g(x) sin( F ) dz ( ) 


At this point we might ask ourselves whether the Fourier series solution to the wave equation 
always converges. For the case g(a) = 0, Carslaw* showed that if the initial position of the 


^ Carslaw, H. S., 1902: Note on the use of Fourier's series in the problem of the transverse vibrations of 
strings. Proc. Edinburgh Math. Soc., Ser. 1, 20, 23-28. 
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string forms a curve so that f(x) or the slope f'(x) is continuous between x = 0 and x = L, 
then the series converges uniformly. 
As an example, let us take the initial conditions 


0, 0<a<L/4, 
4h (€ — 1), L/4< z < L/2, 
f(x) = (z 4) (7.3.25) 
4h ($ — £), L/2 € x € 3L/A, 
0, ADAE x< L, 
and 
g(x) = 0, 0 «z«L. (7.3.26) 


In this particular example, Bn = 0 for all n because g(a) = 0. On the other hand, 
8h [P f(x 1 nca 8h [97^ (3 zx NTE 
iga in(77) dz 4 in( ZE) de (7.3.27 
Id (G i) sin L £ D G z) sin L x£ ) 


a |o) -s in( z) sin( (2) 7.3.28) 
r) 
te 


= as [i F) 299 (5 ut | 7.3.20) 
= ae) Eom 1 m Cr) (m). 7.3.30) 


because sin(A) + sin(B) = 2sin[į (A + B)]cos|4(A — B)], and 1 — cos(2A) = 2sin?(A). 
Therefore, 


u(x,t) = 3 sin(*) sin? (E) E sin(“**) cos( 272) : (7.3.31) 


Because sin(n7/2) vanishes for n even, so does A,. If Equation 7.3.31 were evaluated on 
a computer, considerable time and effort would be wasted. Consequently it is preferable to 
rewrite Equation 7.3.31 so that we eliminate these vanishing terms. The most convenient 
method introduces the general expression n — 2m — 1 for any odd integer, where m — 
1,2,3,..., and notes that sin|(2m — 1)1/2] = (—1)"**!. Therefore, Equation 7.3.31 becomes 


ids a S des yet o. Ten 2z sin] Om Rd cos P= Dzet l 


(2m 12 ^^ L L 
=1 
(7.3.32) 
Although we completely solved the problem, it is useful to rewrite Equation 7.3.32 as 
u(z, t) H7 [7 t)| + = ( + ct)] (7.3.33) 
u(x, 22. sin -p (s-e sin [ (s +e .3. 


through the application of the trigonometric identity sin(A)cos(B) = }sin(A —B) + 
4 sin(A + B). From general physics we find expressions like sin[k,(« — ct)] or sin(ka — wt) 
arising in studies of simple wave motions. The quantity sin(kx — wt) is the mathematical 
description of a propagating wave in the sense that we must move to the right at the speed 
c if we wish to keep in the same position relative to the nearest crest and trough. The 
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cL = 1 
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Bh cUL = 0.8 


ct/L = 0.6 


cL = 0.4 


u(x,t)/h 
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xL 

Figure 7.3.1: The vibration of a string u(z,t)/h at various positions z/L at the times ct/L = 0, 0.2, 0.4, 

0.6, 0.8, and 1. For times 1 « ct/L « 2 the pictures appear in reverse time order. 


quantities k, w, and c are the wavenumber, frequency, and phase speed or wave velocity, 
respectively. The relationship w = kc holds between the frequency and phase speed. 

It may seem paradoxical that we are talking about traveling waves in a problem dealing 
with waves confined on a string of length L. Actually we are dealing with standing waves 
because at the same time that a wave is propagating to the right its mirror image is running 
to the left so that there is no resultant progressive wave motion. Figures 7.3.1 and 7.3.2 
illustrate our solution. Figure 7.3.1 gives various cross sections. The single large peak at 
t = 0 breaks into two smaller peaks that race towards the two ends. At each end, they 
reflect and turn upside down as they propagate back towards x = L/2 at ct/L = 1. This 
large, negative peak at « = L/2 again breaks apart, with the two smaller peaks propagating 
towards the endpoints. They reflect and again become positive peaks as they propagate 
back to x = L/2 at ct/ L = 2. After that time, the whole process repeats itself. 

MATLAB can be used to examine the solution in its totality. The script 


^4 set parameters for the calculation 
clear; M = 50; dx = 0.02; dt = 0.02; 
^4 compute Fourier coefficients 
sign = 32; 
for m = 1:M 
tempi = (2*m-1)*pi; temp2 = sin(temp1/8) ; 
a(m) = sign * temp2 * temp2 / (tempi * tempi); 
sign - -sign; 
end 
^ compute grid and initialize solution 
X [0:dx:1]; T = [0:dt:2]; 
u = zeros(length(T) ,length(X)) ; 
XX = repmat(X,[length(T) 11); 
TT = repmat(T?,[1 length(X)]); 
^4 compute solution from Equation 7.3.32 
for m = 1:M 
tempi = (2*m-1)*pi; 
u = u + a(m) .* sin(temp1*XX) .* cos(temp1*TT) ; 
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Figure 7.3.2: Two-dimensional plot of the vibration of a string u(x, t)/h at various times ct/L and positions 
z/L. 


end 

^ plot space/time picture of the solution 

surf (XX,TT,u) 

xlabel(?DISTANCE!,?Fontsize?,20); ylabel(’TIME’,’Fontsize’ ,20) 
zlabel(’SOLUTION’ , ’Fontsize’ , 20) 


gives a three-dimensional view of Equation 7.3.32. The solution can be viewed in many 
different prospects using the interactive capacity of MATLAB. 

An important dimension to the vibrating string problem is the fact that the wavenum- 
ber kn is not a free parameter but has been restricted to the values of nz/L. This restriction 
on wavenumber is common in wave problems dealing with limited domains (for example, a 
building, ship, lake, or planet) and these oscillations are given the special name of normal 
modes or natural vibrations. 

In our problem of the vibrating string, all of the components propagate with the same 
phase speed. That is, all of the waves, regardless of wavenumber kn, move the characteristic 
distance cAt or —cAt after the time interval At elapsed. In the next example we will see 
that this is not always true. 


e Example 7.3.2: Dispersion 


In the preceding example, the solution to the vibrating string problem consisted of two 
simple waves, each propagating with a phase speed c to the right and left. In many problems 
where the equations of motion are a little more complicated than Equation 7.3.1, all of the 
harmonics no longer propagate with the same phase speed but at a speed that depends 
upon the wavenumber. In such systems the phase relation varies between the harmonics 
and these systems are referred to as dispersive. 

A modification of the vibrating string problem provides a simple illustration. We now 
subject each element of the string to an additional applied force that is proportional to its 


displacement: 
Ou 0u 
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where h > 0 is constant. For example, if we embed the string in a thin sheet of rubber, 
then in addition to the restoring force due to tension, there is a restoring force due to the 
rubber on each portion of the string. From its use in the quantum mechanics of “scalar” 
mesons, Equation 7.3.34 is often referred to as the Klein-Gordon equation. 

We shall again look for particular solutions of the form u(«#,t)=X(x)T(t). This time, 
however, 


XT" =CX"T +AXT =0, (7.3.35) 

Dr T" h X 
Mu exe 7.3.36 
cT c X ( ) 


which leads to two ordinary differential equations 
X" +AX — 0, (7.3.37) 
and 
T" + (Ac? +h)T — 0. (7.3.38) 
If we attach the string at x = 0 and z = L, the X(x) solution is 


X.(z) = sin(“**) (7.3.39) 


with kn = n/L, and A, = n?z?/L?. On the other hand, the T(t) solution becomes 
T (t) = An cos( kii it) 4B, sin( keh ) (7.3.40) 
so that the product solution is 
us (x,t) = sin(^77) |^ cos( k2c? + At) + Bn sin( k2c? +h ) : (7.3.41) 
Finally, the general solution becomes 


u(z,t) = jn sin(^77) An cos( kii ht) + B, sin( k24 io) (7.3.42) 


from the principle of linear superposition. Let us consider the case when Bn = 0. Then we 
can write Equation 7.3.42 as 


u(x,t) = Y m inta + Ki ht) + sin(k,z - Ske +h ) . (7.3.43) 


n=1 


Comparing our results with Equation 7.3.33, the distance that a particular mode kn moves 
during the time interval At depends not only upon external parameters such as h, the tension 
and density of the string, but also upon its wavenumber (or equivalently, wavelength). 
Furthermore, the frequency of a particular harmonic is larger than that when h = 0. This 
result is not surprising, because the added stiffness of the medium should increase the 
natural frequencies. 

The importance of dispersion lies in the fact that if the solution u(x,t) is a superpo- 
sition of progressive waves in the same direction, then the phase relationship between the 
different harmonics changes with time. Because most signals consist of an infinite series 
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Figure 7.3.3: The vibration of a string u(z, t)/h embedded in a thin sheet of rubber at various positions 
x / L at the times ct/ L = 0, 0.2, 0.4, 0.6, 0.8, and 1 for hL?/c? = 10. The same parameters were used as in 
Figure 7.3.1. 


of these progressive waves, dispersion causes the signal to become garbled. We show this 
by comparing the solution, Equation 7.3.42 given in Figures 7.3.3 and 7.3.4 for the initial 
conditions, Equation 7.3.25 and Equation 7.3.26, with hL?/c? = 10, to the results given in 
Figures 7.3.1 and 7.3.2. In the case of Figure 7.3.4, the MATLAB script line 

u = u + a(m) .* sin(tempi*XX) .* cos(temp1*TT) ; 

has been replaced with 

temp2 = tempi * sqrt(1 + H/(tempi*tempi)); 

u = u + a(m) .* sin(tempi*XX) .* cos(temp2*TT) ; 

where H = 10 is defined earlier in the script. Note how garbled the picture becomes at 
ct/ L = 2 in Figure 7.3.4 compared to the nondispersive solution at the same time in Figure 
7.9.2. 


e Example 7.3.3: Damped wave equation 


In the previous example a slight modification of the wave equation resulted in a wave 
solution where each Fourier harmonic propagates with its own particular phase speed. In 
this example we introduce a modification of the wave equation that results not only in 
dispersive waves but also in the exponential decay of the amplitude as the wave propagates. 

So far we neglected the reaction of the surrounding medium (air or water, for example) 
on the motion of the string. For small-amplitude motions this reaction opposes the motion 
of each element of the string and is proportional to the element's velocity. The equation of 
motion, when we account for the tension and friction in the medium but not its stiffness or 
internal friction, is 


o?u ðu u 
Ot? Ot ^ Og? 


Because Equation 7.3.44 first arose in the mathematical description of the telegraph,? it is 


O0<a<L, 0<t. (7.3.44) 


5 The first published solution was by Kirchhoff, G., 1857: Über die Bewegung der Electritat in Drahten. 
Ann. Phys. Chem., 100, 193-217. English translation: Kirchhoff, G., 1857: On the motion of electricity 
in wires. Philos. Mag., Ser. 4, 13, 393-412. 
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Figure 7.3.4: The two-dimensional plot of the vibration of a string u(#,t)/h embedded in a thin sheet of 
rubber at various times ct/L and positions x/L for hL?/c? = 10. 


generally known as the equation of telegraphy. The effect of friction is, of course, to damp 
out the free vibration. 

Let us assume a solution of the form u(x,t) = X(x)T(t) and separate the variables to 
obtain the two ordinary differential equations: 


X" +X — 0, (7.3.45) 


and 
T" +2hT' +AT — 0 (7.3.46) 


with X(0) = X(L) = 0. Friction does not affect the shape of the normal modes; they are 
still 


X,(z) = sin( 777) (7.3.47) 
with kn = n7/L and A, = n?n? / L. 
The solution for the T(t) equation is 
T,(t) = e^"! | A, cos( kg — h? t) + B, sin( kic t) (7.3.48) 


with the condition that kne > h. If we violate this condition, the solutions are two ex- 
ponentially decaying functions in time. Because most physical problems usually fulfill this 
condition, we concentrate on this solution. 

From the principle of linear superposition, the general solution is 


u(x,t) = e^" y ssi (77) [An cos( k2c? — h? t) +B sin( k2c? — h? t)], (7.3.49) 
n=l 


where zc > AL. From Equation 7.3.49 we see two important effects. First, the presence 
of friction slows all of the harmonics. Furthermore, friction dampens all of the harmonics. 
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Figure 7.3.5: The vibration of a string u(z, t)/h with frictional dissipation at various positions z/L at the 
times ct/ L = 0, 0.2, 0.4, 0.6, 0.8, and 1 for hL/c = 1. The same parameters were used as in Figure 7.3.1. 
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Figure 7.3.6: The vibration of a string u(x,t)/h with frictional dissipation at various times ct/L and 
positions z/ L for hL/c = 1. 


Figures 7.3.5 and 7.3.6 illustrate the solution using the initial conditions given by Equation 
7.3.25 and Equation 7.3.26 with hL/c = 1. In the case of Figure 7.3.6, the script line that 
produced Figure 7.3.2: 
u = u + a(m) .* sin(tempi*XX) .* cos(temp1*TT) ; 
has been replaced with 
temp2 = tempi * sqrt(1 - (H*H)/(temp1i*temp1)) ; 
u = u + a(m) .* exp(-H*TT) .* sin(tempi*XX) .* cos(temp2*TT); 
where H = 1 is defined earlier in the script. Because this is a rather large coefficient of 
friction, Figures 7.3.5 and 7.3.6 exhibit rapid damping as well as dispersion. 

'This damping and dispersion of waves also occurs in solutions of the equation of teleg- 
raphy where the solutions are progressive waves. Because early telegraph lines were short, 
time delay effects were negligible. However, when engineers laid the first transoceanic cables 
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Table 7.3.1: Technological Innovation on Transatlantic Telegraph Cables 
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Year Technological Innovation Performance 
(words/min) 
1857-58 Mirror galvanometer 3-7 
1870 Condensers 12 
1872 Siphon recorder 17 
1879 Duplex 24 
1894 Larger diameter cable 72-90 
1915-20 Brown drum repeater and Heurtley 100 
magnifier 
1923-28 Magnetically loaded lines 300—320 
1928-32 Electronic signal shaping amplifiers 480 
and time division multiplexing 
1950 Repeaters on the continental shelf 100-300 
1956 Repeater telephone cables 21600 


From Coates, V. T., and B. Finn, 1979: A Retrospective Technology Assessment: Submarine Telegraphy. 
The Transatlantic Cable of 1866. San Francisco Press, Inc., 268 pp. 


in the 1850s, the time delay became seconds and differences in the velocity of propagation of 
different frequencies, as predicted by Equation 7.3.49, became noticeable to the operators. 
Table 7.3.1 gives the transmission rate for various transatlantic submarine telegraph lines. 
As it shows, increases in the transmission rates during the nineteenth century were due 
primarily to improvements in terminal technology. 

When they instituted long-distance telephony just before the turn of the twentieth 
century, this difference in velocity between frequencies should have limited the circuits to a 
few tens of miles. However, in 1899, Prof. Michael Pupin at Columbia University showed 
that by adding inductors (“loading coils") to the line at regular intervals the velocities at 
the different frequencies could be equalized.’ Heaviside? and the French engineer Vaschy? 
made similar suggestions in the nineteenth century. Thus, adding resistance and inductance, 
which would seem to make things worse, actually made possible long-distance telephony. 
Today you can see these loading coils as you drive along the street; they are the black 
cylinders, approximately one between each pair of telephone poles, spliced into the telephone 
cable. The loading of long submarine telegraph cables had to wait for the development of 
permalloy and mu-metal materials of high magnetic induction. 


6 Rayleigh, J. W., 1884: On telephoning through a cable. Brit. Assoc. Rep., 632-633; Jordan, D. W., 
1982: The adoption of self-induction by telephony, 1886-1889. Ann. Sci., 39, 433-461. 


7 There is considerable controversy concerning who is exactly the inventor. See Brittain, J. E., 1970: 
The introduction of the loading coil: George A. Campbell and Michael I. Pupin. Tech. Culture, 11, 36-57. 


8 First published 3 June 1887. Reprinted in Heaviside, O., 1970: Electrical Papers, Vol. 2. Chelsea 
Publishing, pp. 119-124. 


9 See Devaux-Charbonnel, X. G. F., 1917: La contribution des ingénieurs frangais à la téléphonie à 


grande distance par cables souterrains: Vaschy et Barbarat. Rev. Gén. Électr., 2, 288-295. 
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e Example 7.3.4: Axisymmetric vibrations of a circular membrane 


'The wave equation 


9? 18 19? 
d IP 0<r<a, 0<t (7.3.50) 


governs axisymmetric vibrations of a circular membrane, where u(r,t) is the vertical dis- 
placement of the membrane, r is the radial distance, t is time, c is the square root of the 
ratio of the tension of the membrane to its density, and a is the radius of the membrane. 
We will solve Equation 7.3.50 when the membrane is initially at rest, u(r, 0) = 0, and struck 
so that its initial velocity is 


Ou(r,0) _ pes O<r<e, (7.3.51) 


Ot 0, e«r«a. 


If this problem can be solved by separation of variables, then u(r,t) = R(r)T(t). 
Following the substitution of this u(r,t) into Equation 7.3.50, separation of variables leads 
to 


1d dR 1 dT 
= — —f? 3.52 
rR dr "ar ) CET dt? : espa) 
or 
ld dR 2 
and 2 
d^T 3 9 


The separation constant —k? must be negative so that we obtain solutions that remain 
bounded in the region 0 € r « a and can satisfy the boundary condition. This boundary 
condition is u(a,t) = R(a)T(t) = 0, or R(a) = 0. 

The solutions of Equation 7.3.53 and Equation 7.3.54, subject to the boundary condi- 
tion, are 


Fa (r) = Jo (=) (7.3.55) 
a 
and 
To = An sa (%2) + B, co (=<) (7.3.56) 


where A, satisfies the equation Jo(A) = 0. Because u(r,0) = 0, and T,(0) = 0, Bn = 
Consequently, the product solution is 


= An’ Anct 
t)- An —— |si : 7.3.57 
u(r, t) > Jo( a ) sin( 2 ) ( ) 
To determine An, we use the condition 
Ou(r,O) «A Anc An? P/ (nep) O<r<e 
— —— An —— = 3 za i 7.3.58 
ot »3 a z(=) l 0, € « r « a. ( ) 
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Figure 7.3.7: The axisymmetric vibrations u'(r,t) = capu(r,t)/P of a circular membrane at various 


positions r/a at the times ct/a = 0, 0.2, 0.4, 0.6, 0.8, and 1 for e = a/4. Initially the membrane is struck 
by a hammer. 


Equation 7.3.58 is a Fourier-Bessel expansion employing the orthogonal function Jo(Anr/a), 


where 2 J T à 
nC nv 
An = d 7.3.5 

a a? J? (An) Jo ol a )' á ( 9) 


from Equation 6.5.38 and Equation 6.5.45 in Section 6.5. Carrying out the integration, 


2PJ1(A,e/a) 
n — ; 7.3.60 
crepà? J? (An) ( ) 
or 
2P Z Jı(ànc/a) Anr\ . [Anct 
— J : 7.3.61 
a i SITER n=1 A2J1 (An) " a m a l : 


Figures 7.3.7, 7.3.8, and 7.3.9 illustrate the solution, Equation 7.3.61, for various times 
and positions when e = a/4, and e = a/20. They were generated using the MATLAB script 


^ initialize parameters 
clear; eps over .a = 0.25; M = 20; dr = 0.02; dt = 0.02; 
^4 load in zeros of J0 
zero( 1) 2.40483; zero( 2) = 5.52008; zero( 3) = 8.65373; 
zero( 4) = 11.79153; zero( 5) = 14.93092; zero( 6) = 18.07106; 
zero( 7) = 21.21164; zero( 8) = 24.35247; zero( 9) = 27.49347; 
zero(10) = 30.63461; zero(11) = 33.77582; zero(12) = 36.91710; 
zero(13) = 40.05843; zero(14) = 43.19979; zero(15) = 46.34119; 
zero(16) = 49.48261; zero(17) = 52.62405; zero(18) = 55.76551; 
zero(19) = 58.90698; zero(20) = 62.04847; 
^4 compute Fourier-Bessel coefficients 
for m= 1:M 

a(m) = 2 * besselj(1,eps over a*zero(m)) 

/ (eps.over a*pi*zero(m)*zero(m)*besselj(1,zero(m))^2); 


end 
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Figure 7.3.8: The axisymmetric vibrations capu(r,t)/P of a circular membrane resulting from an initial 
hammer blow with € = a/4. The solution is plotted at various times ct/a and positions r/a. 


R = [0:dr:1]; [0:dt:4] ; 

u = zeros(length(T) ,length(R)) ; 
RR = repmat(R,[length(T) 11); 
TT = repmat(T?,[1 length(R)]); 


^4 compute solution from series solution 
for m = 1:M 
u * a(m) 


.* besselj(O,zero(m)*RR) .* sin(zero(m)*TT); 
end 

^4 plot results 

surf (RR,TT,u) 

xlabel(’R’,’Fontsize’ ylabel(’TIME’ ,’Fontsize’ 


zlabel( SOLUTION’ , ,20) 


Figures 7.3.8 and 7.3.9 show that striking the membrane with a hammer generates a pulse 
that propagates out to the rim, reflects, inverts, and propagates back to the center. This 
process then repeats forever. 


,20); 
?^"IFontsize? 


,20) 


Problems 


Solve the wave equation uj, = c?u,44, 0 < £ < L, 0 < t, subject to the boundary conditions 
that u(0,t) = u(L,t) = 0,0 < t, and the following initial conditions for 0 < x < L. Use 


MATLAB to illustrate your solution. 
u(z,0)—0, u(x,0)—1 
u(r,0)—]1, u(xz,0)-0 
3 ute dc 3ha/2L, 0«z < 2L/3, neo 


3h(L — z)/L, 


2L/3 « zx « L, 
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SOLUTION 


Figure 7.3.9: Same as Figure 7.3.8 except € = a/20. 


4. u(z,0) = [3sin(zz/L) —sin(3zz/L)]/4, ^ u(x,0) = 0, 


0, 0<a<L/A4 


5. u(z,0) =sin(ra/L), u(x,0) =< a 


, ^ L/A«zx «3L/A 


0 3L/4<a<L 


E x, 0<a<L/2, u 
T. i9 - 4%, Leen i, uz(z,0) = 0 


8. Solve the wave equation 


u u iene 
Sea L<T 
Ot? Ox?’ , 
subject to the boundary conditions 


Ou(0,t)  Ou(r, t) 
ðr | . ôr 


— 0, 


and the initial conditions 


Ou(x, 0) 


u(z,0) = 0, AE 


= 1+ cos? (z)), 


Hint: You must include the separation constant of zero. 


ax/L, O0<a<L/2 
a(L — x)/L, L/2<a<L 


0<t, 


0<t, 


0crcm. 
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9. Solve!? the wave equation 


o?u o Qu 

— = —|[ r— 0< 1 0<t 

OF xx). idis so 
subject to the boundary conditions 

lim |u(z, t)| < oo, u(1,t) — 0, 0 « t, 

z—0 


and the initial conditions 


T, 0<2 <a, 


0, a<a2<1l, 


u(z,0)=0, O<a<1, ae | 


where a < 1. Hint: Use the substitution 4x = r?. 


10. The differential equation for the longitudinal vibrations of a rod within a viscous fluid 
is 
0?u Qu ^u 
2h— = c? 
gp y ceu 
where c is the velocity of sound in the rod and h is the damping coefficient. If the rod is 
fixed at x = 0 so that u(0,t) = 0, and allowed to freely oscillate at the other end x = L, so 
that u4 (L,t) = 0, find the vibrations for any location z and subsequent time t if the rod has 
the initial displacement of u(x,0) = x and the initial velocity u;(z,0) = 0 for 0 < x < L. 
Assume that h < cx/(2L). Why? 


0<a<L, 0 « t, 


11. A closed pipe of length L contains air whose density is slightly greater than that of the 
outside air in the ratio of 1 + sọ to 1. Everything being at rest, we suddenly draw aside the 
disk closing one end of the pipe. We want to determine what happens inside the pipe after 
we remove the disk. 
As the air rushes outside, it generates sound waves within the pipe. The wave equation 
u 20u 


ae ^ ar? 
governs these waves, where c is the speed of sound and u(x,t) is the velocity potential. 
Without going into the fluid mechanics of the problem, the boundary conditions are 
a. No flow through the closed end: u,(0,t) = 0. 
b. No infinite acceleration at the open end: u,4(L,t) = 0. 
c. Air is initially at rest: u,(z,0) = 0. 


d. Air initially has a density greater than the surrounding air by the amount so: u;(r,0) = 
—c? sg. 
Find the velocity potential at all positions within the pipe and all subsequent times. 


12. One of the classic applications of the wave equation has been the explanation of the 
acoustic properties of string instruments. Usually we excite a string in one of three ways: 


10 Solved in a slightly different manner by Bailey, H., 2000: Motions of a hanging chain after the free 
end is given an initial velocity. Am. J. Phys., 68, 764-767. 
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by plucking (as in the harp, zither, etc.), by striking with a hammer (piano), or by bowing 
(violin, violoncello, etc.). In all of these cases, the governing partial differential equation is 


Ou 0u 


ae © Ox? 


with the boundary conditions u(0,t) = u(L,t) = 0, 0 < t. For each of the following methods 
of exciting a string instrument, find the complete solution to the problem: 


(a) Plucked string 


For the initial conditions: 


B Bx/a, 0<4<a, 
1650) as a. a« c « l, 


and 
uz(x, 0) = 0, 0<a<L, 


show that 


We note that the harmonics are absent where sin(nza/L) = 0. Thus, if we pluck 
the string at the center, all of the harmonics of even order are absent. Furthermore, the 
intensity of the successive harmonics varies as n~?. The higher harmonics (overtones) are 
therefore relatively feeble compared to the n = 1 term (the fundamental). 


(b) String excited by impact 


The effect of the impact of a hammer depends upon the manner and duration of the contact, 
and is more difficult to estimate. However, as a first estimate, let 


u(x, 0) = 0, 0<2¢<L, 
and 


a-e<u<ate, 


_ fb 
uz, 0) = lo. otherwise, 


where e «& 1. Show that the solution in this case is 


AuL e . {NTEN , (NTAN , (NTEN . (nret 
u(m, t) = nÈ sin( T ) sin( 7 ) sin( 7 Ja (552). 


As in part (a), the nth mode is absent if the origin is at a node. The intensity of 
the overtones are now of the same order of magnitude; higher harmonics (overtones) are 
relatively more in evidence than in part (a). 
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(c) Bowed violin string 


The theory of the vibration of a string when excited by bowing is poorly understood. The 
bow drags the string for a time until the string springs back. After a while the process 
repeats. It can be shown!! that the proper initial conditions are 


u(z,0) = 0, 0<a<L, 


and 
u(x, 0) = 48c(L — x)/L?, O0<a<L, 


where f is the maximum displacement. Show that the solution is now 


88 1 , ynna , [nret 
u(x,t) = 5 >> -5sin( L )sin( L | 
n=1 


7.4 D'ALEMBERT'S FORMULA 


In the previous section we sought solutions to the homogeneous wave equation in the 
form of a product X (x)T(t). For the one-dimensional wave equation there is a more general 
method for constructing the solution, published by d'Alembert!? in 1747. 

Let us determine a solution to the homogeneous wave equation 


Ou 8u 


ag am coo«r«oo, Oct, (7.4.1) 
which satisfies the initial conditions 
o 0 
u(x, 0) = f(x), one») = g(x), -o<4<o. (7.4.2) 


ot 


We begin by introducing two new variables £, n defined by € = x + ct, and ņ = x — ct, 
and set u(x,t) = w(€,7). The variables € and ņ are called the characteristics of the wave 
equation. Using the chain rule, 


ð 00 Md ð a 


Ox Ox0E JAxOn OE ON (1:43) 
8. WO mö D ð 
Di u | Bian BE On eae) 
B aaro .0Y,.05979 2 
Ba? da dE (s i: 5) T Bis Bn (s > Z) (OER) 
2 2 2. 
2 g i (7.4.6) 


= 88 ^as agn 


11 See Lamb, H., 1960: The Dynamical Theory of Sound. Dover Publishers, Section 27. 


1? D'Alembert, J., 1747: Recherches sur la courbe que forme une corde tendué mise en vibration. Hist. 
Acad. R. Sci. Belles Lett., Berlin, 214—219. 


The Wave Equation 319 


Although largely self-educated in mathematics, Jean Le Rond d'Alembert (1717-1783) gained equal 
fame as a mathematician and philosophe of the continental Enlightenment. By the middle of the 
eighteenth century, he stood with such leading European mathematicians and mathematical physi- 
cists as Clairaut, D. Bernoulli, and Euler. Today we best remember him for his work in fluid 
dynamics and applying partial differential equations to problems in physics. (Portrait courtesy of 
the Archives de l'Académie des sciences, Paris.) 


and similarly 


9? gf 8? 9? 9? 
= 2 7.4.7 
om 4 (ja deOn + sz) ea) 
so that the wave equation becomes 
LA (7.4.8) 
Den S A. 
The general solution of Equation 7.4.8 is 
w(&,5) = F(g) + G(n). (7.4.9) 


Thus, the general solution of Equation 7.4.1 is of the form 
u(z,t) = F(x + ct) + G(x — ct), (7.4.10) 


where F and G are arbitrary functions of one variable and are assumed to be twice differen- 
tiable. Setting t = 0 in Equation 7.4.10 and using the initial condition that u(x,0) = f(x), 


F(x) + G(x) = f(a). (7.4.11) 
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SOLUTION 


Figure 7.4.1: D'Alembert's solution, Equation 7.4.18, to the wave equation. 
The partial derivative of Equation 7.4.10 with respect to t yields 


Ou(a, t) 
Ot 


= cF'(x + ct) — cG" (x — ct). (7.4.12) 


Here primes denote differentiation with respect to the argument of the function. If we set 
t = 0 in Equation 7.4.12 and apply the initial condition that u;(z,0) = g(x), 


cF'(x) — cG'(x) = g(a). (7.4.13) 
Integrating Equation 7.4.13 from 0 to any point x gives 


F(y-G(g) = : i; Ec (7.4.14) 


where C is the constant of integration. Combining this result with Equation 7.4.11, 


F(x) = fe) + af g(t) dr + E: (7.4.15) 
and if a 
G(x) = = — Al g(r)dr — 3 (7.4.16) 


If we replace the variable x in the expression for F and G by x + ct and x — ct, respectively, 
and substitute the results into Equation 7.4.10, we finally arrive at the formula 


flea) +fi@—d) 1 ZR 
2 | zl 


u(x,t) = g(r) dr. (7.4.17) 


—ct 


This is known as d’Alembert’s formula for the solution of the wave equation, Equation 7.4.1, 
subject to the initial conditions, Equation 7.4.2. It gives a representation of the solution in 
terms of known initial conditions. 
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e Example 7.4.1 

To illustrate d’Alembert’s formula, let us find the solution to the wave equation, Equa- 
tion 7.4.1, satisfying the initial conditions u(z,0) = H(x -- 1)— H(z — 1) and u;(z,0) = 0, 
—oo < x < oo. By d'Alembert's formula, Equation 7.4.17, 


u(x,t) = 3 [H(z * ct -- 1) - H(z — ct -1) - H(z -ct-1) - H(x -ct—1)]. (7.4.18) 


We illustrate this solution in Figure 7.4.1 generated by the MATLAB script 


^4 set mesh size for solution 
clear; dx = 0.1; dt = 0.1; 
^4 compute grid 
X=[-10:dx:10]; T = [0:dt:10]; 
for j-1:length(T); t = T(j); 
for i-1:length(X); x = X(i); 
^4 compute characteristics 
characteristic 1 = x + t; characteristic 2-2 x - t; 
^ compute solution 
XX(i,j) = x; TT(i,j) = t; 
u(i,j) = O.5*(stepfun(characteristic 1,-1)*stepfun(characteristic 2,-1)... 
-stepfun(characteristic 1, 1)-stepfun(characteristic_2, 1)); 
end; end 
surf(XX,TT,u); colormap autumn; 
xlabel(’DISTANCE’ ,’Fontsize’ ,20); ylabel(’TIME’, ’Fontsize’ ,20) 
zlabel(’SOLUTION’ , ’Fontsize’ , 20) 


In this figure, you can clearly see the characteristics as they emanate from the discontinuities 
at x = +1. 


e Example 7.4.2 


Let us find the solution to the wave equation, Equation 7.4.1, when u(z,0) = 0, and 
u;(x,0) = sin(2z), —oo < x < oo. By d'Alembert's formula, the solution is 


p gere in(2z) sin(2ct 
ust) = = f sin(2r) dr = PED nN, (7.4.19) 


z—ci 

In addition to providing a method of solving the wave equation, d’Alembert’s solution 

can also provide physical insight into the vibration of a string. Consider the case when we 

release a string with zero velocity after giving it an initial displacement of f(x). According 
to Equation 7.4.17, the displacement at a point x at any time t is 


f(z + ct) + f(z —ct) 


u(x,t) = s 


(7.4.20) 


Because the function f(x -— ct) is the same as the function of f(x) translated to the right by 
a distance equal to ct, f (z— ct) represents a wave of form f(x) traveling to the right with the 
velocity c, a forward wave. Similarly, we can interpret the function f(x -- ct) as representing 
a wave with the shape f(x) traveling to the left with the velocity c, a backward wave. Thus, 
the solution, Equation 7.4.17, is a superposition of forward and backward waves traveling 
with the same velocity c and having the shape of the initial profile f(x) with half of the 
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Figure 7.4.2: The propagation of waves due to an initial displacement according to d'Alembert's formula. 


amplitude. Clearly the characteristics x + ct and x — ct give the propagation paths along 
which the waveform f(x) propagates. 


e Example 7.4.3 


To illustrate our physical interpretation of d'Alembert's solution, suppose that the 
string has an initial displacement defined by 


= a — |z], —axmcxa, 
fi) = { 0, otherwise. (7.4.21) 


In Figure 7.4.2(A) the forward and backward waves, indicated by the dashed line, coincide 
at t = 0. As time advances, both waves move in opposite directions. In particular, at t = 
a/(2c), they moved through a distance a/2, resulting in the displacement of the string shown 
in Figure 7.4.2(B). Eventually, at t = a/c, the forward and backward waves completely 
separate. Finally, Figures 7.4.2(D) and 7.4.3(E) show how the waves radiate off to infinity 
at the speed of c. Note that at each point the string returns to its original position of rest 
after the passage of each wave. 


Consider now the opposite situation when u(x,0) = 0, and w,(z,0) = g(x) 


. The 
displacement is 
1 z--ct 
u(x,t) = l g(t) dr. (7.4.22) 
2c z—ci 
If we introduce the function "NT 
(x) = F g(r) dr, (7.4.23) 
2c 0 


then we can write Equation 7.4.22 as 


u(x,t) = v(x + ct) — (x — ct), (7.4.24) 


which again shows that the solution is a superposition of a forward wave —y(a# — ct) and 
a backward wave y(x + ct) traveling with the same velocity c. The function o, which we 
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Figure 7.4.3: Displacement of an infinite, moving threadline when c = 10, and V = 1. 


compute from Equation 7.4.23 and the initial velocity g(x), determines the exact form of 
these waves. 


e Example 7.4.4: Vibration of a moving threadline 


The characterization and analysis of the oscillations of a string or yarn have an impor- 
tant application in the textile industry because they describe the way that yarn winds on a 
bobbin.!? As we showed in Section 7.1, the governing equation, the “threadline equation,” 
is 

Ou En o?u i g9*u 
ðt? ' OzOt |" Ox? 
where a = 2V, 8 = V? — gT/p, V is the windup velocity, g is the gravitational attraction, 
T is the tension in the yarn, and p is the density of the yarn. We now introduce the 
characteristics € = x + Ait, and 7 = x + Ast, where A; and Az are yet undetermined. Upon 
substituting € and 7 into Equation 7.4.25, 


=0, (7.4.25) 


(A2 + 2VÀi + V? — gT/p)uge + (MÀ +2V A2 + V? — GT/p) Un 
+ [2V? — 2gT/p + 2V (A4 + Az) + 224A9]ug,, = 0. (7.4.26) 


If we choose A1 and As to be roots of the equation 
M --2VA +V? — gT'/p — 0, (7.4.27) 


Equation 7.4.26 reduces to the simple form 


ugn = 0, (7.4.28) 
which has the general solution 
u(x,t) = F(£) + G(n) = F(x + At) + G(x + Ast). (7.4.29) 
Solving Equation 7.4.27 yields 
à =c- V, and Ag = —c — V, (7.4.30) 


13 See Swope, R. D., and W. F. Ames, 1963: Vibrations of a moving threadline. J. Franklin Inst., 275, 
36-55. 


324 Advanced Engineering Mathematics with MATLAB 


DISPLACEMENT OF STRING 
D'ALEMBERT SOLUTION 
SE Il 
yal 
`=- 19 


a Re N QUU uso y 
-8 -6 -4 -2 02 4 6 8 10 I2 14 6 i8 20 22 


-0 


Figure 7.4.4: Displacement of an infinite, moving threadline when c = 11, and V = 10. 


where c = \/gT/p. If the initial conditions are 


u(x, 0) = f(x), and u(x, 0) = g(x), (7.4.31) 
then "m 
1 x 1t 
u(x,t) = F Arf (x + Aot) — àa f (z + Ait) 4 i g(r) ar]. (7.4.32) 


Because A4 does not generally equal to A2, the two waves that constitute the motion of 
the string move with different speeds and have different shapes and forms. For example, if 


f(x) = , and g(x)=0, (7.4.33) 


E iiid \ (7.4.34) 


Figures 7.4.3 and 7.4.4 illustrate this solution for several different parameters. 
Problems 


Use d’Alembert’s formula to solve the wave equation, Equation 7.4.1, for the following initial 
conditions defined for |x| < oo. Then illustrate your solution using MATLAB. 


1. u(z,0) = 2sin(z) cos(x) uz(x,0) = cos(x) 

2. u(x,0) = xsin(x) u(x, 0) = cos(2x) 

3. u(x,0) 2 1/(x? +1) u(x, 0) = e” 

4. u(x, 0) = e7? u(x, 0) = 1/(a? + 1) 

5. u(x,0) = cos(ma/2) u(x, 0) = sinh(az) 

6. u(x#,0) = sin(3x) u(x, 0) = sin(2x) — sin(x) 


7. Assuming that the functions F and G are differentiable, show by direct substitution that 


u(x,t) = EF (x + ct) - EG(z — ct) — ¢ke7t? + 2ka’, 


The Wave Equation 325 


and pe 
t 
v(x, t) 2 cF (x + ct) +cG(x — ct) E 
are the d'Alembert solutions to the hyperbolic system 
o 
d pt Du LN —oo«mz«oo, 0<t, 


ðt Or — Or lO 
where c? — E/p and E, k, and p are constants. 
8. D’Alembert’s solution can also be used in problems over the limited domain 0 < x < L. 
To illustrate this, let us solve the wave equation, Equation 7.4.1, with the initial conditions 


u(z,0) = 0, u;(z, 0) = Vmaz(1 — z/L), 0 < x < L, and the boundary conditions u(0,t) = 
u(L,t) 20,0 « t. 


Step 1: Show that the solution to this problem is 


u(x,t) = $[Vo(z + ct) — Vo(z — ct), 


where 


1 X Vmax 
Vo = 5 f° wig ojag= “BX (1X), «xe 


along with the periodicity conditions Vo(x) = Vo(—x), and Vo(L + x) = Vo(L — x) to take 
care of those cases when the argument of Vo(-) is outside of (0, D). Hint: Substitute the 
solution into the boundary conditions. 


Step 2: Show that at any point x within the interval (0, L), the solution repeats with a 
period of 2L/c if ct > 2L. Therefore, if we know the behavior of the solution for the time 
interval 0 « ct « 2L, we know the behavior for any other time. 


Step 3: Show that the solution at any point x within the interval (0, L) and time t + L/c, 
where 0 < ct < L, is the mirror image (about u = 0) of the solution at the point L — z and 
time t, where 0 < ct < L. 


Step 4: Show that the maximum value of u(x,t) occurs at x = ct, where 0 < x < L and 
when 0 < ct < L. At that point, 


where Umax equals the largest magnitude of u(x,t) for any time t. Plot Umag as a function 
x and show that it is a parabola. Hint: Find the maximum value of u(x,t) when 0 < x € ct 
andct<a<LwithO0<a+ct<LorL<a+4+et < 2L. 


7.5 NUMERICAL SOLUTION OF THE WAVE EQUATION 


Despite the powerful techniques shown in the previous sections for solving the wave 
equation, often these analytic techniques fail and we must resort to numerical techniques. In 
contrast to the continuous solutions, finite difference methods, a type of numerical solution 
technique, give discrete numerical values at a specific location (£m, tn), called a grid point. 
These numerical values represent a numerical approximation of the continuous solution 
over the region (£m — Az/2, £m + Az/2) and (tn — At/2,t, + At/2), where Ax and At are 
the distance and time intervals between grid points, respectively. Clearly, in the limit of 
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Az, At — 0, we recover the continuous solution. However, practical considerations such as 
computer memory or execution time often require that Az and At, although small, are not 
negligibly small. 

The first task in the numerical solution of a partial differential equation is the re- 
placement of its continuous derivatives with finite differences. The most popular approach 
employs Taylor expansions. If we focus on the z-derivative, then the value of the solution 
at u[(m + 1)Az, nAt] in terms of the solution at (mAz, nAt) is 


u _ Ag Ou(zm, tn) , (Az)? Pultm, tn) 
ul(m + 1)Az, nAt] = u(£m, tn) 4 il Da ti 5x2 
E (Aa)? Pulm, tn) | (Ag)? O^ u(zsss tn) 


3! Ox? 
= U(£m, tn) Ax 


where O[(A)?] gives a measure of the magnitude of neglected terms.'4 


From Equation 7.5.2, one possible approximation for wu, is 
Ou(zm, tn) Uc — Ur 


ee = TAL 71. + O(An), (7.5.3) 


where we use the standard notation that u?, = u(x,,,t,). This is an example of a one- 
sided finite difference approximation of the partial derivative u,. The error in using this 
approximation grows as Az. 

Another possible approximation for the derivative arises from using u(mAz,nAt) and 
u|(m — 1)Aa,nAt]. From the Taylor expansion: 


Az 0u(tm,tn) | (Ax)? 07u(2m, tn) 


ul(m — 1)Aa,nAt] = u(zi tn) 


1! Ox 2! Ox? 
(Az)? 09u(zq tn) l (Az)* 0^u(z,, tn) 
3 O8 ^ 4 óx po qs 
we can also obtain the one-sided difference formula 
Ug, tn) Um — Um—1 
= + O(Az). 7.5. 
Ox Ax Cd) US) 
A third possibility arises from subtracting Equation 7.5.4 from Equation 7.5.1: 
Qu(zm, tn 
um ut = 2A Def) + O[(Az)*}, (7.5.6) 
or Bul 
U(Xm , tn ni — mca 2 
= Ax)*). 7.5.7 
m ed + O((Az" (7.5.7) 


Thus, the choice of the finite differencing scheme can produce profound differences in the 
accuracy of the results. In the present case, centered finite differences can yield results that 
are markedly better than using one-sided differences. 


14 The symbol O is a mathematical notation indicating relative magnitude of terms, namely that f(e) = 
O(e") provided lime.,0|f(c)/e"| < oo. For example, as e — 0, sin(e) = O(c), sin(c?) = O(c?), and 
cos(e) = O(1). 
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Figure 7.5.1: Schematic of the numerical solution of the wave equation with fixed endpoints. 


To solve the wave equation, we need to approximate uz,. If we add Equation 7.5.1 and 
Equation 7.5.4, 


O^ u(am, tn) 


a. (Ac) + O[(Ax)"), (7.5.8) 


n n "- n 
Um+1 # Um—1 mn 2uj, F 


or 
u(x tn) Umi — 2Um t Umi 
mtn) _ “m m m OI Aa 21. 7.5.9 
Ne (Az + O[(Az)?] (7.5.9) 


Similar considerations hold for the time derivative. Thus, by neglecting errors of O[(Az)?] 
and O[(At)?], we may approximate the wave equation by 


n+1 n n—1 n = n n 
Um — 2uy, t Um ee’) Um+1 2Um "i Um 1 


(At)? E (Az)? 


(7.5.10) 


Because the wave equation represents evolutionary change of some quantity, Equation 7.5.10 
is generally used as a predictive equation where we forecast u?*+! by 


n-l, (= 


n+1 = 
s Ag 


2 
Un, Quy, — U ) (umi — 2un + um i1). (7.5.11) 


Figure 7.5.1 illustrates this numerical scheme. 
The greatest challenge in using Equation 7.5.11 occurs with the very first predic- 
tion. When n = 0, clearly u9,,,, u9,, and u?,_, are specified from the initial condition 


u(mAz,0) = f(am). But what about u,!? Recall that we still have u;(z,0) = g(x). If we 
use the backward difference formula, Equation 7.5.5, 


= g(zq). (7.5.12) 


Solving for ul, 


un! = ul, — Atg(zm). (7.5.13) 
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One disadvantage of using the backward finite-difference formula is the larger error 
associated with this term compared to those associated with the finite-differenced form of 
the wave equation. In the case of the barotropic vorticity equation, a partial differential 
equation with wave-like solutions, this inconsistency eventually leads to a separation of 
solution between adjacent time levels.!? This difficulty is avoided by stopping after a certain 
number of time steps, averaging the solution, and starting again. 

A better solution for computing that first time step employs the centered difference 


form 
1 1 


Um — Um 


along with the wave equation 
Um i 2up, + Um gu 7 2u, + uhi 
TOE —c (Ac (7.5.15) 
so that 
At\? f(amt1) + f(em— At\? 
ua (S) f(@m41) : Hen) | | (=) f(xs)- Atg(zs). — (7.5.16) 


Although it appears that we are ready to start calculating, we need to check whether 
our numerical scheme possesses three properties: convergence, stability, and consistency. 
By consistency we mean that the difference equations approach the differential equation as 
Ax, At — 0. To prove consistency, we first write u?, 4, u^, 4, u^; |, and u% in terms of 


u(x,t) and its derivatives evaluated at (am,tn). From Taylor expansions, 


Ou m 8u m Ou m 
umiri m= n t Ary + (Ax) DS + g(A2) 573 Fey 7.5.17) 
" du|”™ ^u |" u|” 
um Sun Ax. -— (4r) a a (Ar) 5-3 Te 7.5.18) 
ü " Qu |^ Q?u |^ u|” 
urti =u? + AUX + $(At)? wlt (At)? ae). dere 7.5.19) 
n-l. ,n u 1 u 1 u 
Substituting Equations 7.5.17 through 7.5.20 into Equation 7.5.10, we obtain 
utl _ Ju” E un-1 á un 417 2u”? zb yr i 
(At) NEUE ied 
Pu go us|" tu |" tu |" 
= ( T Maa oa) ae au] de 


The first term on the right side of Equation 7.5.21 vanishes because u(z, t) satisfies the wave 
equation. As Az — 0, At — 0, the remaining terms on the right side of Equation 7.5.21 


15 Gates, W. L., 1959: On the truncation error, stability, and convergence of difference solutions of the 
barotropic vorticity equation. J. Meteorol., 16, 556—568. See Section 4. 
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tend to zero and Equation 7.5.10 is a consistent finite difference approximation of the wave 
equation. 

Stability is another question. Under certain conditions the small errors inherent in 
fixed precision arithmetic (round off) can grow for certain choices of Ax and At. During 
the 1920s the mathematicians Courant, Friedrichs, and Lewy!® found that if cAt/Az > 1, 
then our scheme is unstable. This CFL criterion has its origin in the fact that if cAt > Az, 
then we are asking signals in the numerical scheme to travel faster than their real-world 
counterparts and this unrealistic expectation leads to instability! 

One method of determining stability, commonly called the von Neumann method,!* 
involves examining solutions to Equation 7.5.11 that have the form 

us espuma. (7.5.22) 


m 


where 0 is an arbitrary real number and A is a yet undetermined complex number. Our 
choice of Equation 7.5.22 is motivated by the fact that the initial condition u9, can be 
represented by a Fourier series where a typical term behaves as e/""9, 
If we substitute Equation 7.5.22 into Equation 7.5.10 and divide out the common factor 
ete we have that 
e^ — 2+ e ^ E ci? — 2+ "m 


(At)? (Ar)? C 


sin? (3) = = sin? (5) ; (7.5.24) 


The behavior of u?, is determined by the values of À given by Equation 7.5.24. If cAt/Ax < 
1, then A is real and u”, is bounded for all 0 as n — oo. If cAt/Ax > 1, then it is possible 
to find a value of 0 such that the right side of Equation 7.5.24 exceeds unity and the 
corresponding values of À occur as complex conjugate pairs. The A with the negative 
imaginary part produces a solution with exponential growth because n = tn/At — oo as 
At — 0 for a fixed t, and cAt/Az. Thus, the value of u?, becomes infinitely large, even 
though the initial data may be arbitrarily small. 

Finally, we must check for convergence. A numerical scheme is convergent if the numer- 
ical solution approaches the continuous solution as Az, At — 0. The general procedure for 
proving convergence involves the evolution of the error term ez, which gives the difference 
between the true solution u(am,tn) and the finite difference solution u?,. From Equation 


7.5.21, 
(2) 
Az 
(7.5.25) 


Let us apply Equation 7.5.25 to work backwards from the point (am,tn) by changing n to 
n — 1. The nonvanishing terms in e} reduce to a sum of n + 1 values on the line n = 1 plus 
i (n + 1)n terms of the form A(Az)*. If we define the max norm ||en|| = max; |e”,|, then 


(7.5.23) 


Or 


At\? 
cit = (SE) (Cart eh) +2 e - ex + OKA]  O[(Az) (At? 


llen|| € nB(Az)? + (n + 1)nA(Ax)*. (7.5.26) 


16 Courant, R., K. O. Friedrichs, and H. Lewy, 1928: Uber die partiellen Differenzengleichungen der 
mathematischen Physik. Math. Annalen, 100, 32-74. Translated into English in IBM J. Res. Dev., 11, 
215-234. 

17 After its inventor, J. von Neumann. See O’Brien, G. G., M. A. Hyman, and S. Kaplan, 1950: A study 
of the numerical solution of partial differential equations. J. Math. Phys. (Cambridge, MA), 29, 223-251. 
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Figure 7.5.2: The growth of error ||e4|| as a function of ct for various resolutions. For the top line, 
Az = 0.1; for the middle line, Az = 0.01; and for the bottom line, Az = 0.001. 


Because nAz < ctn, Equation 7.5.26 simplifies to 
|len|| € ctnB(Aa)? + 1802 A(Ax)?. (7.5.27) 


Thus, the error tends to zero as Az — 0, verifying convergence. We illustrate Equation 
7.5.27 by using the finite difference equation, Equation 7.5.11, to compute ||e,|| during a 
numerical experiment that used cAt/Az = 0.5, f(x) = sin(nx), and g(x) = 0; |len|| is 
plotted in Figure 7.5.2. Note how each increase of resolution by 10 results in a drop in the 
error by 100. 

In the following examples we apply our scheme to solve a few simple initial and bound- 
ary conditions: 


e Example 7.5.1 


For our first example, we resolve Equation 7.3.1 through Equation 7.3.3 and Equation 
1.3.25 and Equation 7.3.26 numerically using MATLAB. The MATLAB code is 


clear 
coeff = 0.5; coeffsq = coeff * coeff % coeff = cAt/Ax 
dx = 0.04; dt = coeff * dx; N = 100; x = O:dx:1; 
M = 1/dx + 1; 4 M = number of spatial grid points 
^4 introduce the initial conditions via F and G 
F = zeros(M,1); G = zeros(M,1); 
for m= 1:M 
if x(m) >= 0.25 & x(m) <= 0.5 
F(m) = 4 * x(m) - 1; end 
if x(m) >= 0.5 & x(m) <= 0.75 
F(m) = 3 - 4 * x(m); end; end 
^4 at t = 0, the solution is: 
tplot(1) = 0; u = zeros(M,N+1); u(1:M,1) = F(1:M); 
^ at t = At, the solution is given by Equation 7.5.16 
tplot(2) = dt; 
for m = 2:M-1 
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Figure 7.5.3: The numerical solution u(z,t)/h of the wave equation with cAt/Ax = i using Equation 


7.5.11 at various positions x’ = z/L and times t’ = ct/ L. The exact solution is plotted in Figure 7.3.2. 


u(m,2) = O.5*coeffsq*(F(m*i1)*F(m-1)) + (1-coeffsq) *F(m)+dt*G(m) ; 
end 
^ in general, the solution is given by Equation 7.5.11 
for n = 2:N 
tplot(n+1) = dt * n; 
for m = 2:M-1 
u(m,n+1) = 2*u(m,n)-u(m,n-1) + coeffsq*(u(m+1,n)-2*u(m,n)t+u(m-1,n)); 
end; end 
X = x’ * ones(1,length(tplot)); T = ones(M,1) * tplot; 
surf (X,T,u) 
xlabel(’DISTANCE’ ,’Fontsize’ ,20); ylabel(’TIME’, ’Fontsize’ ,20) 
zlabel(’SOLUTION’ , ’Fontsize’ ,20) 


Overall, the numerical solution shown in Figure 7.5.3 approximates the exact or analytic 
solution well. However, we note small-scale noise in the numerical solution at later times. 
Why does this occur? Recall that the exact solution could be written as an infinite sum 
of sines in the x dimension. Each successive harmonic adds a contribution from waves of 
shorter and shorter wavelength. In the case of the numerical solution, the longer-wavelength 
harmonics are well represented by the numerical scheme because there are many grid points 
available to resolve a given wavelength. As the wavelengths become shorter, the higher 
harmonics are poorly resolved by the numerical scheme, move at incorrect phase speeds, 
and their misplacement (dispersion) creates the small-scale noise that you observe rather 
than giving the sharp angular features of the exact solution. The only method for avoiding 
this problem is to devise schemes that minimize dispersion. 


e Example 7.5.2 


Let us redo Example 7.5.1 except that we introduce the boundary condition that 
u,(L,t) = 0. This corresponds to a string where we fix the left end and allow the right 
end to freely move up and down. This requires a new difference condition along the right 
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Figure 7.5.4: The numerical solution u(z, t)/h of the wave equation when the right end moves freely with 
cAt/ Avr = i using Equation 7.5.11 and Equation 7.5.30 at various positions x’ = z/L and times t = ct/L. 


boundary. If we employ centered differencing, 


Up, ~ pa 
ar  07—* a0 7.5.28 
2Ar i l l 
and 2 
At 
up = Quy -up ' + (=) (uri — 2u up i). (0929) 


Eliminating u7 ,, between Equation 7.5.28 and Equation 7.5.29, 


At\? 
ut = 2u? — ut! 4 (=) (2u2.., — 2u£). (7.5.30) 


For the special case of n — 1, Equation 7.5.30 becomes 


up = f(zr) + (=) [f (zz) — f(zz)| + Atf (zL). (7.5.31) 


The MATLAB code used to numerically solve the wave equation with a Neumann bound- 
ary condition is very similar to the one used in the previous example that we must add the 
line 
u(M,2) = coeffsq * F(M-1) + (1-coeffsq) * F(M) + dt*G(M); 
after 
for m = 2:M-1 

u(m,2) = O.5*coeffsq*(F(m*i)*F(m-1)) + (1-coeffsq) *F (m)+dt*G(m) ; 
end 
and 

u(M,n+1) = 2*u(M,n)-u(M,n-1) + 2*coeffsq*(u(M-1,n)-u(M,n)); 
after 
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Figure 7.5.5: The numerical solution u(x,t) of the first-order hyperbolic partial differential equation 
ut + ug = 0 using the Lax-Wendroff formula as observed at t = 0,1,2,...,20. The initial conditions are 
given by Equation 7.3.25 with h = 1, At/Ax = i and Az — 0.02. 


for m = 2:M-1 
u(m,n+1) = 2*u(m,n)-u(m,n-1) + coeffsq*(u(mt1,n)-2*u(m,n)*u(m-1,n)); 


end 


Figure 7.5.4 shows the results. The numerical solution agrees well with the exact solution 


32h — 1 E - pze] z (2n — oes 


«[as[ Ee] — oof ORDA] ER DI er. 


The results are also consistent with those presented in Example 7.5.1, especially with regard 
to small-scale noise due to dispersion. 


Project: Numerical Solution of First-Order Hyperbolic Equations 


The equation u;--u;, = 0 is the simplest possible hyperbolic partial differential equation. 
Indeed, the classic wave equation consists of a system of these equations: uz + cu, = 0, and 
Ut + cu, = 0. In this project you will examine several numerical schemes for solving such a 
partial differential equation using MATLAB. 


Step 1: One of the simplest numerical schemes is the forward-in-time, centered-in-space of 


n+1 
Um = 


At 


n n n 
Um \ Um-+1 — Um -1 


Use von Neumann's stability analysis to show that this scheme is always unstable. 
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Figure 7.5.6: Same as Figure 7.5.5 except that the centered-in-time, centered-in-space scheme was used. 


Step 2: The most widely used method for numerically integrating first-order hyperbolic 
equations is the Lax-Wendroff method:'® 


Ua = Hin 7 gag meri Um) + apaga imei 7 Pus sai) 


This method introduces errors of O[(At)?] and O((Az)?]. Show that this scheme is stable 
if it satisfies the CFL criteria of At/Ax < 1. 

Using the initial condition given by Equation 7.3.25, write a MATLAB code that uses 
this scheme to numerically integrate us + Us = 0. Plot the results for various At/Ax over 
the interval 0 € x < 1 given the periodic boundary conditions of u(0,t) = u(1,t) for the 
temporal interval 0 € t < 20. See Figure 7.5.5. Discuss the strengths and weaknesses of 
the scheme with respect to dissipation or damping of the numerical solution and preserving 
the phase of the solution. Most numerical methods books discuss this.!? 


Step 3: Another simple scheme is the centered-in-time, centered-in-space of 


n+1 
m 


1 


n— n Eo 
u 7 Um f Um+1 u 


m-—1 . 
2At l 2A^x = 


This method introduces errors of O[(At)?] and O[(Az)?]. 
Repeat the analysis from Step 1 for this scheme. One of the difficulties is taking the 
first time step. Use the scheme in Step 1 to take this first time step. See Figure 7.5.6. 


Further Reading 


King, G. C., 2009: Vibrations and Waves. Wiley, 228 pp. This book emphasises the physical 
principles, rather than the mathematics. 


18 Lax, P. D., and B. Wendroff, 1960: Systems of conservative laws. Comm. Pure Appl. Math., 13, 
217-237. 


19 For example, Lapidus, L., and G. F. Pinder, 1982: Numerical Solution of Partial Differential Equations 
in Science and Engineering. John Wiley & Sons, 677 pp. 
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Koshlyakov, N. S., M. M. Smirnov, and E. B. Gliner, 1964: Differential Equations of Math- 
ematical Physics. North-Holland Publishing, 701 pp. See Part I. Detailed presentations of 
solution techniques. 


Morse, P. M., and H. Feshback, 1953: Methods of Theoretical Physics. McGraw -Hill Book 
Co., 997 pp. Chapter 11 is devoted to solving the wave equation. 
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SOLUTION 


Chapter 8 
The Heat Equation 


In this chapter we deal with the linear parabolic differential equation 


Ou 207 u 


in the two independent variables x and t. This equation, known as the one-dimensional 
heat equation, serves as the prototype for a wider class of parabolic equations 


Oru 9? Oru Ou Ou 
a(x, t) z Hb(x,t) " + c(z,t) ag ZI (8.0.2) 


where b? = 4ac. It arises in the study of heat conduction in solids as well as in a variety 
of diffusive phenomena. The heat equation is similar to the wave equation in that it is also 
an equation of evolution. However, the heat equation is not “conservative” because if we 
reverse the sign of t, we obtain a different solution. This reflects the presence of entropy, 
which must always increase during heat conduction. 


8.1 DERIVATION OF THE HEAT EQUATION 


To derive the heat equation, consider a heat-conducting homogeneous rod, extending 
from x = 0 to z = L along the z-axis (see Figure 8.1.1). The rod has uniform cross section 
A and constant density p, is insulated laterally so that heat flows only in the x-direction, 
and is sufficiently thin so that the temperature at all points on a cross section is constant. 
Let u(x,t) denote the temperature of the cross section at the point x at any instant of time 
t, and let c denote the specific heat of the rod (the amount of heat required to raise the 
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DE 


x+Ax 
Figure 8.1.1: Heat conduction in a thin bar. 


temperature of a unit mass of the rod by a degree). In the segment of the rod between the 
cross section at x and the cross section at x + Ax, the amount of heat is 


zc4Acx 
Q(t) — / cpAu(s, t) ds. (8.1.1) 


On the other hand, the rate at which heat flows into the segment across the cross section 
at x is proportional to the cross section and the gradient of the temperature at the cross 
section (Fourier's law of heat conduction): 


(8.1.2) 


where & denotes the thermal conductivity of the rod. The sign in Equation 8.1.2 indicates 
that heat flows in the direction of decreasing temperature. Similarly, the rate at which heat 
flows out of the segment through the cross section at x + Ax equals 


A Out + Az,t) 
Ox 


The difference between the amount of heat that flows in through the cross section at x and 
the amount of heat that flows out through the cross section at x+ Ax must equal the change 
in the heat content of the segment z < s € x + Az. Hence, by subtracting Equation 8.1.3 
from Equation 8.1.2 and equating the result to the time derivative of Equation 8.1.1, 


x] EE E E m n 


(8.1.3) 


(8.1.4) 


Assuming that the integrand in Equation 8.1.4 is a continuous function of s, then by the 
mean value theorem for integrals, 


z-d-Ax 
i 2 i) ds = ou ) Ag, u<€<a+Az, (8.1.5) 
so that Equation 8.1.4 becomes 
ae E OQu(r--FAx,t) ulz, t) l (8.1.6) 


Ot Ox Ox 
Dividing both sides of Equation 8.1.6 by cpAz and taking the limit as Ax — 0, 


Ou(r,t) 30? u(a, t) 
T E ae (8.1.7) 


with a? = &/(cp). Equation 8.1.7 is called the one-dimensional heat equation. The constant 
a? is called the diffusivity within the solid. 
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If an external source supplies heat to the rod at a rate f(x, t) per unit volume per unit 


time, we must add the term io 


E f (s, t) ds to the time derivative term of Equation 8.1.4. 
Thus, in the limit Az — 0, 


Qu(z, t) 2 0^ u(z, t) 
ð (Oa? 


= F(z,t), (8.1.8) 


where F(z,t) = f(x, t)/(cp) is the source density. This equation is called the nonhomoge- 
neous heat equation. 


8.2 INITIAL AND BOUNDARY CONDITIONS 


In the case of heat conduction in a thin rod, the temperature function u(x,t) must 
satisfy not only the heat equation, Equation 8.1.7, but also how the two ends of the rod 
exchange heat energy with the surrounding medium. If (1) there is no heat source, (2) the 
function f(x), 0 < x < L describes the temperature in the rod at t = 0, and (3) we maintain 
both ends at zero temperature for all time, then the partial differential equation 


Ou — Pu 

— = a —, O0<a<L, 0<t, 8.2.1 

Ot Ou? ( ) 
describes the temperature distribution u(x,t) in the rod at any later time 0 < t subject to 
the conditions 


u(z,0) = f(x), 0<a<L, (8.2.2) 


and 
u(0,t) = u(L,t) = 0, 0 « t. (8.2.3) 


Equations 8.2.1 and 8.2.3 describe the initial-boundary-value problem for this particular 
heat conduction problem; Equation 8.2.3 is the boundary condition while Equation 8.2.2 
gives the initial condition. Note that in the case of the heat equation, the problem only 
demands the initial value of u(x,t) and not u;(x,0), as with the wave equation. 

Historically most linear boundary conditions have been classified in one of three ways. 
The condition, Equation 8.2.3, is an example of a Dirichlet problem! or condition of the 
first kind. This type of boundary condition gives the value of the solution (which is not 
necessarily equal to zero) along a boundary. 

'The next simplest condition involves derivatives. If we insulate both ends of the rod so 
that no heat flows from the ends, then according to Equation 7.1.2 the boundary condition 
assumes the form 


QOu(0,t)  Ou(L,t) | 
Aa = ye -06  0«t (8.2.4) 


2 


This is an example of a Neumann problem^ or condition of the second kind. This type of 
boundary condition specifies the value of the normal derivative (which may not be equal to 
zero) of the solution along the boundary. 


1 Dirichlet, P. G. L., 1850: Über einen neuen Ausdruck zur Bestimmung der Dichtigkeit einer unendlich 
dünnen Kugelschale, wenn der Werth des Potentials derselben in jedem Punkte ihrer Oberfläche gegeben 
ist. Abh. Königlich. Preuss. Akad. Wiss., 99-116. 


? Neumann, C. G., 1877: Untersuchungen über das Logarithmische und Newton’sche Potential. 
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Finally, if there is radiation of heat from the ends of the rod into the surrounding 
medium, we shall show that the boundary condition is of the form 


au hu(0, t) = a constant, (8.2.5) 
Ox 
and Batic 
DuC) + hu(L,t) = another constant (8.2.6) 


for 0 < t, where h is a positive constant. This is an example of a condition of the third kind 
or Robin problem? and is a linear combination of Dirichlet and Neumann conditions. 


8.3 SEPARATION OF VARIABLES 


As with the wave equation, the most popular and widely used technique for solving 
the heat equation is separation of variables. Its success depends on our ability to express 
the solution u(x,t) as the product X (x)T(t). If we cannot achieve this separation, then the 
technique must be abandoned for others. In the following examples we show how to apply 
this technique. 


e Example 8.3.1 


Let us find the solution to the homogeneous heat equation 


Dag O<e< Ll, 0t, (8.3.1) 
which satisfies the initial condition 
u(r,0)— f(r), 0<x< L, (8.3.2) 
and the boundary conditions 
u(0,t) = u(L,t) 20, 0<t. (8.3.3) 


This system of equations models heat conduction in a thin metallic bar where both ends are 
held at the constant temperature of zero and the bar initially has the temperature f(x). 

We shall solve this problem by the method of separation of variables. Accordingly, we 
seek particular solutions of Equation 8.3.1 of the form 


u(z,t) = X(x)T(t), (8.3.4) 


which satisfy the boundary conditions, Equation 8.3.3. Because 


Ou j 
and 
0?u 
pe EU Te, (8.3.6) 


? Robin, G., 1886: Sur la distribution de l'électricité à la surface des conducteurs fermés et des conduc- 
teurs ouverts. Ann. Sci. l'Ecole Norm. Sup., Ser. 3, 3, S1-S58. 
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Equation 8.3.1 becomes 
T'(t) X (x) = a? X"(x)T(t). (8.3.7) 


Dividing both sides of Equation 8.3.7 by a? X (z)T(t) gives 
M C Md (8.3.8) 


where —A is the separation constant. Equation 8.3.8 immediately yields two ordinary dif- 
ferential equations: 
X" 4 AX — 0, (8.3.9) 


and 
T' +a AT =0 (8.3.10) 


for the functions X (x) and T(t), respectively. 

We now rewrite the boundary conditions in terms of X (x) by noting that the boundary 
conditions are u(0,t) = X(0)T(t) = 0, and u(L,t) = X(L)T(t) = 0 for 0 < t. If we 
were to choose T(t) = 0, then we would have a trivial solution for u(x,t). Consequently, 
X(0) = X(L) =0. 


We now solve Equation 8.3.9. There are three possible cases: A = —m?, A = 0, and 
A = k?. If A = —m? < 0, then we must solve the boundary-value problem 
X" — m?X — 0, X(0) = X(L) — 0. (8.3.11) 


The general solution to Equation 8.3.11 is 
X(x) = Acosh(mz) + B sinh(mz). (8.3.12) 


Because X(0) = 0, it follows that A — 0. The condition X(L) — 0 yields B sinh(mL) = 0. 
Since sinh(mL) Z 0, B = 0, and we have a trivial solution for A < 0. 
If \ = 0, the corresponding boundary-value problem is 


X" (x) 0,  X(0)2 X(L) -0. (8.3.13) 


The general solution is 
X(x) 2 C + Da. (8.3.14) 


From X (0) = 0, we have that C = 0. From X(L) —0, DL —0, or D = 0. Again, we obtain 
a trivial solution. 
Finally, we assume that À = k? > 0. The corresponding boundary-value problem is 
X" E K?X — 0, X(0) = X(L) =0. (8.3.15) 
'The general solution to Equation 8.3.15 is 


X(x) = Ecos(kx) + F sin(kz). (8.3.16) 


Because X(0) = 0, it follows that E = 0; from X(L) = 0, we obtain Fsin(kL) = 0. 
For a nontrivial solution, F Z 0 and sin(kL) = 0. This implies that k,L = nz, where 


n = 1,2,3,.... In summary, the z-dependence of the solution is 
X.(z) = Fy sin( Z2) , (8.3.17) 
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where A, = n?a?/ L7. 


Turning to the time dependence, we use A, = n?2?/L? in Equation 8.3.10 


qu E T; =0. (8.3.18) 
The corresponding general solution is 
T, (t) = Gn exp (- € ) (8.3.19) 
Thus, the functions 
un (z, t) = B, sin(^77) exp (- n t) pn =T; 2 3a (8.3.20) 


where B,, = F4G,, are particular solutions of Equation 8.3.1 and satisfy the homogeneous 
boundary conditions, Equation 8.3.3. 

As we noted in the case of the wave equation, we can solve the z-dependence equation 
as a regular Sturm-Liouville problem. After finding the eigenvalue A, and eigenfunction, 
we solve for Ta(t). The product solution u,(a,t) equals the product of the eigenfunction 
and Tn (t). 

Having found particular solutions to our problem, the most general solution equals a 
linear sum of these particular solutions: 


oo 2 
u(a@,t) = 5 Bn sin( 22) ew( s: = t) ; (8.3.21) 
n=1 


The coefficient Bn is chosen so that Equation 8.3.21 yields the initial condition, Equation 
8.3.2, if t = 0. Thus, setting t = 0 in Equation 8.3.21, we see from Equation 8.3.2 that the 
coefficients Bn must satisfy the relationship 


f(z) = ?» B, sin(77) , 0<g<L. (8.3.22) 


This is precisely a Fourier half-range sine series for f(x) on the interval (0, L). Therefore, 
the formula 


2 L 
Bale L 


gives the coefficients B,,. For example, if L = 7 and u(x,0) = x(a — x), then 


9 [7 z 2 f7 1- (—1)” 
Ban = =| x(n — x) sin(na) dx = 2 f xsin(nx) dx — E | z? sin(nz) dx = oc 
(8.3.24) 
Hence, 
8 Š sin[(2n — 1)z] —(2n—1)3a?t 
- n-1?aht 8.3.25 
u(x,t) a 5 (on — 18 € ( ) 
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1 
TIME ` 0 0 of 


Figure 8.3.1: The temperature u(x,t) within a thin bar as a function of position x and time a?t when we 
maintain both ends at zero and the initial temperature equals z(z — x). 


Figure 8.3.1 illustrates Equation 8.3.25 for various times. It was created using the 
MATLAB script 


clear 
M = 20; dx = pi/25; dt = 0.05; 
^ compute grid and initialize solution 
X = [0:dx:pi]; T = [0:dt:2]; 
u = zeros(length(T) , length (X)) ; 
XX = repmat(X,[length(T) 1]); TT = repmat(T’,[1 length(X)]); 
^ compute solution from Equation 8.3.25 
for m = 1:M 

tempi = 2*m-1; coeff = 8 / (pi * tempi * tempi * tempi); 

u = u + coeff * sin(templ*XX) .* exp(-tempi * tempi * TT); 
end 
surf (XX,TT,u) 
xlabel(’DISTANCE’ ,’Fontsize’,20); ylabel(’TIME’,’Fontsize’ ,20) 
zlabel(’?U(X,T)’,’Fontsize’ ,20) 
Note that both ends of the bar satisfy the boundary conditions, namely that the temperature 
equals zero. As time increases, heat flows out from the center of the bar to both ends where 
it is removed. This process is reflected in the collapse of the original parabolic shape of the 
temperature profile toward zero as time increases. 


e Example 8.3.2 


As a second example, let us solve the heat equation 
Ou 2 07u 
— =e, 
Ot Ox? 


which satisfies the initial condition 


O0<a<L, 0<t, (8.3.26) 


u(r,0 =z, O<a<L, (8.3.27) 
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and the boundary conditions 


Ou(0, t) 
Ox 


=u(L,t)=0, 0«t. (8.3.28) 


The condition u,(0,t) = 0 expresses mathematically the constraint that no heat flows 
through the left boundary (insulated end condition). 

Once again, we employ separation of variables; as in the previous example, the positive 
and zero separation constants yield trivial solutions. For a negative separation constant, 
however, 

X" BR X =0, (8.3.29) 


with 
X'(0) = X(L) — 0, (8.3.30) 

because u,(0,t) = X'(0)T(t) = 0, and u(L,t) = X(L)T(t) = 0. This regular Sturm- 

Liouville problem has the solution 

(2n — 1)rax 


X, (a) = cos 
(x) cos] JT 


| n=1,2,3,.... (8.3.31) 


The temporal solution then becomes 


(8.3.32) 


a?(2n 1r t 
Tut); Ba ep] ( - | : 


Consequently, a linear superposition of the particular solutions gives the total solution, 
which equals 


u(x,t) = > B, cos 8 = Rd ew dU J ; (8.3.33) 


2L AL? 


Our final task remains to find the coefficients Bn. Evaluating Equation 8.3.33 at t = 0, 


= 2n —1 
u(z,0) 2 3 B. cos Eres ; ^ OK<a<L. (8.3.34) 
n=1 


Equation 8.3.34 is not a half-range cosine expansion; it is an expansion in the orthog- 
onal functions cos[(2n — 1)za/(2L)| corresponding to the regular Sturm-Liouville problem, 
Equation 8.3.29 and Equation 8.3.30. Consequently, B, is given by Equation 6.3.4 with 
r(r)—1as 


oe fo’ x cos((2n — 1)ra/(2L)] de RS 
fo’ cos?[(2n — 1)mar/(2L)| dx 


AL? „| (2n—1) L 2L . | (2n-1)gz ||L 

(0 Cnr? cos| "2L ze] lo + Gn-1)r sin| 2L | lo sad 
7 z|L, L ; | (2n-1)zz | | E r ) 
2 li F 2n-1); 51 L la 


= es cos] | i} sn AE 8.3.37) 
8L AL(—1)" 


= 8.3.38 
(2n— 13? (Qn —1)r’ ) 
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Figure 8.3.2: The temperature u(x, t)/ L within a thin bar as a function of position z/L and time a?t/L? 
when we insulate the left end and hold the right end at the temperature of zero. The initial temperature 
equals zx. 


as cos((2n — 1)1/2] = 0, and sin[(2n — 1)1/2] = (—1)"*!. Consequently, the complete 
solution is 


HE 1S u$ t SF) | 2 | exp =e 


n=1 
(8.3.39) 


Figure 8.3.2 illustrates the evolution of the temperature field with time. It was gener- 
ated using the MATLAB script 


clear 
M = 200; dx = 0.02; dt = 0.05; 
% compute Fourier coefficients 
sign = -1; 
for m = 1:M 
tempi = 2*m-1; 
a(m) = 2/(pi*tempi*tempi) + sign/temp1; 
Sign = - sign; 
end 
% compute grid and initialize solution 
X = [0:dx:1]; T = [0:dt:1]; 
u = zeros(length(T) ,length(X)) ; 
XX = repmat(X,[length(T) 1]); 
TT = repmat(T?,[1 length(X)]); 
^ compute solution from Equation 8.3.39 
for m = 1:M 
tempi = (2*m-1)*pi/2; 
u = u + a(m) * cos(tempi*XX) .* exp(-temp1 * tempi * TT); 
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u-- (4/pi) * u; 

surf (XX,TT,u); axis([O 1 O 1 0 1]); 

xlabel(’DISTANCE’ ,’Fontsize’ ,20); ylabel(’TIME’, ’Fontsize’ ,20) 
zlabel(’SOLUTION’ , ’Fontsize’ , 20) 


Initially, heat near the center of the bar flows toward the cooler, insulated end, resulting in 
an increase of temperature there. On the right side, heat flows out of the bar because the 
temperature is maintained at zero at x = L. Eventually the heat that has accumulated at 
the left end flows rightward because of the continual heat loss on the right end. In the limit 
of t — oc, all of the heat has left the bar. 


e Example 8.3.3 


A slight variation on Example 8.3.1 is 


— =a, O0<a<L, 0<t, (8.3.40) 
T 


where 
u(x,0) = u(0,t)=0, and u(L,t)- 90. (8.3.41) 


We begin by blindly employing the technique of separation of variables. Once again, we 
obtain the ordinary differential equation, Equation 8.3.9 and Equation 8.3.10. The initial 
and boundary conditions become, however, 


X(0)=T(0)=0, (8.3.42) 
and 
X(L)T(t) — 0. (8.3.43) 


Although Equation 8.3.42 is acceptable, Equation 8.3.43 gives us an impossible condition 
because T(t) cannot be constant. If it were, it would have to equal to zero by Equation 
8.3.42. 

To find a way around this difficulty, suppose that we want the solution to our problem 
at a time long after t = 0. From experience we know that heat conduction with time- 
independent boundary conditions eventually results in an evolution from the initial condi- 
tion to some time-independent (steady-state) equilibrium. If we denote this steady-state 
solution by w(x), it must satisfy the heat equation 


a^w" (a) = 0, (8.3.44) 
and the boundary conditions 
w(0)=0, and w(L)=9. (8.3.45) 
We can integrate Equation 8.3.44 immediately to give 
w(x) = A + Ba, (8.3.46) 
and the boundary condition, Equation 8.3.45, results in 


_ 0x 


T (8.3.47) 


w(a) 
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Clearly Equation 8.3.47 cannot hope to satisfy the initial conditions; that was never 
expected of it. However, if we add a time-varying (transient) solution v(z,t) to w(x) so 
that 

u(x,t) = w(x) + v(z, t), (8.3.48) 


we could satisfy the initial condition if 
v(x, 0) = u(x, 0) — w(x), (8.3.49) 


and v(z, t) tends to zero as t — oo. Furthermore, because w"(x) = w(0) = 0, and w(L) = 9, 


Ov | 40?v 
— —a — L .3.50 
or^ Bg Oc«rc«L, 0<t, (8.3.50) 
with the boundary conditions 
v(0,t) 2v(L,t) 20, 0<t. (8.3.51) 


We can solve Equation 8.3.49, Equation 8.3.50, and Equation 8.3.51 by separation of vari- 
ables; we did it in Example 8.3.1. However, in place of f(x) we now have u(z, 0) — w(x), or 
—w(«) because u(x,0) = 0. Therefore, the solution v(z, t) is 


oo 2,,2,.2 
v(x, t) = 5 Bn sin( 25) ew( i: = t) (8.3.52) 
n=1 


with 


B„ = 2 —w(2) sin(“**) ins L i sin( 77) dx (8.3.53) 


L L L L L 
20| L? NTT rL nzzN]" 20 
= i 3 = (-1)"—. 3.54 
L2 ES sin( L ) n cos( L ) T (-1) nc G ) 
Thus, the entire solution is 
bx 204 (—1) , (nae an? 7? 
u(x,t) = L + E 5 » sin( 7 )ew( T2 tla (8.3.55) 


n=1 


The quantity a?t/ L? is the Fourier number. 
Figure 8.3.3 illustrates our solution and was created with the MATLAB script 


clear 
M = 1000; dx = 0.01; dt = 0.01; 
^4 compute grid and initialize solution 
X = [0:dx:1]; T = [0:dt:0.2]; 
XX = repmat(X,[length(T) 1]); TT = repmat(T’,[1 length(X)]); 
u = XX; 
^ compute solution from Equation 8.3.55 
sign = -2/pi; 
for m = 1:M 
coeff - sign/m; 
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Figure 8.3.3: The temperature u(z,t)/@ within a thin bar as a function of position z/L and time a?t/L? 
with the left end held at a temperature of zero and right end held at a temperature 0 while the initial 
temperature of the bar is zero. 


u = u + coeff * sin((m*pi)*XX) .* exp(-(n*m*pi*pi) * TT); 
sign - -sign; 
end 
surf(XX,TT,u); axis([O 1 O 0.2 0 1]); 
xlabel(’DISTANCE’ ,’Fontsize’ ,20); ylabel(C TIME? ,?Fontsize?,20) 
zlabel(’SOLUTION’ ,?Fontsize',20) 
Clearly it satisfies the boundary conditions. Initially, heat flows rapidly from right to left. 
As time increases, the rate of heat transfer decreases until the final equilibrium (steady- 
state) is established and no more heat flows. 


e Example 8.3.4 


Let us find the solution to the heat equation 

Ou 2 07u 
esp cm 

Ot Ox? 
subject to the Neumann boundary conditions 


du(0,t) — Ou(L,t) 


0<a<L, 0<t, (8.3.56) 


= =0 0<t 8.3.57 
Ox Ox : <i ( ) 
and the initial condition that 
u(z,0) = 2, O0<a<L. (8.3.58) 
We have now insulated both ends of the bar. 
Assuming that u(x,t) = X(«x)T(t), 
T" X" 
= =k? (8.3.59) 
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where we have presently assumed that the separation constant is negative. The Neumann 
conditions give u,(0,t) = X'(0)T(t) = 0, and u;(L,t) = X'(L)T(t) = 0 so that X'(0) = 
X'(L) = 0. 

'The Sturm-Liouville problem 


X" -k?X — 0, (8.3.60) 
and 
X'(0) 2 X'(L) 20 (8.3.61) 
gives the z-dependence. The eigenfunction solution is 


X,(2) = eos (272) (8.3.62) 
where kn = nz /L and n = 1,2,3,.... 
'The corresponding temporal part equals the solution of 


2,22 
T! +027, =T! + = T, = 0, (8.3.63) 
which is TNT 
afn r 
Thus, the product solution given by a negative separation constant is 
2,,2,.2 
us (z,t) = Xp(x)T,(t) = An cos (27) ew( 2 = t) : (8.3.65) 


Unlike our previous problems, there is a nontrivial solution for a separation constant 
that equals zero. In this instance, the z-dependence equals 


X(z) = Az + B. (8.3.66) 


The boundary conditions X'(0) = X'(L) = 0 force A to be zero but B is completely free. 
Consequently, the eigenfunction in this particular case is 


Xo(x) — 1. (8.3.67) 


Because Tj (t) = 0 in this case, the temporal part equals a constant that we shall take to be 
Ao/2. Therefore, the product solution corresponding to the zero separation constant is 


'The most general solution to our problem equals the sum of all of the possible solutions: 


2,42 


Ae. o 2 
u(z,t) = + 3 An eos (272) ew( 2 TE 7 (8.3.69) 


Upon substituting t = 0 into Equation 8.3.69, we can determine A,, because 


u(r,0)—r-— x + Y eos (277) (8.3.70) 
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Figure 8.3.4: The temperature u(z, t)/ L within a thin bar as a function of position z/L and time a?t/L? 


when we insulate both ends. The initial temperature of the bar is x. 
is merely a half-range Fourier cosine expansion of the function x over the interval (0, L). 


From Equation 5.1.22 and Equation 5.1.23, 


(8.3.71) 


(8.3.72) 


The complete solution is 


1 


because all of the even harmonics vanish and we may rewrite the odd harmonics using 


n — 2m — 1, where m — 1,2,3,4,.... 


Figure 8.3.4 illustrates Equation 8.3.73 for various positions and times. It was generated 


using the MATLAB script 


clear 


, 


0.01; 


100; dx = 0.01; dt 
^ compute grid and initialize solution 


X = [0:dx:1]; T 


M= 


[0:dt:0.3]; 


zeros(length(T),length(X)); u = 0.5; 
XX = repmat (X, [length(T) 1]); TT = repmat(T’,[1 length(X)]); 


^ compute solution from Equation 8.3.73 


for m 


u 


1:M 


(2*m-1) * pi; 


tempi 
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coeff = 4 / (tempi*tempi); 

u = u - coeff * cos(templ*XX) .* exp(-tempi * tempi * TT); 
end 
surf(XX,TT,u); axis([O 1 O 0.3 O 1]); 
xlabel(’DISTANCE’ ,’Fontsize’,20); ylabel(’TIME’, ’Fontsize’ ,20) 
zlabel(’SOLUTION’ , ’Fontsize’ , 20) 


The physical interpretation is quite simple. Since heat cannot flow in or out of the rod 
because of the insulation, it can only redistribute itself. Thus, heat flows from the warm 
right end to the cooler left end. Eventually the temperature achieves steady-state when the 
temperature is uniform throughout the bar. 


e Example 8.3.5 


So far we have dealt with problems where the temperature or flux of heat has been 
specified at the ends of the rod. In many physical applications, one or both of the ends 
may radiate to free space at temperature ug. According to Stefan’s law, the amount of heat 
radiated from a given area dA in a given time interval dt is 


c(u* — ug) dA dt, (8.3.74) 
where c is called the Stefan-Boltzmann constant. On the other hand, the amount of heat 


that reaches the surface from the interior of the body, assuming that we are at the right 
end of the bar, equals 


Qu 
—K-— dA dt 8.3.75 
non dA dt, (8.3.75) 
where & is the thermal conductivity. Because these quantities must be equal, 
" Ou _ 4 8A E 3 2 2 3 
i c(u^ — ug) = oa(u — uo) (u? + u^ug + uug + uj). (8.3.76) 
z 


If u and uo are nearly equal, we may approximate the second bracketed term on the right 
side of Equation 8.3.76 as 4ug. We write this approximate form of Equation 8.3.76 as 


——— = h(u — uo), (8.3.77) 


where h, the surface conductance or the coefficient of surface heat transfer, equals 4oug /k. 
Equation 8.3.77 is a “radiation” boundary condition. Sometimes someone will refer to it 
as “Newton’s law” because Equation 8.3.77 is mathematically identical to Newton’s law of 
cooling of a body by forced convection. 

Let us now solve the problem of a rod that we initially heat to the uniform temperature 
of 100. We then allow it to cool by maintaining the temperature at zero at x = 0 and 
radiatively cooling to the surrounding air at the temperature of zero’ at r = L. We may 
restate the problem as 


Ou — Fu 


— =a => L 3.7 
or ^ gym O0<a<L, 0<t, (8.3.78) 


^ Although this would appear to make h = 0, we have merely chosen a temperature scale so that the 
air temperature is zero and the absolute temperature used in Stefan's law is nonzero. 
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Table 8.3.1: The First Ten Roots of Equation 8.3.87 and C,, for hL = 1 


n Qn Approximate ay, Ch 
1 2.0288 2.2074 118.9221 
2 4.9132 4.9246 31.3414 
3 7.9787 7.9813 27.7549 
4 11.0855 11.0865 16.2891 
5 14.2074 14.2079 14.9916 
6 17.3364 17.3366 10.8362 
7 20.4692 20.4693 10.2232 
8 23.6043 23.6044 8.0999 
9 26.7409 26.7410 7.7479 
10 29.8786 29.8786 6.4626 
with 
u(z,0) = 100, O0<a<L, (8.3.79) 
u(0,t)=0, O<t, (8.3.80) 
and ur 
t 
out) +hu(L,t)=0, 0«t. (8.3.81) 
x 


Once again, we assume a product solution u(x,t) = X(x)T(t) with a negative separa- 


tion constant so that 
XU T" 


| Xm UE —k’. (8.3.82) 
We obtain for the x-dependence that 
X" +k’ X — 0, (8.3.83) 
but the boundary conditions are now 
X(0)=0, and X'(L)-4RAX(L)- OO. (8.3.84) 
'The most general solution of Equation 8.3.83 is 
X(x) = Acos(kx) + B sin(kz). (8.3.85) 


However, A = 0, because X(0) = 0. On the other hand, 
k cos(kL) + hsin(kL) = kL cos(kL) + hLsin(kL) = 0, (8.3.86) 


if B £0. The nondimensional number AL is the Biot number and depends completely upon 
the physical characteristics of the rod. 
In Chapter 6 we saw how to find the roots of the transcendental equation 


a+ hLtan(o) = 0, (8.3.87) 


where a = kL. Consequently, if a, is the nth root of Equation 8.3.87, then the eigenfunction 


is 
Xn(x) = sin(osz/L). (8.3.88) 


In Table 8.3.1, we list the first ten roots of Equation 8.3.87 for hL = 1. 
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In general, we must solve Equation 8.3.87 either numerically or graphically. If a is 
large, however, we can find approximate values? by noting that 


cot(a) = —hL/a z 0, (8.3.89) 
Or 
Qn = (2n — 1)x/2, (8.3.90) 
where n — 1,2,3,.... We can obtain a better approximation by setting 
Qn = (2n — 1)7/2 — en, (8.3.91) 


where en «& 1. Substituting into Equation 8.3.89, 
((2n — 1)x/2 — en] cot[((2n — 1)»/2 — en] + AL = 0. (8.3.92) 
We can simplify Equation 8.3.92 to 
€ + (2n — 1)men/2 + hL = 0, (8.3.93) 
because cot[(2n — 1)7/2 — 0| = tan(0), and tan(0) ~ 0 for 0 « 1. Solving for en, 


2hL 


ue (8.3.94) 


En Š — 


and 
_ (n — l)e 2hL 


ý 2 Qn — 1r adii 


In Table 8.3.1 we compare the approximate roots given by Equation 8.3.95 with the actual 
roots. 
'The temporal part equals 


a2 a?t 
Tn (t) = Cn exp(—k2a7t) = C, exp (- L ) : (8.3.96) 


Consequently, the general solution is 


oo ^ 24 
u(x,t) = 5 Cr sin( %2) ew( E ) à (8.3.97) 
n=1 


5 Using the same technique, Stevens and Luck [Stevens, J. W., and R. Luck, 1999: Explicit approxima- 
tions for all eigenvalues of the 1-D transient heat conduction equations. Heat Transfer Eng., 20(2), 35-41] 
have found approximate solutions to n tan(¢,) = Bi. They showed that 


—B + VB? — 4C 
Cn F3 Zn d ; 
2 
where 
B = zn + (1 + Bi) tan(za), C = Bi — zn tan(zn), 


"T = (2-9) : (» 3 
En = ! i = ——]m. 
"n 7 4 XBi- en á 4 
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Figure 8.3.5: The temperature u(x,t) within a thin bar as a function of position z/L and time a?t/L? 
when we allow the bar to radiatively cool at x = L while the temperature is zero at x = 0. Initially the 
temperature was 100. 


where a, is the nth root of Equation 8.3.87. 
To determine Cn, we use the initial condition, Equation 8.3.79, and find that 


100 — 3 C, sin( %7) . (8.3.98) 
n=1 


Equation 8.3.98 is an eigenfunction expansion of 100 employing the eigenfunctions from the 
Sturm-Liouville problem 


X" 4k? X — 0, (8.3.99) 
and 
X(0) = X'(L) + hX(L) — 0. (8.3.100) 
Thus, the coefficient Cn is given by Equation 6.3.4 or 
L ; 
100 nt/ L) d 
c, = Je : sneer (8.3.101) 
Jo. Sin? (anx/L) dx 


as r(x) = 1. Performing the integrations, 


100L[1 — cos(a,)|/an 200|1 — cos(o;,)] 


Cn = = 8.3.102 
$[L — Lsin(2an)/(2an)] — an[1 + cos? (an) /(AL)] ( ) 
because sin(2an) = 2cos(a,) sin(an), and a, = —hL tan(an). The complete solution is 
L 200[1 — cos(o;,)] NC. a2 a?t 
= z : .9.1 
vius 2. eee EO TA sin(^7^) exo( - 75 9209) 


Figure 8.3.5 illustrates this solution for hL = 1 at various times and positions. It was 
generated using the MATLAB script 
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clear 

hL = 1; M = 200; dx = 0.02; dt = 0.02; 
^ create initial guess at alpha n 

zero - zeros(M,1); 


for n - 1:M 
temp = (2*n-1)*pi; zero(n) = 0.5*temp + 2*hL/temp; 
end; 


% use Newton-Raphson method to improve values of alpha_n 
for n = 1:M; for k = 1:10 
f = zero(n) + hL * tan(zero(n)); fp =1 + hL * sec(zero(n))^2; 
zero(n) = zero(n) - f / fp; 
end; end; 
^4 compute Fourier coefficients 
for m = 1:M 
a(m) = 200*(1-cos(zero(m)))/(zero(m)*(1*cos(zero(m))^2/hL)); 
end 
^4 compute grid and initialize solution 
X [0:dx:1]; T = [0:dt:0.5]; 
u 7 zeros(length(T),length(X)); 
XX = repmat(X,[length(T) 11); 
TT = repmat(T’,[1 length(X)]); 
^ compute solution from Equation 8.3.103 
for m - 1:M 
u = u + a(m) * sin(zero(m)*XX) .* exp(-zero(m)*zero(m)*TT) ; 
end 
surf(XX,TT,u) 
xlabel(’DISTANCE’ ,’Fontsize’ ,20); ylabel(’TIME’, ’Fontsize’ ,20) 
zlabel(’U(X,T)’,’Fontsize’ ,20) 


It is similar to Example 8.3.1 in that the heat lost to the environment occurs either be- 
cause the temperature at an end is zero or because it radiates heat to space that has the 
temperature of zero. 


e Example 8.3.6: Refrigeration of apples 


Some decades ago, shiploads of apples, going from Australia to England, deteriorated 
from a disease called “brown heart,” which occurred under insufficient cooling conditions. 
Apples, when placed on shipboard, are usually warm and must be cooled to be carried in 
cold storage. They also generate heat by their respiration. It was suspected that this heat 
generation effectively counteracted the refrigeration of the apples, resulting in the “brown 
heart.” 

This was the problem that induced Awberry? to study the heat distribution within a 
sphere in which heat is being generated. Awberry first assumed that the apples are initially 
at a uniform temperature. We can take this temperature to be zero by the appropriate 
choice of temperature scale. At time t = 0, the skins of the apples assume the temperature 
0 immediately when we introduce them into the hold. 


6 Awberry, J. H., 1927: The flow of heat in a body generating heat. Philos. Mag., Ser. 7, 4, 629-638. 
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Because of the spherical geometry, the nonhomogeneous heat equation becomes 


1 Ou zx") G 
+ 


a?0t r2 Or d Or 


, O<r<b, Oct, (8.3.104) 
K 


where a? is the thermal diffusivity, b is the radius of the apple, & is the thermal conductivity, 
and G is the heating rate (per unit time per unit volume). 

If we try to use separation of variables on Equation 8.3.104, we find that it does not 
work because of the G/« term. To circumvent this difficulty, we ask the simpler question 
of what happens after a very long time. We anticipate that a balance will eventually be 
established where conduction transports the heat produced within the apple to the surface 
of the apple where the surroundings absorb it. Consequently, just as we introduced a steady- 
state solution in Example 8.3.3, we again anticipate a steady-state solution w(r) where the 
heat conduction removes the heat generated within the apples. The ordinary differential 


equation 
ld 
(eS) gal (8.3.105) 


r2 dr dr K 


gives the steady-state. Furthermore, just as we introduced a transient solution that allowed 
our solution to satisfy the initial condition, we must also have one here, and the governing 
equation is 


Ov a? 0 (f 40v 
= . .3.10 
Ot rr? Or ¢ 5) s ©) 
Solving Equation 8.3.106 first, 
D Gr? 


The constant D equals zero because the solution must be finite at r = 0. Since the steady- 
state solution must satisfy the boundary condition w(b) = 0, 
Gb? 


=6+——. 8.3.1 
C-64 Z (8.3.108) 


Turning to the transient problem, we introduce a new dependent variable y(r,t) = 
rv(r,t). This new dependent variable allows us to replace Equation 8.3.106 with 
Oy _ 307y 
= =O aS 8.3.109 
0t ^ Ar?’ 
which we can solve. If we assume that y(r,t) = R(r)T(t) and we only have a negative 
separation constant, the R(r) equation becomes 


dR 
— 4 -?R = 3. 
d +k°R i (8.3.110) 
which has the solution 
R(r) = Acos(kr) + Bsin(kr). (8.3.111) 


The constant A equals zero because the solution, Equation 8.3.111, must vanish at r = 0 
so that v(0,t) remains finite. However, because 0 = w(b) + v(b, t) for all time and v(b, t) = 
R(b)T(t)/b = 0, then R(b) = 0. Consequently, kn = n7/b, and 


Bn . (mar n?g?a?t 
vs (r,t) = — sin( =) exp(——3u ] - (8.3.112) 


r 
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Superposition gives the total solution, which equals 


oo Bn 2.2 21 
id oq E gio yc sin( 77) ew( B = ) (8.3.113) 


Finally, we determine the coefficients B, by the initial condition that u(r,0) = 0. 
Therefore, 


b 3 
By, = = T L + Gop m r?) sin(“**) dr = a 1)” H zn ( : ) ( 1)”. 
0 f b 


b 6 nT K Nnm 

(8.3.114) 
'The complete solution is 
20b — (—1)* nur n?n att 
t)=04 i ( ) 8.3.115 

u(r, t) - > | Sint = ew( m ( ) 

Gre a 2600 eM (-1* , rmar n?n at 

l miu s ren n3 sin( b ) bic p? ` 


n=l 


The first line of Equation 8.3.115 gives the temperature distribution due to the imposition 
of the temperature 0 on the surface of the apple while the second line gives the rise in the 
temperature due to the interior heating. 

Returning to our original problem of whether the interior heating is strong enough to 
counteract the cooling by refrigeration, we merely use the second line of Equation 8.3.115 
to find how much the temperature deviates from what we normally expect. Because the 
highest temperature exists at the center of each apple, its value there is the only one of 
interest in this problem. Assuming b = 4 cm as the radius of the apple, a?G/k = 1.33 x 1075 
°C/s, and a? = 1.55 x 107? cm?/s, the temperature effect of the heat generation is very 
small, only 0.0232 °C when, after about 2 hours, the temperatures within the apples reach 
equilibrium. Thus, we must conclude that heat generation within the apples is not the 
cause of brown heart. 

We now know that brown heart results from an excessive concentration of carbon diox- 
ide and an insufficient amount of oxygen in the storage hold.’ Presumably this atmosphere 
affects the metabolic activities that are occurring in the apple? and leads to low-temperature 
breakdown. 


e Example 8.3.7 


In this example we illustrate how separation of variables can be employed in solving 
the axisymmetric heat equation in an infinitely long cylinder. In circular coordinates the 
heat equation is 
ðu — E (55 1 ðu 


2 EE Lx) O<r<b, 0<t, (8.3.116) 


T Thornton, N. C., 1931: The effect of carbon dioxide on fruits and vegetables in storage. Contrib. 
Boyce Thompson Inst., 3, 219—244. 


8 Fidler, J. C., and C. J. North, 1968: The effect of conditions of storage on the respiration of apples. 
IV. Changes in concentration of possible substrates of respiration, as related to production of carbon dioxide 
and uptake of oxygen by apples at low temperatures. J. Hortic. Sci., A3, 429—439. 


358 Advanced Engineering Mathematics with MATLAB 


where r denotes the radial distance and a? denotes the thermal diffusivity. Let us assume 

that we heated this cylinder of radius b to the uniform temperature Tọ and then allowed it 

to cool by having its surface held at the temperature of zero starting from the time t — 0. 
We begin by assuming that the solution is of the form u(r,t) = R(r)T(t) so that 


1 /d?R 1dR 1 dT k? 
R (5s p 2) adt b D 
The only values of the separation constant that yield nontrivial solutions are negative. The 
nontrivial solutions are R(r) = Jo(kr/b), where Jo is the Bessel function of the first kind 
and zeroth order. A separation constant of zero gives R(r) — In(r), which becomes infinite 
at the origin. Positive separation constants yield the modified Bessel function lo(kr/b). 
Although this function is finite at the origin, it cannot satisfy the boundary condition that 
u(b,t) = R(b)T(t) = 0, or R(b) = 0. 
The boundary condition that R(b) = 0 requires that Jo(k) = 0. This transcendental 
equation yields an infinite number of constants kn. For each kn, the temporal part of the 
solution satisfies the differential equation 


—— a 3.11 
à ag -65 (8.3.118) 
which has the solution 
kra 

Ta (t) = An exp (- m 7 ; (8.3.119) 

Consequently, the product solutions are 

m 2 

us(r,t) = AnJo( kn ) exp (55) l (8.3.120) 


The total solution is a linear superposition of all of the particular solutions or 
u(r,t) = Y Av (inr) ew (- 0) (8.3.121) 
Our final task remains to determine Apn. From the initial condition that u(r,0) = To, 
u(r,0) = Ty = 2 An do(Fu ) (8.3.122) 


From Equation 6.5.38 and Equation 6.5.45, 


oT, p r 2T» knr r\ jè Ou 
Ame os | wd dnm peo ae | - 8.3.123 
cU É o( "i "= KR) ( b ) i( ;) hae T 


from Equation 6.5.28. Thus, the complete solution is 


2 il T k2a? 
t) = 2T( Jol kn nj. 8.3.124 
uet = al *) exp( : ) (8.3.124) 
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Figure 8.3.6: The temperature u(r,t)/To within an infinitely long cylinder at various positions r/b and 
times a?t/b? that we initially heated to the uniform temperature To and then allowed to cool by forcing its 
surface to equal zero. 


Figure 8.3.6 illustrates the solution, Equation 8.3.124, for various Fourier numbers a?t/b?. 
It was generated using the MATLAB script 


clear 
M = 20; dr = 0.02; dt = 0.02; 
% load in zeros of J.0 
zero( 1) = 2.40482; zero( 2) = 5.52007; zero( 3) = 8.65372; 
zero( 4) = 11.79153; zero( 5) = 14.93091; zero( 6) = 18.07106; 
zero( 7) = 21.21164; zero( 8) = 24.35247; zero( 9) = 27.49347; 
zero(10) = 30.63461; zero(11) = 33.77582; zero(12) = 36.91710; 
zero(13) = 40.05843; zero(14) = 43.19979; zero(15) = 46.34119; 
zero(16) = 49.48261; zero(17) = 52.62405; zero(18) = 55.76551; 
zero(19) = 58.90698; zero(20) = 62.04847; 
% compute Fourier coefficients 
for m= 1:M 

a(m) = 2 / (zero(m)*besselj(1,zero(m))); 
end 
^ compute grid and initialize solution 
R = [0:dr:1]; T = [0:dt:0.5]; 
u = zeros(length(T) , length (R)) ; 
RR = repmat(R,[length(T) 11); 
TT = repmat(T?,[1 length(R)]); 
^ compute solution from Equation 8.3.124 
for m= 1:M 

u = u + a(m)*besselj(0,zero(m)*RR).*exp(-zero(m)*zero(m)*TT); 
end 
surf (RR,TT,u) 
xlabel(’R’,’Fontsize’,20); ylabel(’TIME’,’Fontsize’ ,20) 
zlabel(’SOLUTION’ , ’Fontsize’ ,20) 
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It is similar to Example 8.3.1 except that we are in cylindrical coordinates. Heat flows from 
the interior and is removed at the cylinder's surface where the temperature equals zero. The 
initial oscillations of the solution result from Gibbs phenomena because we have a jump in 
the temperature field at r — b. 


e Example 8.3.8 


In this example? we find the evolution of the temperature field within a cylinder of 
radius b as it radiatively cools from an initial uniform temperature Tọ. The heat equation 
is 
Ou o/u | 10u 
— = —+-—], 0<r<b, 0<t, 8.3.125 
8t ^ (5: r Or 22 ( ) 


which we will solve by separation of variables u(r,t) = R(r)T(t). Therefore, 


1/d? ld 1 dT k? 
€ d 3 = — (8.3.126) 


R\dr? rdrj Td db?’ 


because only a negative separation constant yields an R(r) that is finite at the origin and 
satisfies the boundary condition. This solution is R(r) = Jo(kr/b), where Jo is the Bessel 
function of the first kind and zeroth order. 

The radiative boundary condition can be expressed as 


a ) + hu(b,t) = T(t) [S + nn) =0. (8.3.127) 
Because T(t) 4 0, 
T kJo(k) + hbJo(k) = —kJ3(k) + hbJo(k) = 0, (8.3.128) 


where the product hb is the Biot number. The solution of the transcendental equation, 
Equation 8.3.128, yields an infinite number of distinct constants kn. For each kn, the 
temporal part equals the solution of 


dT, ka? 
SOY ume qe es 8.3.129 
d 5» ) 
or 
k2a? 
Ta (t) = An exp (- m t) : (8.3.130) 
'The product solution is, therefore, 
k2 2 
Un(r,t) = AnJo (ka) exp (- n t) (8.3.131) 


and the most general solution is a sum of these product solutions 


oo k2 2 
u(r,t) = V AnJo (a) exp (- na 7 (8.3.132) 
n=l 


9 For another example of solving the heat equation with Robin boundary conditions, see Section 3.2 in 
Balakotaiah, V., N. Gupta, and D. H. West, 2000: A simplified model for analyzing catalytic reactions in 
short monoliths. Chem. Eng. Sci., 55, 5367—5383. 
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Figure 8.3.7: The temperature u(r,t)/To within an infinitely long cylinder at various positions r/b and 
times a?t/b? that we initially heated to the temperature To and then allowed to radiatively cool with hb = 1. 


Finally, we must determine An. From the initial condition that u(r, 0) = To, 


2 T 
u(r,0) = Ty = 3 AnJo( kn) (8.3.133) 
where 
2k2Tp r 2To knr r\ jè 
An, = n kn- = "ni 

PIE E re! ro(ku) dr [KZ + ORI) ( b ) (raz), 
(8.3.134) 
2k, To Ji (kn) 2k, To Ji (kn) 2k, To Ji (kn) (js 

O [k3 + BRT (kn) — KJ (kn) + B2hZ TG (kn) RIG (kn) + RR IT (kn) ` 

= 2ToJ (kn) (8.3.136) 


Kin |J (kn) + J? (kn)l? 


which follows from Equation 6.5.28, Equation 6.5.38, Equation 6.5.45, and Equation 8.3.128. 
Consequently, the complete solution is 


= J1 (Kn, r kia? 
u(r,t) = 27 ZU TRO Jo (kn) ew( m 7 . (8.3.1377) 


Figure 8.3.7 illustrates the solution, Equation 8.3.137, for various Fourier numbers 
a?t/b? with hb = 1. It was created using the MATLAB script 
clear 
hb = 1; m=0; M = 100; dr = 0.02; dt = 0.02; 
% find km which satisfies hb J_0(k) = k J_1(k) 
for n = 1:10000 
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ki = 0.05*n; k2 = 0.05*(n*1) ; 
yl = hb * besselj(O,k1) - ki * besselj(1,k1); 
y2 = hb * besselj(0,k2) - k2 * besselj(1,k2); 


if yl*y2 <= 0; m = m*i; zero(m) = ki; end; 
end; 
^ use Newton-Raphson method to improve values of kn 
for n = 1:M; for k = 1:5 
term0 = besselj(0,zero(n)); 
termi = besselj(1,zero(n)); 
term2 = besselj(2,zero(n)); 
f = hb * termO - zero(n) * termi; 
fp = O.5*zero(n)*(term2-termO) - (1*hb)*termi; 
zero(n) = zero(n) - f / fp; 
end; end; 
^4 compute Fourier coefficients 
for m= 1:M 
denom = zero(m)*(besselj(0,zero(m))^2*besselj(1,zero(m))^2); 
a(m) = 2 * besselj(1,zero(m)) / denom; 
end 


^ compute grid and initialize solution 
R = [0:dr:1]; T = [0:dt:0.5]; 
u = zeros(length(T) ,length(R)); 
RR = repmat(R,[length(T) 1]); 
TT = repmat(T’,[1 length(R)]); 
% compute solution from Equation 8.3.137 
for m= 1:M 
u = u + a(m)*besselj(0,zero(m) *RR) . *exp(-zero(m)*zero(m)*TT) ; 
end 
surf (RR,TT,u) 
xlabel(’R’,’Fontsize’,20); ylabel(’TIME’, ’Fontsize’ ,20) 
zlabel(’SOLUTION’ , ’Fontsize’ ,20) 


These results are similar to Example 8.3.5 except that we are in cylindrical coordinates. 
Heat flows from the interior and is removed at the cylinder’s surface where it radiates to 
space at the temperature zero. Note that we do not suffer from Gibbs phenomena in this 
case because there is no initial jump in the temperature distribution. 


e Example 8.3.9: Temperature within an electrical cable 


In the design of cable installations we need the temperature reached within an elec- 
trical cable as a function of current and other parameters. To this end,!° let us solve 
the nonhomogeneous heat equation in cylindrical coordinates with a radiation boundary 
condition. 

The derivation of the heat equation follows from the conservation of energy: 


heat generated = heat dissipated + heat stored, 


10 Iskenderian, H. P., and W. J. Horvath, 1946: Determination of the temperature rise and the maximum 
safe current through multiconductor electric cables. J. Appl. Phys., 17, 255-262. 
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or 


Ou 
Or 


I? RN dt = —& [ar 


5 dt + 21r Arcp du, (8.3.138) 
» 


T 


m 


where J is the current through each wire, R is the resistance of each conductor, N is the 
number of conductors in the shell between radii r and r + Ar = 2rmrAr/(rb), b is the 
radius of the cable, m is the total number of conductors in the cable, & is the thermal 
conductivity, p is the density, c is the average specific heat, and u is the temperature. In 
the limit of Ar — 0, Equation 8.3.138 becomes 


ES "Or 


Ou a? ð Ou 
Ot r Or 


) , O<r<db, 0<t, (8.3.139) 


where A = I? Rm/(xU?cp), and a? = &/(pc). 

Equation 8.3.139 is the nonhomogeneous heat equation for an infinitely long, axisym- 
metric cylinder. From Example 8.3.3, we know that we must write the temperature as the 
sum of a steady-state and transient solution: u(r,t) = w(r) + v(r,t). The steady-state 
solution w(r) satisfies 


ld f dw A 
= 8.3.140 
r dr (r dr ) a?’ ( ) 
or re 
w(r) = T, - ——, (8.3.141) 
4a? 
where Te is the (yet unknown) temperature in the center of the cable. 
The transient solution v(r,t) is governed by 
Ov 21 0 f Ov 
= < b t 8.3.142 
ai ex) 0<r<b, 0<t, (8.3 ) 


with the initial condition that u(r,0) = T. — Ar?/(4a?) + v(0,t) = 0. At the surface r = b, 
heat radiates to free space so that the boundary condition is urp = —hu, where h is the 
surface conductance. Because the temperature equals the steady-state solution when all 
transient effects die away, w(r) must satisfy this radiation boundary condition regardless of 
the transient solution. This requires that 


A(R b 
T; = 5 (t 4 +) ; (8.3.143) 


Therefore, v(r, t) must satisfy v, (b, t) = —hv(b,t) at r = b. 
We find the transient solution v(r,t) by separation of variables v(r,t) = R(r)T(t). 
Substituting into Equation 8.3.142, 


1 dí dR 1 dT 4 
= = 8.3.144 
rR dr (« dr ) a?T dt S ( ) 
m d ( dR 
x ANUS 2 = 
2 ¢ =) + k*rR =0, (8.3.145) 
and " 
— -- k^a?T — 0, (8.3.146) 


dt 
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with R'(b) = —hR(b). The only solution of Equation 8.3.145 that remains finite at r = 0 and 
satisfies the boundary condition is R(r) — Jo(kr), where Jo is the zero-order Bessel function 
of the first kind. Substituting Jo(kr) into the boundary condition, the transcendental 
equation is 

kbJ4 (kb) — hbJo(kb) = 0. (8.3.147) 


For a given value of h and b, Equation 8.3.147 yields an infinite number of unique zeros kpn. 
The corresponding temporal solution to the problem is 


Ta (t) = An exp(—a7k?#), (8.3.148) 


so that the sum of the product solutions is 


u(r, t) = a AnJo(knr) exp(—a?k2t). (8.3.149) 


n=1 
Our final task remains to compute An. By evaluating Equation 8.3.149 at t = 0, 


Ar? = 
u(r, 0)= PT. 2, Andr) (8.3.150) 
which is a Fourier-Bessel series in Jo(k,r). In Section 6.5 we showed how to compute the 
coefficient of a Fourier-Bessel series that is expanded in Jo(k,r) and that has a boundary 


condition of the form given Equation 8.3.147. Applying those results here, we have that 


D 


2k2 ^ (Ar? 
n = (k2b2 + h202)J2 (knb) l r (as Fi z) Jo(knr) dr (8.3.151) 


from Equation 6.5.38 and Equation 6.5.47. Carrying out the indicated integrations, 


2 Aknb A Tek, A 
A, = TETON ( ead EU ) 4060) 5,3 Jo(knb)). (83.152) 
We obtained Equation 8.3.152 by using Equation 6.5.28 and integrating by parts as shown 
in Example 6.5.5. 

To illustrate this solution, let us compute it for the typical parameters b = 4 cm, hb = 1, 
a? = 1.14 cm?/s, A = 2.2747 °C/s, and T, = 23.94°C. The value of A corresponds to 37 
wires of #6 AWG copper wire within a cable carrying a current of 22 amp. 

Figure 8.3.8 illustrates the solution as a function of radius at various times. It was 
created using the MATLAB script 


clear 
asq = 1.14; A = 2.2747; b = 4; dr = 0.02; dt = 0.02; 
hb = 1; m0; M = 10; T.c = 23.94; 
consti = A * b * b / (4 * asq); const2 = A * b * b / asq; 
const3 = A * b * b / (2 * asq); 
% find k nb which satisfies hb J O(kb) = kb J 1(kb) 
for n = 1:10000 
ki = 0.05*n; k2 = 0.05*(n*1) ; 
yl = hb * besselj(O,k1) - ki * besselj(1,k1); 
y2 = hb * besselj(0,k2) - k2 * besselj(1,k2); 
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Figure 8.3.8: The temperature field (in degrees Celsius) within an electric copper cable containing 37 
wires and a current of 22 amperes at various positions r/b and times a?t/b?. Initially the temperature was 
zero and then we allow the cable to cool radiatively as it is heated. The parameters are hb — 1 and the 
radius of the cable b — 4 cm. 


if yl*y2 <= 0; m = mti; zero(m) = ki; end; 
end; 
^4 use Newton-Raphson method to improve values of kn 
for n = 1:M; for k = 1:5 
termO = besselj(0,zero(n)); 
termi = besselj(1,zero(n)); 
term2 = besselj(2,zero(n)); 
f = hb * termO - zero(n) * termi; 
fp = O.5*zero(n)*(term2-termO) - (1+hb)*term1; 
zero(n) = zero(n) - f / fp; 
end; end; 
for m= 1:M 
denom = (zero(m)*zero(m)-*hb*hb)*besselj(0,zero(m))^2; 
a(m) ((consti-T_c)*zero(m) ... 
const2/zero(m))*besselj(1,zero(m)) ... 
+ const3 * besselj(0,zero(m)); 
2 * a(m) / denon; 


a(m) 
end 
^ compute grid and initialize solution 


R = [0:dr:1]; T = [0:dt:2]; 
u = T.c * ones(length(T) ,length(R)) ; 
RR = repmat(R,[length(T) 1]); 


TT = repmat(T’,[1 length(R)]); 
% compute solution u(r,t) = w(r) + v(r,t) 
u = u - consti * RR .* RR; 
for m 1:M 
u = u + a(m)*besselj(0,zero(m)*RR).*exp(-zero(m)*zero(m)*TT); 
end 
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surf(RR,TT,u); axis([O 1 0 2 O 25]; 
xlabel(’R’,’Fontsize’,20); ylabel(’TIME’, ’Fontsize’ ,20) 
zlabel(’ TEMPERATURE, ^M circ C’,’Fontsize’ ,20) 


From an initial temperature of zero, the temperature rises due to the constant electrical 
heating. After a short period of time, it reaches its steady-state distribution given by 
Equation 8.3.140. The cable is coolest at the surface where heat is radiating away. Heat 
flows from the interior to replace the heat lost by radiation. 

Problems 
For Problems 1-5, solve the heat equation u, = a?u44, 0 < £ < m, 0 < t, subject to the 
boundary conditions that u(0,t) = u(z,t) = 0, 0 < t, and the following initial conditions 
for 0 « x « v. Then plot your results using MATLAB. 
1. u(x,0) = A, a constant 
2. u(z,0) = sin?(z) = [3sin(z) — sin(32)]/4 
3. u(z,0) — x 


4. u(z,0) =T- 


s De 0<a<7/2, 
í neds n/2«m«m. 


For Problems 6-10, solve the heat equation u; = a?u44, 0 < x < v, 0 < t, subject to the 
boundary conditions that u,(0,t) = us(z,t) = 0, 0 < t, and the following initial conditions 
for 0 « x « v. Then plot your results using MATLAB. 

6. u(z,0) 21 

7T. u(z,0) 2 x 

8. u(z, 0) = cos? (x) = [1 + cos(22)]/2 

9. u(z,0) =T- T 


_ f To, 0<a<7/2, 
Bs u(2,0) = {9 7/2« v «m. 


For Problems 11-17, solve the heat equation uz = a2u44, 0 « x « m, 0 « t, subject to the 
following boundary conditions and initial condition. Then plot your results using MATLAB. 


11. u4(0,2) = u(r, t) = 0, 0 < t; u(z,0) 2a72—m7,0« m «m 
12. u(0,t) = u(n, t) = To, O < t; u(z,0 = Tı Z Tog, 0€ mr «m 
13. u(0,t) 20, us(T, t) 20,0«t;u(z,0) =1,0 <£ <T 


14. u(0,t) 20, u (7,0) = 0, 0 < t; u(x, 0) =z, 0 << T 
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15. u(0,t) 20, us(T, t) = 0, O < t; u(z,0) =m- z, 0<T<T 
16. u(0,t) = To, u.(7,t) = 0, 0 < t; u(z,0) = T1 ZZ Tog, 0< E< T 
17. u(0,t) = 0, u(r, t) = To, O < t; u(x, 0) = To, 0< £ < T 


18. It is well known that a room with masonry walls is often very difficult to heat. Consider 
a wall of thickness L, conductivity &, and diffusivity a?, which we heat by a surface heat 
flux at a constant rate H. The temperature of the outside (out-of-doors) face of the wall 
remains constant at Tọ and the entire wall initially has the uniform temperature 7o. Let us 
find the temperature of the inside face as a function of time.!! 

We begin by solving the heat conduction problem 


o 9? 
Bp? ag uud. ed 
subject to the boundary conditions that 
t H 
Du) PE and u(L, t) = To, 
z K 


and the initial condition that u(x,0) = To. Show that the temperature field equals 


HL r 8 1 (2n — l)re (2n — 1?z?a?t 
t) = To 4 1 i 
d n RT { L m 2. (2n — 1? os| 773 | | AD? 


Therefore, the rise of temperature at the interior wall z = 0 is 
HL 8 2 1 (2n — 1)?z?a?t 
1 
k { T? À (2n - 12 exp] AD? 


8H L 4 1 ; (2n — 1?z?a?t 
kr? 2 Qun-ly P AD? ] 


Or 


For a?t/L? < 1, this last expression can be approximated!? by 2Hat!/?/ «1/?&, We thus 
see that the temperature will initially rise as the square root of time and diffusivity and 


11 See Dufton, A. F., 1927: The warming of walls. Philos. Mag., Ser. 7, 4, 888-889. 


12 Let us define the function: 


oo 


_ 1 — exp[- (2n — 1)?z2a?t/L?] 
f(t) »» onc pa 
Then = 
a?r? < 
f'(t)- I exp[- (2n — 1)?z?2a?t/17]. 
n=1 


Consider now the integral 


o a? nt a\ 4 L 
exp x g= : 
: L2 avant 
If we approximate this integral by using the trapezoidal rule with Az = 2, then 
t 
/ exp (- Ss 2 dx = 2 J exp[- (2n — 1)?z2a?t/ 1], 
o n=1 


and f'(t) zz an9/? /(4Lt!/?). Integrating and using f(0) = 0, we finally have f(t) zz am3/2t1/2/(2L). The 
smaller a?t/ L? is, the smaller the error will be. For example, if t — L? /a?, then the error is 2.4%. 
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inversely with conductivity. For an average rock, x = 0.0042 g/cm-s, and a? = 0.0118 
cm?/s, while for wood (spruce) & = 0.0003 g/cm-s, and a? = 0.0024 cm? /s. 

'The same set of equations applies to heat transfer within a transistor operating at low 
frequencies.!? At the junction (x = 0) heat is produced at the rate of H and flows to the 
transistor's supports (2 = +L) where it is removed. The supports are maintained at the 
temperature To, which is also the initial temperature of the transistor. 


19. The linearized Boussinesq equation!^ 


2 

t oe O<a<L, 0<t, 
governs the height of the water table u(x,t) above some reference point, where a? is the 
product of the storage coefficient times the hydraulic coefficient divided by the aquifer 
thickness. A typical value of a? is 10 m?/min. Consider the problem of a strip of land of 
width L that separates two reservoirs of depth hı. Initially the height of the water table 
would be hı. Suddenly we lower the reservoir on the right x = L to a depth hg [u(0, t) = hı, 
u(L,t) = ho, and u(x, 0) = hi]. Find the height of the water table at any position x within 
the aquifer and any time t > 0. 


20. The equation (see Problem 19) 


Ou Ou 

—= 1.5 0 L, 0<t 

at "E «r«L, <t, 
governs the height of the water table u(x,t). Consider the problem!” of a piece of land that 
suddenly has two drains placed at the points x = 0 and x = L so that u(0,t) = u(L,t) = 0. 
If the water table initially has the profile u(z, 0) = 8H (L°a — 3L?z? -- ALa? — 21*)/ L^, find 
the height of the water table at any point within the aquifer and any time t > 0. 


21. We want to find the rise of the water table of an aquifer, which we sandwich between a 
canal and impervious rocks if we suddenly raise the water level in the canal hg units above 
its initial elevation and then maintain the canal at this level. The linearized Boussinesq 
equation (see Problem 19) 


ðu Pu 
Ot Ox?’ 
governs the level of the water table with the boundary conditions u(0, t) = ho, and u4(L,t) = 


0, and the initial condition u(x,0) = 0. Find the height of the water table at any point in 
the aquifer and any time t > 0. 


O0<a<L, 0<t, 


22. Solve the nonhomogeneous heat equation 


=z =e”, 0cmc«m, Oct, 


13 Mortenson, K. E., 1957: Transistor junction temperature as a function of time. Proc. IRE, 45, 
504-513. Equation 2a should read Ty = —F/k. 


14 See, for example, Van Schilfgaarde, J., 1970: Theory of flow to drains. Advances in Hydroscience, No. 
6, Academic Press, 81-85. 


15 For a similar problem, see Dumm, L. D., 1954: New formula for determining depth and spacing of 
subsurface drains in irrigated lands. Agric. Eng., 35, 726-730. 
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subject to the boundary conditions u(0,t) = u,(m,t) = 0, 0 < t, and the initial condition 
u(r,0)— f(x), 0« x «m. 


23. Solve the nonhomogeneous heat equation 


2 
ðu Fu — i 


BE BS ; O0<a<l, 0<t, 


subject to the boundary conditions u,(0,t) = uz(1,t) = 0, 0 < t, and the initial condition 
u(x, 0) = 2(1—2?), 0 <a < 1. Hint: Note that any function of time satisfies the boundary 
conditions. 


24. Solve the nonhomogeneous heat equation 
u 
— — a = Acos(wt), 0cr«m, Oct, 


subject to the boundary conditions u,(0,t) = uz(7,t) = 0, 0 < t, and the initial condition 
u(r,0)— f(x), 0« x « v. Hint: Note that any function of time satisfies the boundary 
conditions. 


25. Solve the nonhomogeneous heat equation 


à 9? « 
i x:-| i ears. O<a<n, Oct, 


ðt ðr? \r-r, m/2xm«m, 


subject to the boundary conditions u(0,t) = u(z,t) = 0, 0 < t, and the initial condition 
u(z,0)—0,0« x «m. Hint: Represent the forcing function as a half-range Fourier sine 
expansion over the interval (0, 7). 
26. A uniform conducting rod of length L and thermometric diffusivity a? is initially at 
temperature zero. We supply heat uniformly throughout the rod so that the heat conduction 
equation is 
20u ðu 

| ax Ot 
where P is the rate at which the temperature would rise if there was no conduction. If 
we maintain the ends of the rod at the temperature of zero, find the temperature at any 
position and subsequent time. How would the solution change if the boundary conditions 
became u(0,t) = u(L,t) = A # 0, 0 < t, and the initial conditions read u(x,0) = A, 
0<a<L? 


D O0<a<L, 0<t, 


27. Solve the nonhomogeneous heat equation 


Ao 
= pon 0 L, 0<t, 
8t a 3x2 e «xx < 


where a? = &/cp, with the boundary conditions that 


Ou(0,t) - Ou(L, t) u 
EE 0, k x +hu(L,t)=0, O«t, 


and the initial condition that u(x,0) — 0, 0 « x « L. 


370 Advanced Engineering Mathematics with MATLAB 


28. Find the solution of 


Ou  O?u 
|] Əz ” 0<a<L, 0 « t, 

with the boundary conditions u(0,t) = 1, and u(L,t) = 0, 0 < t, and the initial condition 
u(r,0)—20,0« x< L. 


29. Solve!® , P 
u u 
Be TEUS g 0<a<L, 0 « t, 
with the boundary conditions 
Ou(0, t) 5 Ou(L, t) 
Fx 0, a Fa +kgu(L,t)=0, 0<t, 


and the initial condition u(z,0) = uo, 0 « x < L. 


30. Solve " r E 
u u 0^u 
H = 1 t 
RET au O0<a<l, 0<t, 
with the boundary conditions 
Ou(0, t) Ou(1, t) 
SA = : E 
ae u(0, t), om 0, 0 « t, 


and the initial condition u(z,0) = 1, 0 < x < 1. Hint: Let u(x,t) = v(z,t)exp[(2x — 
t)/(4a?)] so that the problem becomes 


av ae 
Ot |. Oz? 


with the boundary conditions 


O0<a<l1, Oc«t, 


a Ov(0, t) 
Ox 


,dv(1,t) 


2 SEVA 
. Ox 


=v(0,t), 2a = —v(1, t), 0 « t, 


and the initial condition v(z,0) = exp[-z/(2a?)], 0 <a < 1. 


31. Solve the heat equation in spherical coordinates 


ðu aoa ( o) . a? Q*(ru) 


= r 
Or r Or? ^" 


Ot  r?Ór 


O<r<l, Oct, 
subject to the boundary conditions lim, ,o |u(r,t)| < oo, and u(1,t) = 0, 0 < t, and the 


initial condition u(r,0) 2 1, 0 € r « I. 


32. Solve the heat equation in spherical coordinates 


2 2 92 
ou g y (eg) Tm a«r«B, 0«t, 


Ot | r?Ó0r 


16 Motivated by problems solved in Gomer, R., 1951: Wall reactions and diffusion in static and flow 
systems. J. Chem. Phys., 19, 284—289. 
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subject to the boundary conditions u(a,t) = ur(8,t) = 0, 0 < t, and the initial condition 
u(r,0) 2 ug, a « r « B. 


33. Solve!" the heat equation in spherical coordinates 


Ou 4 (Ou  2du\  a?O*(ru) 
a” (Praa Or? ’ 2 Dai 


subject to the boundary conditions 


ðu(b, t) = uh t), 0< t, 


— 0, and m b 


and the initial condition u(r,0) = uo, 0 € r < b. 


34. Solve!? the heat equation in cylindrical coordinates 


ðu v (55 1 Ou 


mf zo dus < 
Ot Or? 2:3] PSP. Mess 


subject to the boundary conditions limpo |u(r,t)| < oo, and u(b,t) = uo, 0 < t, and the 
initial condition u(r,0) 2 0, 0 <r « b. 


35. Solve the heat equation in cylindrical coordinates 


Ou x 


= "ar 


= < b t 
En no J O0<r<b, 0<t, 


subject to the boundary conditions lim;,.,o |u(r,t)| < co, and u(b,t) = 0, 0 < t, and the 
initial condition u(r,0) 2 1, 0 € r « b. 


36. Solve the heat equation in cylindrical coordinates 


Qu T 


ðt r Or "Or 


), O0<r<l, Oct, 


subject to the boundary conditions lim,_,9 |u(r,t)| < co, and u(1,t) = 0, 0 < t, and the 
initial condition 

so) A, 0<r<bd, 
"^ |B, b<r<l. 


37. The equation!’ 


ou _G, Pu | 10u 
ot — á ðr? or Or 


js O0<r<b, 0<t, 


17 Zhou, H., S. Abanades, G. Flamant, D. Gauthier, and J. Lu, 2002: Simulation of heavy metal vapor- 
ization dynamics of a fluidized bed. Chem. Eng. Sci., 57, 2603-2614. See also Mantell, C., M. Rodriguez, 
and E. Martinez de la Ossa, 2002: Semi-batch extraction of anthocyanins from red grape pomace in packed 
beds: Experimental results and process modelling. Chem. Eng. Sci., 57, 3831—3838. 


18 See Destriau, G., 1946: Propagation des charges électriques sur les pellicules faiblement conductrices 
“problém plan.” J. Phys. Radium, 7, 43-48. 


19 See Szymanski, P., 1932: Quelques solutions exactes des équations de l'hydrodynamique du fluide 
visqueux dans le cas d'un tube cylindrique. J. Math. Pures Appl., Ser. 9, 11, 67-107. 
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governs the velocity u(r,t) of an incompressible fluid of density p and kinematic viscosity 
v flowing in a long circular pipe of radius b with an imposed, constant pressure gradient 
—G. If the fluid is initially at rest, u(r,0) = 0, 0 € r < b, and there is no slip at the wall 
u(b, t) = 0, 0 < t, find the velocity at any subsequent time and position. 


38. Solve the heat equation in cylindrical coordinates 


Ou x 


at r Or "Or 


) O<r<b, 0<t, 


subject to the boundary conditions lim,_,9 |u(r,t)| < oo, and u,(b,t) = —h u(b,t), 0 < t, 
and the initial condition u(r,0) = b? — r?, 0 € r <b. 


39. Solve?? the heat equation in cylindrical coordinates 


Ou ,(0u 18u 
ET (5: -x) KU, O0<r<L, 0<t, 
subject to the boundary conditions lim,_,9 |u(r, t)| < oo, and u,(L,t) = —hu(L,t), 0 < t, 


and the initial condition 
ur de 0, 0<r<b, 
d v To, b<r < L, 


where b < L, and 0 < h, &. 


40. In their study of heat conduction within a thermocouple through which a steady current 
flows, Reich and Madigan?! solved the following nonhomogeneous heat conduction problem: 


Ou u 

8t ^ Ox? 
where J represents the Joule heating generated by the steady current and the P term 
represents the heat loss from Peltier cooling.?? Find u(x,t) if both ends are kept at zero 
[u(0,£) = u(L,t) = 0] and initially the temperature is zero [u(z, 0) = 0]. The interesting 
aspect of this problem is the presence of the delta function. 


= J—Pd(x- b), O0<a<L, 0«t, 0<b<L, 


Step 1: Assuming that u(x,t) equals the sum of a steady-state solution w(x) and a transient 
solution v(z, t), show that the steady-state solution is governed by 


Pw 
a2 — — 
dx? 


Step 2: Show that the steady-state solution is 


dide Jz(L — x)/2a? + Ax, 0 «az «b, 
— | Ja(L — z)/2aà? + B(L — x), bs gre D. 


20 Mack, W., M. Plóchl, and U. Gamer, 2000: Effects of a temperature cycle on an elastic-plastic shrink 
fit with solid inclusion. Chinese J. Mech., 16, 23-30. 


21 Reich, A. D., and J. R. Madigan, 1961: Transient response of a thermocouple circuit under steady 
currents. J. Appl. Phys., 32, 294-301. 


?? In 1834 Jean Charles Athanase Peltier (1785-1845) discovered that there is a heating or cooling 
effect, quite apart from ordinary resistance heating, whenever an electric current flows through the junction 
between two different metals. 
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Step 3: The temperature must be continuous at x = b; otherwise, we would have infinite 
heat conduction there. Use this condition to show that Ab — B(L — b). 


Step 4: To find a second relationship between A and B, integrate the steady-state differ- 
ential equation across the interface at x = b and show that 


b+e 
. dw 
lim a? — = PF: 
c0 dx be 


Step 5: Using the result from Step 4, show that A + B = —P/a?, and 


_ Jf Ja(L — z)/2a? — Px(L — b)/a?L, 0c zx «b, 
up Ja(L — z)/2a? — Pb(L — x)/a2L, b«c «L. 


Step 6: Re-express w(x) as a half-range Fourier sine expansion and show that 


| AJL? & sin(2m — 1)sz/L] | 2LP «5 sin(nz6/ L) sin(nrz/L) 


w(x) = 
me = (2m — 1)? a?r? L n? 


Step 7: Use separation of variables to find the transient solution by solving 
Ov | 40?v 
Ee ee gee 
Ot Ox?’ 

subject to the boundary conditions v(0,t) = v(L,t) = 0, 0 < t, and the initial condition 

v(x, 0) 2 —w(x),0 « x < L. 

Step 8: Add the steady-state and transient solutions together and show that 


O0<a<L, 0<t, 


4JL? SS sin[(2m — 1)r2/L] ee go 
uc a?r’ (2m — 1)? | dio ind | 
m=1 


2LP G sin(nzb/L) sin(nzz/ L) f niet 
e ; 


a2 n2 n2 
n=1 


41. Use separation of variables to solve?? the partial differential equation 


Ou u Ou 
Ot 0x? 
subject to the boundary conditions that uz(0,t) + 2au(0,t) = 0, uz(1,t) + 2au(1,t) = 0, 
0 < t, and the initial condition that u(z, 0) — 1, 0 «€ z « 1. 
Step 1: Introducing u(x,t) = e **v(z,t), show that the problem becomes 


O0<a<l, 0<t, 


ðv v , 

— — LI — av 

Ot | Oz? 
subject to the boundary conditions that v, (0, t) + av(0, t) = 0, vz(1,t)+ av(1,t) — 0, 0 <t, 
and the initial condition that u(x,0) = e°, 0 « z « 1. 


; O0<a<l, Oct, 


?3 See DeGroot, S. R., 1942: Théorie phénoménologique de l'effet Soret. Physica, 9, 699—707. 
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Step 2: Assuming that v(z,t) = X(x)T(t), show that the problem reduces to the ordinary 
differential equations 


X"-(A—-a*)X20, X’'(0)+aX(0)=0, X'(1)--aX(1) =0, 


and T" + AT = 0, where A is the separation constant. 

Step 3: Solve the eigenvalue problem and show that Ag = 0, Xo(r) = e°”, To(t) = Ao, 
2 2-2 

and A, = a? -- n?g?, X, (x) = asin(nax) — nv cos(nzz), and T,(t) = A,e-(* *" Tt, where 

n —1,2,3,..., so that 


u(x,t) = Age "* + 5 An [a sin(nzx) — nr cos(nrx)] e 2 MAD 
n=1 
Step 4: Evaluate Ag and A, and show that 
u(x,t) = ae z + 4am Sa rel [lasin(nrz) — nr cos(nzz)] eae (a trae) 
i 1— e724 SES E n s 2 i 


42. Use separation of variables to solve? the partial differential equation 


Ou O^u 


rət dp 0<a<l, 0<t, 


subject to the boundary conditions that u(0,t) = u4(0,t) = u(1,t) = u,(1,t) = 0, 0 < t, 
and the initial condition that u(z,0) = Az/2— (1— A)z?(—2),0 E z € 1. 

Step 1: Assuming that u(x,t) = X(xr)T(t), show that the problem reduces to the ordinary 
differential equations X"" +k? X" = 0, X(0) = X'(0) = X(1) = X'(1) 20, and T' - ?T = 
0, where k? is the separation constant. 


Step 2: Solving the eigenvalue problem first, show that 


1 — cos(k,,) 


Xn (z) = 1 — cos(kn 3 or 
(x) cos(k,x) + A 


[sin(k,z) — knz], 


where kp denotes the nth root of 
2 — 2cos(k) — ksin(k) = sin(k/2)[sin(k/2) — (k/2) cos(k/2)] = 


Step 3: Using the results from Step 2, show that there are two classes of eigenfunctions: 
kn = 2nm , X4(x) = 1 — cos(2nz2), and 


tan(&5/2) = Kn/2, Xn(x) = 1 — cos(&,x) + S nim — Kn]. 


Kin 
Step 4: Consider the eigenvalue problem 


XU xy". —COxlue 


24 See Hamza, E. A., 1999: Impulsive squeezing with suction and injection. J. Appl. Mech., 66, 945-951. 
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with the boundary conditions X(0) = X'(0) = X(1) = X'(1) = 0. Show that the orthogo- 
nality condition for this problem is 


| E S me oe 
0 


where X n(x) and X,,(r) are two distinct eigenfunctions of this problem. Then show that 
we can construct an eigenfunction expansion for an arbitrary function f(x) via 


— Sof (@) Xia) dz 
So [Xi (a)? dz 


f(r)- M CnXn(x), provided — C, 
n=1 


and f'(x) exists over the interval (0, 1). Hint: Follow the proof in Section 6.2 and integrate 
repeatedly by parts to eliminate the higher derivative terms. 


Step 5: Show that 
1 
Í [X7 (x)? dz = 2n?x?, 
0 


if X4(z) = 1 — cos(2nzz), and 


] Por as = 03/2, 
0 


if X4(z) = 1 — cos(&,2) + 2[sin(&,z) — &,z]/&,. Hint: sin(&;) = &4[1 + cos(«,)]/2. 
Step 6: Use the above results to show that 


u(x,t) = 5 An[l — cos(2nza)]e- 4" 7! 
n=1 


> 2 
+ 1 B, fı — cos(Kinx) — z sin(ez) — zi aoe, 
n=1 


n 


where A, is the Fourier coefficient corresponding to the eigenfunction 1 — cos(2n7a) while 
Bn is the Fourier coefficient corresponding to the eigenfunction 1 — cos(K,2) — 2[sin(K,x) — 
Kye] / Rig: 


Step 7: Show that An = 0 and Bn = 2(1 — A)/&2, so that 


u(x,t) = 2(1 — A) 5 fı — cos(Kn£) — 2 ented — zi B : 


K 
n=1 "n 


Hint: sin(&4) = &4[1 4- cos(&4)]/2, sin(&,) = 2[1 —cos(&,)]/&4, and cos(Kn) = (4— K2)/(4+ 


kŽ). 


43. Use separation of variables to solve?? the partial differential equation 


ðu > (S5 1du u 
=a | 


= ! <r<l t, 
ot Or? rør i) Sete Mes 


?5 See Littlefield, D. L., 1991: Finite conductivity effects on the MHD instabilities in uniformly elongating 
plastic jets. Phys. Fluids, Ser. A, 3, 1666-1673. 
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subject to the boundary conditions that 
Tim |u(r, t)] « oo, u(1,t) — K, 0 « t, 
and the initial condition that 
u(r,0) = g(r), 0«r «I. 


Step 1: Setting u(r,t) = Kr + v(r,t), show that the problem becomes 


ðv (8v DON. v 
8t | Or2 rOr r? 


) , O<r<l, 0<t, 
subject to the boundary conditions that 

lim |u(r,t)| < oo, v(1,t) = 0, 0 « t, 

r—0 
and the initial condition that 


u(r,0) 2 g(r) — Kr, 0 cr «I. 


Step 2: Assuming that v(r,t) = R(r)T(t), show that the problem reduces to the ordinary 
differential equations 


1 1 
R" --R' (e z) R=0, 
r T 


and 


T'- ak T =0 


with the boundary conditions lim,_,9 | R(r)| < oo and R(1) = 0, where k? is the separation 
constant. 


Step 3: Solving the eigenvalue problem first, show that R,(r) = Jı (knr) where kn denotes 
the nth root of Jı (k) = 0and n = 1,2,3,.... 


Step 4: Show that T,,(t) = A,e~® Fat so that 
v(r,t) — 5 An Ji (karje t, 
n=1 
Step 5: Using the initial condition, show that 


v(r,0) = g(r) - Kr = 5 An Ji (kr). 


n=1 


Step 6: Evaluate A, and show that it equals 


2 ae i 
An = SETTE: | aa g(r)Ja(k,r)rdr|. 
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Figure 8.4.1: Schematic of the numerical solution of the heat equation when we hold both ends at a 
temperature of zero. 


8.4 NUMERICAL SOLUTION OF THE HEAT EQUATION 


In the previous chapter we showed how we may use finite difference techniques to solve 
the wave equation. In this section we show that similar considerations hold for the heat 
equation. 

Starting with the heat equation 


Ou 2 O7u 
— =a > 8.4.1 
ot Ox?’ ( ) 
we must first replace the exact derivatives with finite differences. Drawing upon our work 
in Section 7.6, 

Ou(ras,t,) uT — um 


oo ag OA (8.4.2) 


and 
O^ u(am,tn) _ Um41 — 2Um Pt 


Ox? B (Ax)? 


+ O[(Az)?}, (8.4.3) 


where the notation u7, denotes u(%m,tn). Figure 8.4.1 illustrates our numerical scheme 
when we hold both ends at the temperature of zero. Substituting Equation 8.4.2 and 
Equation 8.4.3 into Equation 8.4.1 and rearranging, 


a? At 
unc a ṣ (Aa)? (umpi — Um + un, 1). (8.4.4) 


The numerical integration begins with n = 0 and the value of u9,,,, u?,, and u9, , are 
given by f(mAmx). 

Once again we must check the convergence, stability, and consistency of our scheme. We 
begin by writing u?, ,,, u^, 4, and wu?! in terms of the exact solution u and its derivatives 
evaluated at the point £m = mAz and t, = nAt. By Taylor's expansion, 


Qu 
"HEP qi 


Ox 


ue 0?u 
HE a (Ac) Ox? 


n 


m 3 m 
,9u 


+ i(Ax) ea | verts (8.4.5) 


n 
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" dul” | 2.1 2 ,0*u|" 
uy =u Ary L + oA) Ba? |. 5 (Ac) ois. T (8.4.6) 
and ; 3 
Qu |" 9?u |" u|” 
n-cl. ,m | iL 2 1 3 : 
Substituting into Equation 8.4.4, we obtain 
um um J2 Une — 2Um t Um-—1 
At (Ax)? 
ðu sO UN. 1, Pul 4 Pu |” 


The first term on the right side of Equation 8.4.8 vanishes because u(x,t) satisfies the heat 
equation. Thus, in the limit of Az — 0, At — 0, the right side of Equation 8.4.8 vanishes 
and the scheme is consistent. 
To determine the stability of the explicit scheme, we again use the Fourier method. 
Assuming a solution of the form: 
um = eid pina (8.4.9) 


TL , 


we substitute Equation 8.4.9 into Equation 8.4.4 and find that 


e^ —1 oe? — 2e 8 
= 8.4.10 
At. 7 (Ax? ( ) 
or ^ 
f At 0 
a 21.47 sin? 7). All 
e (Az)? sin”| 5 (8 ) 


PAG 0 
1<1-42 si? ( 195 (8.4.12) 
Xx 


The right inequality is trivially satisfied if a?2At/(Az)? > 0, while the left inequality yields 


a? At 2 1 
(Ax)? ~ 2sin?(0/2)' 


(8.4.13) 


leading to the stability condition 0 < a?At/(Ax)? < $. This is a rather restrictive condition 
because doubling the resolution (halving Ax) requires that we reduce the time step by a 
quarter. Thus, for many calculations the required time step may be unacceptably small. For 
this reason, many use an implicit form of the finite differencing (Crank-Nicholson implicit 
method?6): 


um — Um G^ [Unga T Um + Um- ES umi — Pm + Usa (8.4.14) 
A 2 (Az)? (Ax)? , 4. 


?6 Crank, J., and P. Nicholson, 1947: A practical method for numerical evaluation of solutions of partial 
differential equations of the heat-conduction type. Proc. Cambridge. Philos. Soc., 43, 50-67. 
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10* 


10° 


error 


10? 
0.0 0.2 0.4 0.6 0.8 1.0 


at 
Figure 8.4.2: The growth of error ||e;|| as a function of a?t for various resolutions. For the top line, 
Az = 0.1; for the middle line, Ax = 0.01; and for the bottom line, Az = 0.001. 


although it requires the solution of a simultaneous set of linear equations. However, there 
are several efficient methods for their solution. 

Finally we must check and see if our explicit scheme converges to the true solution. If 
we let e», denote the difference between the exact and our finite differenced solution to the 
heat equation, we can use Equation 8.4.8 to derive the equation governing e7, and find that 


2 
T a^ At 


m T (Az)? (em+1 Qe" +e” ) + O((AO? + At(Az)?], (8.4.15) 


m m-—1 


for m = 1,2,..., M. Assuming that a?At/(Ax)? € 1, then 


- a AP p EAE s s au uu 
lem | < ESE lem-1| + i = Dd €m| + (Az? lema] + AGO? + At(Az)?] 

(8.4.16) 
< |len|| + A[(At)? + At(Ax)?], (8.4.17) 

where ||en|| = maxm=o,1,...,4 |e2;]. Consequently, 
llensall < llenll + AKA?  AA2)]. (8.4.18) 

Because ||eo]| = 0 and nAt < tn, we find that 

len+1|| € An[(At)? + A£(Az)?] < At, [At + (Axz)?]. (8.4.19) 


As Ax — 0, At — 0, the errors tend to zero and we have convergence. We have illustrated 

Equation 8.4.19 in Figure 8.4.2 by using the finite difference equation, Equation 8.4.4, to 

compute |le,,|| during a numerical experiment that used a?2At/(Az)? = 0.5, and f(x) = 

sin(zx). Note how each increase of resolution by 10 results in a drop in the error by 100. 
The following examples illustrate the use of numerical methods. 
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TEMPERATURE 


0.5 
DISTA NOE 0 06 «WE 


Figure 8.4.3: The numerical solution u(x,t) of the heat equation with a?At/(Az)? = 0.47 at various 
positions z’ = a/L and times t' = a?t/L? using Equation 8.4.4. The initial temperature u(z,0) equals 
Az'(1— 2x’) and we hold both ends at a temperature of zero. 


e Example 8.4.1 


For our first example, we redo Example 8.3.1 with a? At/(Az)? = 0.47 and 0.53. Our 
numerical solution was computed using the MATLAB script 


clear 
coeff = 0.47; % coeff = a?At/(Az)? 
ncount = 1; dx = 0.1; dt = coeff * dx * dx; 
N = 99; x = O:dx:1; 
M = 1/dx + 1; 4 M = number of spatial grid points 
tplot(1) = 0; u = zeros(M,N+1); 
for m = 1:M; u(m,1)-4xx(m)*(1-x(m)); temp(m,1)-u(m,1); end 
^ integrate forward in time 
for n = 1:N 
t = dt * n; 
for m = 2:M-1 
u(m,n+1) = u(m,n) + coeff*(u(m+1,n)-2*u(m,n)+u(m-1,n)); 
end 
if mod(n+1,2) == 
ncount = ncount + 1; tplot(ncount) = t; 
for m = 1:M; temp(m,ncount) = u(m,n*1); end 
end; end 
^4 plot the numerical solution 
X = x? * ones(1,length(tplot)); T = ones(M,1) * tplot; 
surf(X,T,temp) 
xlabel(? DISTANCE?,?Fontsize?,20); ylabel(’TIME’, ’Fontsize’ ,20) 
zlabel (’ TEMPERATURE’ , ’Fontsize’ , 20) 


As Figure 8.4.3 shows, the solution with a2At/(Az)? < 1/2 performs well. On the other 
hand, Figure 8.4.4 shows small-scale, growing disturbances when a?At/(Az)? > 1/2. It 
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TEMPERATURE 


Figure 8.4.4: Same as Figure 8.4.3 except that a2 At/(Ax)? = 0.53. 


should be noted that for the reasonable Az = L/100, it takes approximately 20,000 time 
steps before we reach a?t/L? — 1. 


e Example 8.4.2 


In this example, we redo the previous example with an insulated end at « = L. Using 
the centered differencing formula, 


UM+1 — UM = 9, (8.4.20) 
because uz(L,t) = 0. Also, ati = M, 


2 
+1 a* At 
uM —uwc (Ax)? 


(uy 7 2u + yi). (8.4.21) 
Eliminating u$, between the two equations, 


n+1 n a? At n n 
UM = UM + (Ax) (2u4 1 = 2u) ; (8.4.22) 


To implement this new boundary condition in our MATLAB script, we add the line 
u(M,nt+i) = u(M,n) + 2 * coeff * (u(M-1,n) - u(M,n)); 
after the lines 
for m - 2:M-1 
u(m,nti) = u(m,n) + coeff * (u(mti,n) - 2 * u(m,n) + u(m-1,n)0); 


end 
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TEMPERATURE 


Figure 8.4.5: Same as Figure 8.4.4 except that we now have an insulated boundary condition uz (L,t) = 0. 
Figure 8.4.5 illustrates our numerical solution at various positions and times. 
Project: Implicit Numerical Integration of the Heat Equation 


The difficulty in using explicit time differencing to solve the heat equation is the very 
small time step that must be taken at moderate spatial resolutions to ensure stability. This 
small time step translates into an unacceptably long execution time. In this project you will 
investigate the Crank-Nicholson implicit scheme, which allows for a much more reasonable 
time step. 


Step 1: Develop a MATLAB script that uses the Crank-Nicholson equation, Equation 8.4.14, 
to numerically integrate the heat equation. To do this, you will need a tridiagonal solver to 
find u'*1, This is explained at the end of Section 3.1. However, many numerical methods 
books?" actually have code already developed for your use. You might as well use this code. 


Step 2: Test your code by solving the heat equation given the initial condition u(z,0) = 
sin(7a), and the boundary conditions u(0,t) = u(1,t) = 0. Find the solution for various 
values of At with Az — 0.01. Compare this numerical solution against the exact solution 
that you can find. How does the error (between the numerical and exact solutions) change 
with At? For small At, the errors should be small. If not, then you have a mistake in your 
code. 


Step 3: Once you have confidence in your code, discuss the behavior of the scheme for 
various values of Az and At for the initial condition u(z,0) = 0 for 0 € x < Z, and 
u(r,0) = 1 for j < x < 1 with the boundary conditions u(0,t) = u(1,t) = 0. See Figure 


8.4.6. Although you can take quite a large At, what happens? Did a similar problem arise 


27 For example, Press, W. H., B. P. Flannery, S. A. Teukolsky, and W. T. Vetterling, 1986: Numerical 
Recipes: The Art of Scientific Computing. Cambridge University Press, Section 2.6. 
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Figure 8.4.6: The numerical solution u(x,t) of the heat equation uz; = a?u44 using the Crank-Nicholson 
method. The parameters used in the numerical solution are a? At = 0.005 and Ax = 0.05. Both ends are 


held at zero with an initial condition of u(z,0) = 0 for 0 < x < à and u(z,0) = 1 for 4 <a<l. 


in Step 2? Explain your results." Zvan et al?? have reported a similar problem in the 
numerical integration of the Black-Scholes equation from mathematical finance. 


Further Readings 


Carslaw, H. S., and J. C. Jaeger, 1959: Conduction of Heat in Solids. Oxford University 
Press, 510 pp. The source book on solving the heat equation. 


Crank, J., 1970: The Mathematics of Diffusion. Oxford University Press, 347 pp. A source 
book on the solution of the heat equation. 


Koshlyakov, N. S., M. M. Smirnov, and E. B. Gliner, 1964: Differential Equations of Math- 
ematical Physics. North-Holland Publishing, 701 pp. See Part III. Nice presentation of 
mathematical techniques. 


Morse, P. M., and H. Feshback, 1953: Methods of Theoretical Physics. McGraw-Hill Book 
Co., 997 pp. A portion of Chapter 12 is devoted to solving the heat equation. 


28 Luskin, M., and R. Rannacher, 1982: On the smoothing property of the Crank-Nicolson scheme. 
Applicable Anal., 14, 117-135. 


29 Zvan, R., K. Vetzal, and P. Forsyth, 1998: Swing low, swing high. Risk, 11(3), 71-75. 
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Chapter 9 


Laplace's Equation 


In the previous chapter we solved the one-dimensional heat equation. Quite often we 
found that the transient solution died away, leaving a steady state. The partial differential 
equation that describes the steady state for two-dimensional heat conduction is Laplace's 
equation 

Oui du 
sa +a = 0. 9.0.1 
Ox? Oy? ( ) 


In general, this equation governs physical processes where equilibrium has been reached. It 
also serves as the prototype for a wider class of elliptic equations 


Oru 9? 2u ðu ðu 
a(x, t) Oz? | b(z, t) x,t) = (ztu ðr’ x) y (9.0.2) 


where b? < 4ac. Unlike the heat and wave equations, there are no initial conditions and 
the boundary conditions completely specify the solution. In this chapter we present some 
of the common techniques for solving this equation. 


9.1 DERIVATION OF LAPLACE'S EQUATION 


Imagine a thin, flat plate of heat-conducting material between two sheets of insulation. 
Sufficient time has passed so that the temperature depends only on the spatial coordinates 
x and y. Let us now apply the law of conservation of energy (in rate form) to a small 
rectangle with sides Az and Ay. 
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If qa (x, y) and qy(z, y) denote the heat flow rates in the zx- and y-direction, respectively, 
conservation of energy requires that the heat flow into the slab equals the heat flow out of 
the slab if there is no storage or generation of heat. Now 


rate in = qz (z,y + Ay/2)Ay + gy (xz + Ax/2, y)Az, (9.1.1) 


and 
rate out = qz(x + Az, y + Ay/2)Ay + qy(a + Ax/2,y + Ay)Az. (9.1.2) 


If the plate has unit thickness, 
[qs (7, y + Ay/2) — qa (x + Av, y + A^y/2)] Ay 
+ [gy (x + 2/2, y) — qy(a + Av/2,y + Ay)]Ax = 0. (9.1.3) 
Upon dividing through by AxvAy, we obtain two differences quotients on the left side of 
Equation 9.1.3. In the limit as Ax, Ay — 0, they become partial derivatives, giving 


Oda Ody u 
Dx + E =0 (9.1.4) 


for any point (x, y). 
We now employ Fourier's law to eliminate the rates qx and qy, yielding 


9 / ðu ð f 40u 
—0 9.1.5 
Ox (o x) TU (« a) ' ( ) 
if we have an isotropic (same in all directions) material. Finally, if a? is constant, Equation 
9.1.5 reduces to 


u Ou 


which is the two-dimensional, steady-state heat equation (i.e., u; œ 0 as t > oc). 

Solutions of Laplace's equation (called harmonic functions) differ fundamentally from 
those encountered with the heat and wave equations. These latter two equations describe 
the evolution of some phenomena. Laplace's equation, on the other hand, describes things 
at equilibrium. Consequently, any change in the boundary conditions affects to some degree 
the entire domain because a change to any one point causes its neighbors to change in order 
to reestablish the equilibrium. Those points will, in turn, affect others. Because all of these 
points are in equilibrium, this modification must occur instantaneously. 

Further insight follows from the maximum principle. If Laplace's equation governs a 
region, then its solution cannot have a relative maximum or minimum inside the region 
unless the solution is constant.! If we think of the solution as a steady-state temperature 
distribution, this principle is clearly true because at any one point the temperature cannot 
be greater than at all other nearby points. If that were so, heat would flow away from the 
hot point to cooler points nearby, thus eliminating the hot spot when equilibrium was once 
again restored. 

It is often useful to consider the two-dimensional Laplace's equation in other coordinate 
systems. In polar coordinates, where r — rcos(0), y — rsin(0), and z — z, Laplace's 
equation becomes i ; 

d t ae (9.1.7) 
3r? rr Oz? 


1 For the proof, see Courant, R., and D. Hilbert, 1962: Methods of Mathematical Physics, Vol. 2: 
Partial Differential Equations. Interscience, pp. 326-331. 
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Nc 


Today we best remember Pierre-Simon Laplace (1749-1827) for his work in celestial mechanics and 
probability. 


In his five volumes Traité de Mécanique céleste (1799-1825), he accounted for the 
theoretical orbits of the planets and their satellites. Laplace's equation arose during this study of 


gravitational attraction. (Portrait courtesy of the Archives de l'Académie des sciences, Paris.) 


if the problem possesses axisymmetry. On the other hand, if the solution is independent of 
z, Laplace's equation becomes 


ou | 10u | 18u _ 
ðr? rOr 72009 7 


(9.1.8) 


In spherical coordinates, x = r cos(q) sin(0), y = r sin(q) sin(0), and z = r cos(0), where 
= g? +y? + 22, 0 is the angle measured down to the point from the z-axis (colatitude) 
and q is the angle made between the x-axis and the projection of the point on the zy plane. 
In the case of axisymmetry (no p dependence), Laplace's equation becomes 


Of ,0u\ 1 øj. Ou} | 
Or (« 5) " sin(0) 00 EO |^ 


9.2 BOUNDARY CONDITIONS 


r? 


(9.1.9) 


Because Laplace's equation involves time-independent phenomena, we must only spec- 
ify boundary conditions. As we discussed in Section 8.2, we can classify these boundary 
conditions as follows: 


1. Dirichlet condition: u given 
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TOC C ; i —— 
2. Neumann condition: S. given, where n is the unit normal direction 
n 


o 
3. Robin condition: u + a given 
n 


along any section of the boundary. In the case of Laplace’s equation, if all of the boundaries 
have Neumann conditions, then the solution is not unique. This follows from the fact that 
if u(x, y) is a solution, so is u(x, y) + c, where c is any constant. 

Finally we note that we must specify the boundary conditions along each side of the 
boundary. These sides may be at infinity as in problems with semi-infinite domains. We 
must specify values along the entire boundary because we could not have an equilibrium 
solution if any portion of the domain was undetermined. 


9.3 SEPARATION OF VARIABLES 


As in the case of the heat and wave equations, separation of variables is the most 
popular technique for solving Laplace's equation. Although the same general procedure 
carries over from the previous two chapters, the following examples fill out the details. 


e Example 9.3.1: Groundwater flow in a valley 


Over a century ago, a French hydraulic engineer named Henri-Philibert-Gaspard Darcy 
(1803-1858) published the results of a laboratory experiment on the flow of water through 
sand. He showed that the apparent fluid velocity q relative to the sand grains is directly 
proportional to the gradient of the hydraulic potential — kV p, where the hydraulic potential 
q equals the sum of the elevation of the point of measurement plus the pressure potential 
(p/pg). In the case of steady flow, the combination of Darcy's law with conservation of 
mass V -q = 0 yields Laplace's equation V2 = 0 if the aquifer is isotropic (same in all 
directions) and homogeneous. 

To illustrate how separation of variables can be used to solve Laplace’s equation, we 
will determine the hydraulic potential within a small drainage basin that lies in a shallow 
valley. See Figure 9.3.1. Following Tóth,? the governing equation is the two-dimensional 


Laplace equation 
u Oru 


age Oye O0<a<L, 0<y< %p, (9.3.1) 

along with the boundary conditions 
u(x, zo) = gzo + gez, (9.3.2) 
us (0, y) = us(L, y) = 0, and uy(z,0) = 0, (9.3.3) 


where u(x, y) is the hydraulic potential, g is the acceleration due to gravity, and c gives the 
slope of the topography. The conditions u;(L,y) = 0, and u,(z, 0) = 0 specify a no-flow 
condition through the bottom and sides of the aquifer. The condition u;(0, y) = 0 ensures 
symmetry about the x = 0 line. Equation 9.3.1 gives the fluid potential at the water table, 
where zo is the elevation of the water table above the standard datum. The term gcx in 


2 Tóth, J., 1962: A theory of groundwater motion in small drainage basins in central Alberta, Canada. 
J. Geophys. Res., 67, 4375—4387. 
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Figure 9.3.1: Cross section of a valley. 


Equation 9.3.2 expresses the increase of the potential from the valley bottom toward the 
water divide. On average it closely follows the topography. 

Following the pattern set in the previous two chapters, we assume that u(x,y) = 
X (x)Y (y). Then Equation 9.3.1 becomes 


X"Y + XY" — 0. (9.3.4) 
Separating the variables yields 
b y" 


Both sides of Equation 9.3.5 must be constant, but the sign of that constant is not obvious. 
From previous experience we anticipate that the ordinary differential equation in the z- 
direction leads to a Sturm-Liouville problem because it possesses homogeneous boundary 
conditions. Proceeding along this line of reasoning, we consider three separation constants. 


Trying a positive constant (say, m?), Equation 9.3.5 separates into the two ordinary 
differential equations 


X" — m? X — 0, and — Y" m?Y — 0, (9.3.6) 
which have the solutions 
X(x) = Acosh(mz) + B sinh(mz), (9.3.7) 


and 


Y(y) = C cos(my) + D sin(my). (9.3.8) 


Because the boundary conditions, Equation 9.3.3, imply X'(0) = X'(L) = 0, both A and 
B must be zero, leading to the trivial solution u(x, y) = 0. 

When the separation constant equals zero, we find a nontrivial solution given by the 
eigenfunction Xo(r) = 1, and Yo(y) = 44o + Boy. However, because Yj(0) = 0 from 
Equation 9.3.3, Bo = 0. Thus, the particular solution for a zero separation constant is 
uo(z, y) = Ao/2. 

Finally, taking both sides of Equation 9.3.5 equal to —k?, 


X"4+khX=0, and Y"-kY=0. (9.3.9) 
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The first of these equations, along with the boundary conditions X'(0) = X'(L) = 0, gives 
the eigenfunction X,(x) = cos(k,x), with kn = nz/L, n = 1,2,3,.... The function Y,,(y) 
for the same separation constant is 


Yn (y) = An cosh(k,y) + Bn sinh(k, y). (9.3.10) 


We must take Bn = 0 because Y; (0) = 0. 

We now have the product solution X,,(x)Yn(y), which satisfies Laplace's equation and 
all of the boundary conditions except Equation 9.3.2. By the principle of superposition, the 
general solution is 


u(z,y) = a + S An eos (277) cosh (272) ; (9.3.11) 
n=1 


Applying Equation 9.3.2, we find that 


Ao  ~— nc NT Zo 
u(x, zo) = gzo + gez = = + 2. An cos (77) cosh ( T ) ; (9.3.12) 


which we recognize as a Fourier half-range cosine series such that 


2 L 
Ao = if (gzo + gcx) dz, (9.3.13) 
0 
and L 
2 
cosh (272) An = al (gzo + gex) cos (777) dx. (9.3.14) 
Performing the integrations, 
Ao = 2gzo + gcL, (9.3.15) 
i 2gcL[1 — (-1)"] 
gc tain 
An = 9.3.16 
n?n? cosh(nrzo/L) ( ) 
Finally, the complete solution is 
gcL 4gcL «* cos|(2m — 1)rz/L] cosh((2m — 1)ry/L] 
= ; 9.3.17 
ulm y) Sb mA, (2m — 1)? cosh[2m — 1)«zo/L] ( ) 


m=1 


Figure 9.3.2 presents two graphs by Toth for two different aquifers. We see that the solution 
satisfies the boundary condition at the bottom and side boundaries. Water flows from the 
elevated land (on the right) into the valley (on the left), from regions of high to low hydraulic 
potential. 


e Example 9.3.2 


In the previous example, we had the advantage of homogeneous boundary conditions 
along x = 0 and x = L. In a different hydraulic problem, Kirkham? solved the more difficult 
problem of 

u Ou 


sp op 0<a<L, O<y<h, (9.3.18) 


3 Kirkham, D., 1958: Seepage of steady rainfall through soil into drains. Trans. Am. Geophys. Union, 
39, 892-908. 
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Figure 9.3.2: Two-dimensional potential distribution and flow patterns for different depths of the hori- 
zontally impermeable boundary. 


subject to the Dirichlet boundary conditions 


u(z,0)= Rz, u(x,h)= RL, u(L,y)= RL, (9.3.19) 
and 
0, 0<y <a, 
u(0,y) = Fly -a) a<y<b, (9.3.20) 
RL, b<y<h 


This problem arises in finding the steady flow within an aquifer resulting from the intro- 
duction of water at the top due to a steady rainfall and its removal along the sides by 
drains. The parameter L equals half of the distance between the drains, h is the depth of 
the aquifer, and R is the rate of rainfall. 

The point of this example is: We need homogeneous boundary conditions along either 
the x or y boundaries for separation of variables to work. We achieve this by breaking the 
original problem into two parts, namely 


u(x, y) = v(z, y) + w(z, y) + RL, (9.3.21) 
where , : 
ov ðv 
50 ae L 3.22 
0:2 ap 0, O<a<L, O<y<h, 9.3.22) 
with 
v(0,y) 2 v(L,y) 20, | v(z,h) = 0, 9.3.23) 
and 
v(z,0) = R(x — L); 9.3.24) 
w Ow 
am a O0<a<L, 0<y<h, 9.3.25) 
with 
w(z,0) = w(xz,h) = 0, w(L, y) ^ 0, 9.3.26) 
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and 
—RL, 0<y <a, 
w(0,y) = FL (y — a) — RL, a«y«b, (9.3.27) 
0, b<y<h. 


Employing the same technique as in Example 9.3.1, we find that 


= , (mix sinh[mz(h — y)/L] 
LONE. ; 9.3.28 
vimy) » sin( L ) sinh(nzh/L) EOM) 
where P 
2 . (NTE 2RL 
des al R(x — L) sin( i ) de = -——. (9.3.29) 
Similarly, the solution to w(x, y) is found to be 
m: , (mmyN sinh[nz(L — x)/h] 
= Bn - ; .3.30 
w(z, y) 2. sin( h ) sinh(n7 L/h) Po 


where 


Bn = re [sin (=) ane f (= 2 1) sin(“™) ay (9.3.31) 
= 221 Um DTE |sin(“™) (r5) =}. (9.3.32) 


The complete solution consists of substituting Equation 9.3.28 and Equation 9.3.30 into 
Equation 9.3.21. 


e Example 9.3.3 


The electrostatic potential is defined as the amount of work that must be done against 
electric forces to bring a unit charge from a reference point to a given point. It is readily 
shown? that the electrostatic potential is described by Laplace’s equation if there is no 
charge within the domain. Let us find the electrostatic potential u(r,z) inside a closed 
cylinder of length L and radius a. The base and lateral surfaces have the potential 0 while 
the upper surface has the potential V. 

Because the potential varies in only r and z, Laplace’s equation in cylindrical coordi- 
nates reduces to 


10 f Qu ^u 
E < 3. 
(rs) 52 0, O<r<a, O0«zc«L, (9.3.33) 


subject to the boundary conditions 


u(a,z) = u(r,0) 20, and u(r, L)= V. (9.3.34) 


^ For static fields, V x E = 0, where E is the electric force. From Section 4.4, we can introduce a 
potential y such that E = Vy. From Gauss’ law, V. E = V?o = 0. 
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To solve this problem by separation of variables,? let u(r,z) = R(r)Z(z) and 


1 d/ dR 1 ae) k? 
= = " d. 
rRdr ¢ dr )- Z dz? a? 599] 


Only a negative separation constant yields nontrivial solutions in the radial direction. In 


that case, we have that 
Lee + ii R=0 (9.3.36) 
r dr i dr a2 2x 


The solutions of Equation 9.3.36 are the Bessel functions Jo(kr/a) and Yo(kr/ a). Because 
Yo(kr/a) becomes infinite at r — 0, the only permissible solution is Jo(kr/a). The condition 
that u(a,z) = R(a)Z(z) = 0 forces us to choose values of k such that Jo(k) = 0. Therefore, 
the solution in the radial direction is Jo(k,r/a), where kn is the nth root of Jo(k) = 0. 
In the z direction, 
d k? 
da @ 


The general solution to Equation 9.3.37 is 


n =Q. (9.3.37) 


Zn(z) = An sinh (“= ) +B, cosh ( “2 Ji (9.3.38) 


Because u(r,0) = R(r)Z(0) = 0 and cosh(0) = 1, B, must equal zero. Therefore, the 


general product solution is 
kn : kn 
z) sin (94) (9.3.39) 
a a 


The condition that u(r, L) = V determines the arbitrary constant An. Along z = L, 


=V = yan (“= “sin (5 2), (9.3.40) 


[kaL 2V L knr 
san (9) An = 2 T Jo (=) dr (9.3.41) 


from Equation 6.5.38 and Equation 6.5.45. Thus, 


. kn L u 2V kar knr 
sn) An = pg C) ^) 


The solution is then 


et 


n=1 


where 


° u 2V 
0 ky Ji(kn) 


(9.3.42) 


Jo(k m sinh(k,z/a) 
(r, z =v i, J, (k,) ak L Ja)" (9.3.43) 


5 Wang and Liu [Wang, M.-L., and B.-L. Liu, 1995: Solution of Laplace equation by the method of 
separation of variables. J. Chinese Inst. Eng., 18, 731-739] have written a review article on the solutions 
to Equation 9.3.33 based upon which order the boundary conditions are satisfied. 
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Figure 9.3.3: The steady-state potential (divided by V) within a cylinder of equal radius and height a 
when the top has the potential V while the lateral side and bottom are at potential 0. 


Figure 9.3.3 illustrates Equation 9.3.43 for the case when L = a where we included the 
first 20 terms of the series. It was created using the MATLAB script 


clear 

L_over_a = 1; M = 20; dr = 0.02; dz = 0.02; 

^4 load in zeros of J.0 

zero( 1) = 2.40482; zero( 2) 5.52007; zero( 3) = 8.65372; 
zero( 4) = 11.79153; zero( 5) 14.93091; zero( 6) = 18.07106; 
zero( 7) = 21.21164; zero( 8) 24.35247; zero( 9) = 27.49347; 
zero(10) = 30.63461; zero(11) 33.77582; zero(12) = 36.91710; 
zero(13) = 40.05843; zero(14) 43.19979; zero(15) = 46.34119; 
zero(16) = 49.48261; zero(17) 52.62405; zero(18) = 55.76551; 
zero(19) = 58.90698; zero(20) 62.04847; 


^4 compute Fourier coefficients 


for m= 1:M 


a(m) = 2/(zero(m)*besselj(1,zero(m))*sinh(L over a*zero(m))); 


end 


^ compute grid and initialize solution 


R_over_a = 


[0:dr:1]; Z_over_a = 


[0:dz:1]; 


u = zeros(length(Z over a) ,length(R_over_a)); 

RR over.a = repmat(R over a,[length(Z over.a) 1]); 
ZZ over.a = repmat(Z over a?,[1 length(R over. .a)]); 
^ compute solution from Equation 9.3.43 


for m= 1:M 


u=uta(m) .*besselj(0,zero(m)*RR_over_a) .* sinh(zero(m)*ZZ over a); 


end 
surf (RR_over_a, ZZ_over_a,u) 


xlabel(’R/A’ ,’Fontsize’ ,20); ylabel(’Z/A’,’Fontsize’ ,20) 


zlabel(’U(R,Z)’,’Fontsize’ ,20) 
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Of particular interest are the ripples along the line z — L. Along that line, the solution must 
jump from V to 0 at r — a. For that reason our solution suffers from Gibbs phenomena 
along this boundary. As we move away from that region the electrostatic potential varies 
smoothly. 


e Example 9.3.4 

Let us now consider a similar, but slightly different, version of Example 9.3.3, where 
the ends are held at zero potential while the lateral side has the value V. Once again, the 
governing equation is Equation 9.3.33 with the boundary conditions 


u(r,0) 2 u(r,L) 20, and u(a,z) =V. (9.3.44) 


Separation of variables yields 


1 d/ dR IUE k 
rR dr (« =) "^ Zd2 P Modo) 
with Z(0) = Z(L) = 0. We chose a positive separation constant because a negative con- 
stant would give hyperbolic functions in z that cannot satisfy the boundary conditions. A 
separation constant of zero would give a straight line for Z(z). Applying the boundary 
conditions gives a trivial solution. Consequently, the only solution in the z direction that 
satisfies the boundary conditions is Z,,(z) = sin(nmz/L). 
In the radial direction, the differential equation is 


1 d f. dR, 272 
¢ a ) 2T Boer (9.3.46) 


r dr dr L2 


As we showed in Section 6.5, the general solution is 
nam nar 


R,,(r) = Anlo( ==) $ By Ko(“*) (9.3.47) 


where Jj and Ko are modified Bessel functions of the first and second kind, respectively, of 
order zero. Because K(x) behaves as — ln(x) as x — 0, we must discard it and our solution 
in the radial direction becomes R,,(r) = A&Io(nzr/L). Hence, the product solution is 


us(r,z) = Anlo( ==) sin(“**) ; (9.3.48) 


and the general solution is a sum of these particular solutions, namely 
t nar nrz 
=Y Al (=) i (=) l 9.3.49 
u(r, z) > o( >) sin| = ( ) 


Finally, we use the boundary conditions that u(a,z) = V to compute A,. This condition 
gives 
nrz 


u(a,z2) = V = »» Anlo( =) sin(“**) i (9.3.50) 


so that 
nra _ 2 i: a (NZ _ AV[I —(—1)"] 
ai f vam) ue VEEN oss) 
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Therefore, the final answer is 


) = A SR lm = Nr /t sin i (9.3.52) 


u(r, z) = (2m — 1)Ip[(2m — 1)ra/L] 


m=1 


Figure 9.3.4 illustrates the solution, Equation 9.3.52, for the case when L = a. It was 
created using the MATLAB script 


clear 

a_over_L = 1; M = 200; dr = 0.02; dz = 0.02; 

^ compute grid and initialize solution 

R_over_L = [0:dr:1]; Z_over_L = [0:dz:1]; 

u = zeros(length(Z_over_L) ,length(R_over_L)) ; 
RR_over_L = repmat(R over L,[length(Z over L) 1]); 
ZZ over.L = repmat(Z over.L?,[1 length(R over.L)]); 


for m - 1:M 
temp = (2xm-1)*pi; prodi = temp*a_over_L; 
^ compute modified bessel functions in Equation 9.3.52 
for j = 1i:length(Z over L); for i = 1:length(R over L); 
prod2 = temp*RR_over_L(i, j); 
if prod2 - prodi > -10 
if prod2 « 20 
ratio(i,j) = besseli(0,prod2) / besseli(0,prod1); 
else 
^ for large values of prod, use asymptotic expansion 
^ for modified bessel function 
ratio(i,j) = sqrt(prodi/prod2) * exp(prod2-prod1); end; 
else 
ratio(i,j) = 0; end 
end; end; 
^ compute solution from Equation 9.3.52 
u = u + (4/temp) * ratio .* sin(temp*ZZ_over_L) ; 
end 
surf (RR_over_L, ZZ_over_L, u) 
xlabel(’R/L’ ,’Fontsize’ ,20); ylabel(’Z/L’,’Fontsize’ ,20) 
zlabel(’SOLUTION’ , ’Fontsize’ ,20) 
Once again, there is a convergence of equipotentials at the corners along the right side. If 


we had plotted more contours, we would have observed Gibbs phenomena in the solution 
along the top and bottom of the cylinder. 


e Example 9.3.5 


In the previous examples, the domain was always of finite extent. Assuming axial 
symmetry, let us now solve Laplace’s equation 


10/ Qu Oru 
| = < "9. 
21043 P 0, O<r<w, 0«zc«oo, (9.3.53) 
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Figure 9.3.4: Potential (divided by V) within a conducting cylinder when the top and bottom have a 
potential 0 while the lateral side has a potential V. 


in the half-plane z > 0 subject to the boundary conditions 


; | f Uo, r «a, 
jim |u(r, z)| < oo, u(r,0) — { 0, nod. (9.3.54) 
lim |u(r, z)| < oo; and lim [u(r, z)| < co. (9.3.55) 
r—0 T—00 


This problem gives the steady-state temperature distribution in the half-space z > 0 where 
the temperature on the bounding plane z = 0 equals uo within a circle of radius a and 
equals 0 outside of the circle. 

As before, we begin by assuming the product solution u(r,z) = R(r)Z(z) and separate 
the variables. Again, the separation constant may be positive, negative, or zero. Turning 
to the positive separation constant first, we have that 


R" 1 R’ m t 2 


RR cm. (9.3.56) 
Focusing on the R equation, 
A 1 / 
Es + AES dies 0, or r?R'-trR—- m?^r?R — 0. (9.3.57) 
R rR 
The solution to Equation 9.3.57 is 
R(r) = Aylo(mr) + As Ko(mr), (9.3.58) 


where Io(-) and Ko(-) denote modified Bessel functions of order zero and the first and second 
kind, respectively. Because u(r,z), and hence R(r), must be bounded as r — 0, A» = 0. 
Similarly, since u(r, z) must also be bounded as r — oo, Ai = 0 because limpo Jo(mr) > 
oo. Thus, there is only a trivial solution for a positive separation constant. 

We next try the case when the separation constant equals 0. This yields 


+-—=0, or r?R"4rR-—0. (9.3.59) 
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'The solution here is 
R(r) = A, + Ag In(r). (9.3.60) 


Again, boundedness as r — 0 requires that A9 = 0. What about Aj? Clearly, for any 
arbitrary value of z, the amount of internal energy must be finite. This corresponds to 


f lu(r,z))dr «oo or J. |R(r)| dr < oo (9.3.61) 
0 0 


and A, — 0. The choice of the zero separation constant yields a trivial solution. 
Finally, when the separation constant equals —k?, the equations for R(r) and Z(z) are 


rR’ +rR+k’r Hi — 0, and |Z"—K?*Z =0, (9.3.62) 
respectively. Solving for R(r) first, we have that 
R(r) LI A,Jo(kr) + A2Yo(kr), (9.3.63) 


where Jo(-) and Yo(-) denote Bessel functions of order zero and the first and second kind, 
respectively. The requirement that u(r,z), and hence R(r), is bounded as r — 0 forces us 
to take A» = 0, leaving R(r) = A1Jo(kr). From the equation for Z(z), we conclude that 


Z(z) = Bie"? + Bge ^. (9.3.64) 


Since u(r, z), and hence Z(z), must be bounded as z — oo, it follows that Bı = 0, leaving 
Z(z) = Boe F*. 

Presently our analysis follows closely those for a finite domain. However, we have sat- 
isfied all of the boundary conditions and yet there is still no restriction on k. Consequently, 
we conclude that k is completely arbitrary and any product solution 


ux(r, z) = A1B3 Jo(kr) e^ ^7 (9.3.65) 


is a solution to our partial differential equation and satisfies the boundary conditions. From 
the principle of linear superposition, the most general solution equals the sum of all of the 
possible solutions, or 


u(r, Zz) = l A(k) k Jo(kr) e"? dk, (9.3.66) 


where we have written the arbitrary constant A,B: as A(k)k. Our final task remains to 
compute A(k). 

Before we can find A(k), we must derive an intermediate result. If we define our Fourier 
transform in an appropriate manner, we can write the two-dimensional Fourier transform 
pair as 


MN C 
f(x,9) = 5- i / F(k, £) eo^) dk dé, (9.3.67) 


where pog pe 
F(k,£) — ox / / f(x,y) e t da dy. (9.3.68) 
T J—oo J—oo 


Consider now the special case where f(x,y) is only a function of r = y x? + y?, so that 
f(x,y) = g(r). Then, changing to polar coordinates through the substitution x = r cos(6), 
y-—rsin(0), k = pcos(), and £= psin(y), we have that 


ka + ly = rp[cos(@) cos(q) + sin(@) sin(y)] = rpcos(0 — v), (9.3.69) 
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and 
dA = dz dy = r dr d0. (9.3.70) 


Therefore, the integral in Equation 9.3.68 becomes 


1 oo 2n = 
F(k, 2) = — n g(r) e- ire ees(0—9), dr do (9.3.71) 
2m 0 0 
1 oo 2n . 
=— | rg(r) | f eire cos(9— y) a) dr. (9.3.72) 
27 Jo 0 


If we introduce à = 0 — c, the integral 


2n . 2r— p . 2n . 
[! gr pep) q = / p PES qi = | e "P9509 dA = 2r Jo(pr). (9.3.73) 
0 0 


—e 


The third integral in Equation 9.3.73 is equivalent to the second integral because the integral 
of a periodic function over one full period is the same regardless of where the integration 
begins. The final result follows from the integral definition of the Bessel function. There- 
fore, 


F(k, £) = ra r g(r) Jo(pr) dr. (9.3.74) 


Finally, because Equation 9.3.74 is clearly a function of p = Vk? + ?, F(k,£) = G(p) and 


G(p) — L r g(r) Jo(pr) dr. (9.3.75) 


Conversely, if we begin with Equation 9.3.67, make the same substitution, and integrate 
over the k£ plane, we have that 


1 d i irp cos(0— 
f(x,y) = g(r) = x] n F(k,£) e"? (0 ?) odpdy (9.3.76) 
0 0 
1 oo 2n . 
E p G(p) p etr eo9(9 9) do| dp (9.3.77) 
0 0 
-| p G(p) Jo(pr) dp. (9.3.78) 
0 


Thus, we obtain the result that if f^" |F(r)| dr exists, then 


g(r) = A p G(p) Jo(er) dp, (9.3.79) 


where = 
G(p) = n r g(r) Jo(pr) dr. (9.3.80) 


Taken together, Equation 9.3.79 and Equation 9.3.80 constitute the Hankel transform pair 
for Bessel function of order 0. 'The function G(p) is called the Hankel transform of g(r). 


6 Watson, G. N., 1966: A Treatise on the Theory of Bessel Functions. Cambridge University Press, 
Section 2.2, Equation 5. 
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Figure 9.3.5: The axisymmetric potential u(r, z)/uo in the half-space z > 0 when u(r,0) = uo ifr < a 
and u(r,0) — 0 ifr >a. 


Why did we introduce Hankel transforms? First, setting z — 0 in Equation 9.3.66, we 
find that 


u(r,0) = f ^ A(k) k Jo(kr) dk. (9.3.81) 


If we now compare Equation 9.3.79 with Equation 9.3.81, we recognize that A(k) is the 
Hankel transform of u(r, 0). Therefore, 


A(k) x ru(r,0) Jo(kr) dr = w f r Jo(kr) dr = P r Ji(kr)l = T Ju (ka). 
0 0 
(9.3.82) 
'Thus, the complete solution is 
u(r, 2) = au, | J, (ka) Jo(kr) e~** dk. (9.3.83) 
0 


Equation 9.3.83 is illustrated in Figure 9.3.5. 


e Example 9.3.6: Mixed boundary-value problem 


In all of our previous examples, the boundary condition along any specific boundary 
remained the same. In this example, we relax this condition and consider a mized boundary- 
value problem. 

Consider’ the axisymmetric Laplace equation 


10 f Qu u 
| = < 3. 
-x (ra) 522 0, O<r<ow, 0<z<1, (9.3.84) 


subject to the boundary conditions 


lim |u(r, z)| < oo, lim |u(r, z)| < oo, u(r,0) = 0, (9.3.85) 
r—0 T—00 
and 
u(r,1)— 1, 0<r<a, 
wi (9.3.86) 
diri] B es a «r «oo. 
o 


T See Nir, A., and R. Pfeffer, 1979: Transport of macromolecules across arterial wall in the presence of 
local endothial injury. J. Theor. Biol., 81, 685-711. 
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The interesting aspect of this example is the mixture of boundary conditions along the 
boundary z = 1. For r € a, we have a Dirichlet boundary condition, which becomes a 
Robin boundary condition when r > a. 

Our analysis begins as it did in the previous examples with separation of variables and 
a superposition of solutions. In the present case the solution is 


Oz a 
|^ 1+0 lt+o 


oo 
u(r, z) | A(k, a) sinh(kz)Jo(kr) dk. (9.3.87) 
0 
The first term on the right side of Equation 9.3.87 arises from a separation constant that 
equals zero while the second term is the contribution from a negative separation constant. 
Note that this equation satisfies all of the boundary conditions given in Equation 9.3.85. 
Substitution of Equation 9.3.87 into Equation 9.3.86 leads to the dual integral equations: 


af A(k, a) sinh(k)Jo(kr) dk = 1, (9.3.88) 
0 


if0<r<a, and 


I A(k, a) E + E eost) Jo(kr) dk = 0, (9.3.89) 
0 
if a « r « oo. 

What sets this problem apart from the routine separation of variables is the solution 
of dual integral equations;? in general, they are very difficult to solve. The process usually 
begins with finding a solution that satisfies Equation 9.3.89 via the orthogonality condition 
involving Bessel functions. This is the technique employed by Tranter,? who proved that 


the dual integral equations: 


n ^ GO)f Joa) dd = g(a), (9.3.90) 
and v 
] 509092 =0 (9.3.91) 
0 
have the solution " 
fox ^y^ Adis) (9.3.92) 
n=0 


if G(A) and g(a) are known. The value of « is chosen so that the difference G(A) — \?*~? 
is fairly small. In the present case, f(A) = sinh(A)A(A,a), g(a) = 1, and G(A) = 1+ 
Acoth(A)/a. 

What is the value of k here? Clearly we would like our solution to be valid for a wide 
range of c. Because G(A) — 1 as ø — oo, a reasonable choice is & = 1. Therefore, we take 


sinh(k).A(k, a) = 2, Ea OY (9.3.93) 


8 The standard reference is Sneddon, I. N., 1966: Mixed Boundary Value Problems in Potential Theory. 
Wiley, 283 pp. 


9 Tranter, C. J., 1950: On some dual integral equations occurring in potential problems with axial 
symmetry. Quart. J. Mech. Appl. Math., 3, 411—419. 
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Our final task remains to find An. 
We begin by writing 


An 


oon an-1 (ka) Brn Jam—1(k 3.94 
Dens Pe = Jam-1(ka), pue 


mci 


where Bmn depends only on a and ø. Multiplying Equation 9.3.94 by the factor Jo. 1(ka) 
dk/k and integrating, 


oe An dk 
To banth(h\ /~ n— k (ka) Bum m— k = FA 
f 1-Fkcoth(k)/o erc eof > Fam—1(ka) Jap—1 (ka) k 
(9.3.95) 
Because!” = " ; 
n-1(k —1(ka) — = —— Bo 3.9 
] m6 Ji) F = tr (9.3.96) 
where bmp is the Kronecker delta: 
ES oe s (9.3.97) 
i B 0, m z P, : ` 
Equation 9.3.95 reduces to 
Jon— 1( (ka) Jo 1(ka) dk Bing 
An m = 9.3. 
[ 1+keoth(k)/o  k m-i) ARE] 
If we define aj (ka) J (ka) dk 
Qn—-1\ KA) Jam—1\ ha 
= Sinn; 9.3.99 
f 1 + kcoth(k)/o k ( ) 
then we can rewrite Equation 9.3.98 as 
DBmn 
Because!! 
2r? 
a ^ Je(kr) Jim- 1(ka) dk = Ta 1 1— —3 j> (9.3.101) 
0 a 


ifr < a, where P,,( ) is the Legendre polynomial of order m, Equation 9.3.88 can be 


rewritten 
oo oo 55 
5 5 BmnPm-1 1— A =1. (9.3.102) 


n=l m= 


Equation 9.3.102 follows from the substitution of Equation 9.3.93 into Equation 9.3.88 
and then using Equation 9.3.101. Multiplying Equation 9.3.102 by P, 1(£) d£, integrating 


10 Gradshteyn, I. S., and I. M. Ryzhik, 1965: Table of Integrals, Series, and Products. Academic Press, 
Section 6.538, Formula 2. 


11 [bid., Section 6.512, Formula 4. 
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Table 9.3.1: The Convergence of the Coefficients A, Given by Equation 9.3.104 Where 
Simn Has Nonzero Values for 1 < m,n < N 


N A, -A A = A -As gd -A 
1 2.9980 

2 31573 1.7181 

3 3.2084 2.0329 1.5978 

4 3.2300 2.1562 1.9813 14517 

5 3.2411 2.2174 2.1548 1.8631 1.3347 

6 3.2475 2.2521 2.2495 2.0670 1.7549 1.2399 

7 82515 2.2738 2.3073 2.1862 1.9770 1.6597 1.1620 

8 3.2542 2.2882 2.3452 2.2626 2.1133 1.8925 1.5772 1.0972 


between —1 and 1, and using the orthogonality properties of the Legendre polynomial, we 
have that 


Y | PnP d= f moea-[ nenoe4 — (3210 


which shows that only m — 1 yields a nontrivial sum. Thus, 


S Bn = 22m —1) J AnSmn =0, m22, (9.3.104) 
n=1 n=1 
and "m " 
XO Bin=2)_ Asin =1, of SY SmnAn = bômi (9.3.105) 
n=1 n=1 n=l 


Thus, we have reduced the problem to the solution of an infinite number of linear equations 
which yield A, — a common occurrence in the solution of dual integral equations. Selecting 
some maximum value for n and m, say N, each term in the matrix Smn, 1 < m,n < N, is 
evaluated numerically for a given value of a and c. By inverting Equation 9.3.104, we obtain 
the coefficients A, for n = 1,..., N. Because we solved a truncated version of Equation 
9.3.105, they will only be approximate. To find more accurate values, we can increase N by 
1 and again invert Equation 9.3.104. In addition to the new Ay, the previous coefficients 
will become more accurate. We can repeat this process of increasing N until the coefficients 
converge to their correct value. This is illustrated in Table 9.3.1 when o =a = 1. 

Once we have computed the coefficients A, necessary for the desired accuracy, we use 
Equation 9.3.93 to find A(k, a) and then obtain u(r, z) from Equation 9.3.87 via numerical 
integration. Figure 9.3.6 illustrates the solution when c = 1 and a = 2. 

Mixed boundary-value problems over a finite domain can be solved in a similar manner. 
Consider the partial differential equation? 


Ou lôu u Ou 
ðr? rr r? Oz? 


=0, O<r<a, 0<2z<1, (9.3.106) 


12 See Vrentas, J. S., D. C. Venerus, and C. M. Vrentes, 1991: An exact analysis of reservoir effects for 
rotational viscometers. Chem. Eng. Sci., A6, 33-37. 


404 Advanced Engineering Mathematics with MATLAB 


Figure 9.3.6: The solution of the axisymmetric Laplace's equation, Equation 9.3.84, with u(r, 0) = 0 and 
the mixed boundary condition, Equation 9.3.86. Here we have chosen c = 1 and a = 2. 


subject to the boundary conditions 
lim lu(r,z) «oo, u(a,z)=0, O<z<1, (9.3.107) 
T— 


and 
(9.3.108) 


We begin by solving Equation 9.3.106 via separation of variables. This yields 


u(r, z) = D An sinh(k,z) J4 (kr), (9.3.109) 


n=1 


where kp is the nth root of Jj(ka) = 0. Note that Equation 9.3.109 satisfies all of the 
boundary conditions except those along z = 1. Substituting Equation 9.3.109 into Equation 
9.3.108, we find that 


As sinh(kn) A(knr) 2r, OSr<1, (9.3.110) 
n=l 
and so 
So kn An cosh(k;) Ji(knr) 20, 1«r&a. (9.3.111) 
n=1 


Equations 9.3.110 and 9.3.111 show that in place of dual integral equations, we now have 
dual Fourier-Bessel series. Cooke and Tranter!? have shown that the dual Fourier-Bessel 


series 
oo 


ayJy(k,r) 20, Ll<r<a, -l<y, (9.3.112) 
1 


n= 


Other examples include: 


Sherwood, J. D., and H. A. Stone, 1997: Added mass of a disc accelerating within a pipe. Phys. 
Fluids, 9, 3141-3148. 


Galceran, J., J. Cecilia, E. Companys, J. Salvador, and J. Puy, 2000: Analytical expressions for 
feedback currents at the scanning electrochemical microscope. J. Phys. Chem. B, 104, 7993-8000. 


13 Cooke, J. C., and C. J. Tranter, 1959: Dual Fourier-Bessel series. Quart. J. Mech. Appl. Math., 12, 
379-386. 
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Figure 9.3.7: The solution of Equation 9.3.106, which satisfies the boundary condition, Equation 9.3.107, 
and the mixed boundary condition, Equation 9.3.108. Here we have chosen a — 2. 


where J,(k,a) = 0, will be automatically satisfied if 


het? J2 1(kna)an = X bmJv42m+14p/2(km), (9.3.113) 


m=0 


where |p| € 1. Because an = knAn cosh(kn) and v = 1 here, An is given by 
ke cosh(kn) Ja (kna)An = X` Ba Ja (9.3.114) 
m=1 


if we take p = 0. 
Substitution of Equation 9.3.114 into Equation 9.3.110 gives 


2» Brn »» sinh Fn) Jm) RT) _ (9.3.115) 


k2 cosh(k,)J2 (kna) 


n=1 


Multiplying both sides of this equation by rJ(kpr) dr, p = 1,2,3,..., and integrating from 
0 to 1, we find that 


= sinh (Kn) Jam(kn)Qpn T à 
By = Jy(kpr) d 9.3.116 
2. Bo d Teoh) Ena) Ja "UK" ae 
where 
1 kp Ji (kn) Jo(kp)—kn Ji (kp) Jo (kn) 
= M k2 —k2 ’ n # P, 
Qpr = f Ji (kpr) Jı (knr) r dr = i JË ko) - Jo (bg) Jal) MA (9.3.117) 
Carrying out the integration, Equation 9.3.116 yields the infinite set of equations 
> Jo(k 
$5 Mos Bm = : p), (9.3.118) 
m=1 p 


where 


^. sinh(ks)Jom (kn) Qpn 
< k2 cosh(kn)J2 (kna) ` 


Mpm = (9.3.119) 
Once again, we compute Bm by truncating Equation 9.3.118 to M terms and inverting the 
systems of equations. Increasing the value of M yields more accurate results. Once we have 
Bm, we use Equation 9.3.114 to find A,. Finally, u(r, z) follows from Equation 9.3.109. 
Figure 9.3.7 illustrates u(r, z) when a = 2. 


406 Advanced Engineering Mathematics with MATLAB 


e Example 9.3.7 


Let us find the potential at any point P within a conducting sphere of radius a. At the 
surface, the potential is held at Vo in the hemisphere 0 < 0 < 7/2, and —Vo for 7/2 « 0 « m. 
Laplace's equation in spherical coordinates is 


ð f ðu) i əf, Qu] | 
Or (« 5) " sin(8) 90 E 2] sü, 0<r<a, 0<0 <x, (9.3.120) 


To solve Equation 9.3.120 we set u(r, 0) = R(r)O(0) by separation of variables. Substituting 
into this equation, we have that 


m G |) E a 4 EP =k’, 9.3.121) 

or 
PR p= =; 9.3.122) 

and 
TO hog] TK =0. 9.3.123) 


A common substitution replaces 0 with u = cos(0). Then, as 0 varies from 0 to 7, u varies 
from 1 to —1. With this substitution, Equation 9.3.123 becomes 


+ k?@ — 0. (9.3.124) 


'This is Legendre's equation, which we examined in Section 6.4. Consequently, because the 
solution must remain finite at the poles, k? = n(n + 1), and 


6, (0) = P, (u) = P, [cos(0)], (9.3.125) 


where n — 0,1,2,3,.... 

Turning to Equation 9.3.122, this equation is the equidimensional or Euler-Cauchy 
linear differential equation. One method of solving this equation consists of introducing a 
new independent variable s so that r = ef, or s = In(r). Because 

d dsd | d 


dr m dr ds =e ds’ (9.3.126) 


it follows that 


È dí ,d _,d(_,d [È d 
i-a a) = slc £) = (is-) ten) 


Substituting into Equation 9.3.122, 


d^R, " d Rs, 
ds? ds 
Equation 9.3.128 is a second-order, constant coefficient ordinary differential equation, which 
has the solution 


— n(n 4- 1), = 0. (9.3.128) 


Rn (s) = Cne™ + D,e- 0*5 = C, exp[nIn(r) + D, exp[—(n + 1)In(r)|  (9.3.129) 
= Cn exp[In(r^)] + Dn exp[ln(r !^*)] = Car” + Dar. (9.3.130) 
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A more convenient form of the solution is 


T T 


FR, (r) = An Ey + B, 15 (9.3.131) 


where A, = a"C, and B, = Dar, We introduced the constant a, the radius of the 
sphere, to simplify future calculations. 

Using the results from Equation 9.3.125 and Equation 9.3.130, the solution to Laplace’s 
equation in axisymmetric problems is 


u(r, 0) = S |^ Cy + Bn Ey P, [cos(0)]. (9.3.132) 


n=0 


In our particular problem we must take B, = 0 because the solution becomes infinite at 
r = 0 otherwise. If the problem had involved the domain a < r < oo, then A, = 0 because 
the potential must remain finite as r + oo. 

Finally, we must evaluate A,. Finding the potential at the surface, 


> Vo, O<p<l, 
u(a, p) = 2, AP) = { Vp, ae e p (9.3.133) 


Upon examining Equation 9.3.133, it is merely an expansion in Legendre polynomials of 
the function 


_ J Vo, 0cpu&l, 
f(u) = tu Ae e (9.3.134) 
Consequently, from Equation 9.3.133, 
2n 4-1 ft 
An = ZE f Fw) Pol) ds (9.3.135) 
—1 


Because f(u) is an odd function, A, = 0 if n is even. When n is odd, however, 


An = (2n + 1) [ Vo Pr (u) dp. (9.3.136) 


We can further simplify Equation 9.3.136 by using the relationship that 


I P (t) dt = uu [Pa (£) — Pa (2)], (9.3.137) 


where n > 1. In our problem, then, 


0, n even. 


uus { Vo [Pn—1(0) — Pn+1(0)], n odd, (9.3.138) 


The first few terms are A, = 3Vo/2, A3 = —7Vo/8, and As = 11Vo/16. 
Figure 9.3.8 illustrates our solution. It was created using the MATLAB script 
clear 
N = 51; dr = 0.05; dtheta = pi / 15; 
^ compute grid and set solution equal to zero 
r = [0:dr:1]; theta = [0:dtheta:2*pi]; 
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Figure 9.3.8: Electrostatic potential within a conducting sphere when the upper hemispheric surface has 
the potential 1 and the lower surface has the potential —1. 


mu = cos(theta); Z =r’ * mu; 


for L = 1:2 
if L == 1X = r? * sin(theta); 
else X = -r’ * sin(theta); end 


u = zeros(size(X)); 
^ compute solution from Equation 9.3.132 

rfactor = r; 

for n = 1:2:N 
A = legendre(n-1,0); B = legendre(n+1,0); coeff = A(1)-B(1); 
C = legendre(n,mu); Theta = C(1,:); 
u u + coeff * rfactor’ * Theta; 
rfactor = rfactor .* r .* rj; 


end 

surf(Z,X,u); hold on; end 
xlabel(?Z?,?Fontsize?,20); ylabel(’X’,’Fontsize’ ,20) 
zlabel(’u(R,\theta )’,’Fontsize’ ,20) ; 
Here we have the convergence of the equipotentials along the equator and at the surface. 
The slow rate at which the coefficients are approaching zero suggests that the solution 
suffers from Gibbs phenomena along the surface. o 


e Example 9.3.8 


We now find the steady-state temperature field within a metallic sphere of radius a, 
which we place in direct sunlight and allow to radiatively cool. This classic problem, first 
solved by Rayleigh,!^ requires the use of spherical coordinates with its origin at the center 


14 Rayleigh, J. W., 1870: On the values of the integral Jh QuQ; dp, Qn, Qn being Laplace's coefficients 


of the orders n, n’, with application to the theory of radiation. Philos. Trans. R. Soc. London, Ser. A, 
160, 579—590. 
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of the sphere and its z-axis pointing toward the sun. With this choice for the coordinate 
system, the incident sunlight is 


. f D(0) cos(0), 0x98 
Do) -1 0, n/2< 


If heat dissipation takes place at the surface r = a according to Newton’s law of 
cooling and the temperature of the surrounding medium is zero, the solar heat absorbed by 
the surface dA must balance the Newtonian cooling at the surface plus the energy absorbed 
into the sphere’s interior. This physical relationship is 


< iE 
Pl (9.3.139) 


(1 — p) D(0) dA = eu(a,0) dA +K dA, (9.3.140) 


Ou(a, 0) 
Or 
where p is the reflectance of the surface (the albedo), e is the surface conductance or 


coefficient of surface heat transfer, and « is the thermal conductivity. Simplifying Equation 
9.3.140, we have that 


P D(6) -u(a, 0) (9.3.141) 


for r — a. 

If the sphere has reached thermal equilibrium, Laplace's equation describes the tem- 
perature field within the sphere. In the previous example, we showed that the solution to 
Laplace's equation in axisymmetric problems is 


T 


u(r,0) = 5 An (=) + Bn Ey P, [cos(0)]. (9.3.142) 


In this problem, B,, — 0 because the solution would become infinite at r — 0 otherwise. 
Therefore, 


u(r,8) = V An (=) P, [cos(0)]. (9.3.143) 
n=0 
Differentiation gives 
Ou = nr”! 
"im 3 An zx Palcos(0)]. (9.3.144) 


A. (^ 2 3 P, [cos(6)] = (=) D(0), (9.3.145) 
Or 
Di) = [A | Anau) = Y 650) (9.3.146) 
where 
Cy = Eu An, and p= cos(0). (9.3.147) 


We determine the coefficients by 


2n 4-1 


c, - TE | DPA du = FEDO) | Pal) de (9.3.148) 
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Figure 9.3.9: The difference (in °C) between the temperature field within a blackened iron surface of 
radius 0.1 m and the surrounding medium when we heat the surface by sunlight and allow it to radiatively 
cool. 


Evaluation of the first few coefficients gives 


a(1—9)D(0) ^ , _ 5a(1 — p)D(0) 
atem) * 16(2& + ca) ' 


Aa —0, (9.3.149) 


3a(1 — p)D(0) 13a(1 — p) D(0) 


s As —0, Ag = Az = 3.1 
Í 324k-ea) ^ ^? SI 256(6%-+ea) ^ ^" 0, (9.3.150) 
17a(1 — p)D(0) 49a(1 — p)D(0) 
As = Ag —0, and Ajo = 9.3.151 
8 512(Bk bea) ' 9 , an 10 2048(10K + ca) ( ) 


Figure 9.3.9 illustrates the temperature field within the sphere with D(0) = 1200 W/m?, 
k = 45 W/m K, e = 5 W/m? K, p = 0, and a = 0.1 m. This corresponds to a cast iron 
sphere with blackened surface in sunlight. This figure was created by the MATLAB script 


clear 

dr = 0.05; dtheta = pi / 15; 

D-O = 1200; kappa = 45; epsilon = 5; rho = 0; a = 0.1; 

^ compute grid and set solution equal to zero 

r = [0:dr:1]; theta = [0:dtheta:pil]; 

mu = cos(theta); Z =r’ * mu; 

aaaa = (1-rho) * DO / ( 4 * epsilon); 

aa(1) = a * (1-rho) * DO / ( 2 * ( kappatepsilon*a)) ; 

aa(2) = 5 * a * (1-rho) * DO / ( 16 * (2*kappatepsilon*a)) ; 
aa(3) = 0; 

aa(4) = - 3 * a * (1-rho) * DO / ( 32 * (4*kappatepsilon*a)) ; 
aa(5) = 0; 

aa(6) = 13 * a * (1-rho) * D-O / ( 256 * (6*kappatepsilon*a)) ; 
aa(7) = 0; 

aa(8) = -17 * a * (1-rho) * D.O / ( 512 * (8*kappa*tepsilon*a)); 
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S P(r,0) 


Figure 9.3.10: Point charge +q in the presence of a grounded conducting sphere. 


aa(9) = 0; 
aa(10) = 49 * a * (1-rho) * D_O / (2048 * (10*kappatepsilon*a)); 
for L = 1:2 

if L == 1X = r? * sin(theta); 

else X = -r’ * sin(theta); end 

u = aaaa * ones(size(X)); 

rfactor = r; 

for n = 1:10 

A = legendre(n,mu); Theta = A(1,:); 

u + aa(n) * rfactor’? * Theta; 
rfactor = rfactor .* r; 


u 


end 

surf(Z,X,u); hold on; end 
xlabel(?Z?,?Fontsize?,20); ylabel(’X’,’Fontsize’ ,20); 
zlabel(’U(R,\theta )’,’Fontsize’ ,20); 
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The temperature is quite warm with the highest temperature located at the position where 


the solar radiation is largest; the coolest temperatures are located in the shadow region. 


e Example 9.3.9 


In this example we find the potential at any point P exterior to a conducting, grounded 
sphere centered at z = 0 after we place a point charge +q at z = a on the z-axis. See Figure 
9.3.10. From the principle of linear superposition, the total potential u(r, 9) equals the sum 
of the potential from the point charge and the potential v(r, 0) due to the induced charge 


on the sphere 


u(r,0) — : + u(r, 0). 


(9.3.152) 
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In common with the first term q/s, v(r, 0) must be a solution of Laplace’s equation. In 
Example 9.3.7 we showed that the general solution to Laplace's equation in axisymmetric 
problems is 


v(r,6) = 3 | (=) + Bn (2) 7 P, [cos(0)]. (9.3.153) 


Because the solutions must be valid anywhere outside of the sphere, An = 0; otherwise, the 
solution would not remain finite as r > oo. Hence, 


oo —1—n 
v(r,6) = Y B, (=) P,,[cos(6)]. (9.3.154) 
n=0 ro 
We determine the coefficient B,, by the condition that u(ro, 0) = 0, or 
= + DB, P, |cos(0)] = 0. 9.3.155 
S lon sphere 2, [pos ] ( ) 


We need to expand the first term on the left side of Equation 9.3.155 in terms of 
Legendre polynomials. From the law of cosines, 


s = yr? +a? — 2ar cos(0). (9.3.156) 
Consequently, if a > r, then 
1 91-1/2 
ec h — 2 cos(6)-- + (5) | (9.3.157) 
a a a 


In Section 6.4, we showed that 


(See) = Pan. (9.3.158) 
n=0 
Therefore, 
1 — TA 
z=- 2 Palcos(6)] (2) à; (9.3.159) 
From Equation 9.3.155, 
C Iq (Toy a 
» E (2) + B,| P, [cos(6)] = 0. (9.3.160) 


We can only satisfy Equation 9.3.160 if the square-bracketed term vanishes identically so 
that 


Bote (3) (9.3.161 
a a 


On substituting Equation 9.3.161 back into Equation 9.3.154, 


u(r, 0)  - 2 se (2) P, [cos(6)]. (9.3.162) 
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Figure 9.3.11: Electrostatic potential outside of a grounded conducting sphere in the presence of a point 
charge located at a/ro — 2. Contours are in units of —q/ro. 


The physical interpretation of Equation 9.3.162 is as follows: Consider a point, such as 
a’ (see Figure 9.3.10) on the z-axis. If r > a’, the Legendre expansion of 1/s' is 


s 


2-iXnewe(T). r>a (9.3.163) 
n=0 


Using Equation 9.3.163, we can rewrite it as 


v(r,9) = ——, (9.3.164) 
as 
if we set a’ = r?/a. Our final result is then 
/ 
u(r, 6) = I = (9.3.165) 


provided that q’ equals rgq/a. In other words, when we place a grounded conducting sphere 
near a point charge +q, it changes the potential in the same manner as would a point charge 
of the opposite sign and magnitude q’ = roq/a, placed at the point a’ = r2/a. The charge 
q' is the image of q. 

Figure 9.3.11 illustrates the solution, Equation 9.3.162, and was created using the 
MATLAB script 


clear 

a_over_rO = 2; 

^ set up x-z array 

dx = 0.02; x = -3:dx:3; dz = 0.02; z = -3:dz:3; 

u = 1000 * zeros(length(x) ,length(z)) ; 

X = x? * ones(1,length(z)); Z = ones(length(x),1) * z; 
^4 compute r and theta 
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rr = sqrt(X .* X + Z .* Z); 
theta = atan2(X,2); 
^ find the potential 
r.over.aprime = a over rO * rr; 
s = 1 + r_over_aprime .* rover aprime ... 
- 2 * r_over_aprime .* cos(theta) ; 

for j = 1:length(z); for i = 1:length(x); 

if rr(i,j) >= 1; u(i,j) = 1 ./ sqrt(s(i,j)); end; 
end; end 
^4 plot the solution 
[cs,h] = contourf(X,Z,u); colormap(hot); brighten(hot,0.5); 
axis square; clabel(cs,h,’manual’,’Fontsize’ ,16); 
xlabel(’X’,’Fontsize’,20); ylabel(’Z’,’Fontsize’ ,20); 
Because the charge is located directly above the sphere, the electrostatic potential for any 
fixed r is largest at the point 0 = 0 and weakest at 0 = 7. 


e Example 9.3.10: Poisson's integral formula 


In this example we find the solution to Laplace’s equation within a unit disc. The 
problem can be posed as 

o?u lôu, 18u . 

Or? rOr r?ðg — 


0 O<r<l1, 0<y<2r, (9.3.166) 


with the boundary condition u(1, p) = f(y). 
We begin by assuming the separable solution u(r, p) = R(r)®(y) so that 


r2 R" + rR'! o" 


Sik 9.3.167 
R 5 ( ) 
The solution to ®” + k?6 = 0 is 
(y) = Acos(ky) + Bsin(ky). (9.3.168) 
The solution to R(r) is 
R(r) = Cr* + Dr-*. (9.3.169) 
Because the solution must be bounded for all r and periodic in y, we must take D = 0 and 
k =n, where n = 0,1,2,3,.... Then, the most general solution is 
u(r, p) = $40 + >D [an cos(nq) + bn sin(ng)] r”, (9.3.170) 
n=1 


where an and b, are chosen to satisfy 


u(1, p) = f(¢) = $ao + 5 an cos(no) + bn sin(ny). (9.3.171) 
n=1 
Because 
An = +f f(A) cos(n@) d0, bn = i f (0) sin(n0) dé, (9.3.172) 
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we may write u(r, p) as 


uec +f no DS +e Sain oi dd. (9.3.173) 


If we let a = 0 — o, and z = r|cos(a) + isin(o)], then 


Y resa) a -a(-) -al uus esate 


n=0 n=0 
1- od 
-R r cos(a) + ir sin(a) (9.3.175) 
1 — 2r cos(a) + r? 
for all r such that |r| < 1. Consequently, 
2 1 — r cos(a) 
.3.17 
Ze cos(na) = jocum c (9.3.176) 
L x 1—rcos(o) 1 
= i = 9.3.177 
2 i > din wal 1— 2rcos(a) +r? 2 ( ) 
1 l-r? 
= ] 9.3.178 
21-— 2rcos(o) +r? ( ) 
Substituting Equation 9.3.178 into Equation 9.3.173, we finally have that 
0 T f(0) dew dó (9.3.179) 
TEST . 3. 
Un P) = Qr a 1 — 2r cos(0 — y) + r? 


This solution to Laplace’s equation within the unit circle is referred to as Poisson’s integral 
formula.!° 


Problems 
Rectangular Coordinates 


Solve Laplace’s equation over the rectangular region 0 < x < a, 0 < y < b with the following 
boundary conditions. Illustrate your solution using MATLAB. 


1. u(x, 0) = u(z, b) = u(a,y) = 0, u(0,y) = 1 
2. u(x, 0) = u(0, y) = u(a, y) = 0, u(a,b) = x 
3. u(x, 0) = u(0, y) = ula, y) = 0, u(a,b) = £z — a 


4. u(x,0) = u(0, y) = u(a, y) = 0, 


15 Poisson, S. D., 1820: Mémoire sur la maniére d’exprimer les fonctions par des séries de quantités 


périodiques, et sur l'usage de cette transformation dans la résolution de différens problémes. J. Ecole 
Polytech., 18, 417—489. 
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E 2r/a, 0<a<a/2, 
We oe ie a/2 « x « a. 


5. u(0,y) = ula, y) = u(z,0) = 0,u(z, 5) = 1 
6. uy(x,0) = u(z,b) = u(a,y) = 0,u(0,y) = 1 
7. uy(2,0) = uy(z, b) =0,u(0,y) = ula, y) =1 
8. usla, y) = uy(z,b) = 0,u(0,y) = u(z,0) = 1 
9. uy(x,0) = u(x, b) =0,u(0,y) = u(a,y) = 1 
10. u(a, y) = u(x, b) = 0, u(0,y) = u(z,0) = 1 
11. ur(0,y) =0,u(a, y) = u(a,0) = u(z,b) = 1 


12. u4(0, y) = uzla, y) = 0, u(x, b) = ui, 
t9) = | 4 0<a<a, 


0, Qa « c « a. 


13. Variations in the earth's surface temperature can arise as a result of topographic 
undulations and the altitude dependence of the atmospheric temperature. These variations, 
in turn, affect the temperature within the solid earth. To show this, solve Laplace's equation 
with the surface boundary condition that 


u(z,0) = To + AT cos(212/ A), 


where A is the wavelength of the spatial temperature variation. What must be the condition 
on u(z, y) as we go towards the center of the earth (i.e., y — oo)? 


14. Tóth!Ó generalized his earlier analysis of groundwater in an aquifer when the water 
table follows the topography. Find the groundwater potential if it varies as u(x, zo) = 
g[2o + cz + asin(bx)] at the surface y = zo, while uz(0, y) = uz(L, y) = uy(z,0) = 0, where 
g is the acceleration due to gravity. Assume that bL Z nz, where n = 1,2,3,.... 


15. During his study of fluid flow within a packed bed, Grossman!" solved 


u Oru 
sa +a =0, 0 1, 0 L 
ie oe , «r«l, «ycbL, 


subject to the boundary conditions 
u(r,0)-— L, u(x,L)- 90, 0<2a<1, 


and 
uUz(0,y) — 0, uz(Ly)-— —*, O<y<L. 


16 Tóth, J. A., 1963: A theoretical analysis of groundwater flow in small drainage basins. J. Geophys. 
Res., 68, 4795-4812. 


17 Grossman, G., 1975: Stresses and friction forces in moving packed beds. AICHE J., 21, 720-730. 
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What should he have found? Hint: Introduce u(x, y) = L — y + yv(zx, y). 
Cylindrical Coordinates in Finite Domains 
16. Solve 


0?u Lou, Ou _ 
ðr? rr O2 — 


with 


u(a,z) 20, and 


17. During their study of the role that diffusion plays in equalizing gas concentrations 
within that portion of the lung that is connnected to terminal bronchioles, Chang et al.18 
solved Laplace's equation in cylindrical coordinates 


u 10u 8u 
! =0, 0<r<b -L L, 
Or? ror Of SS dide: 


subject to the boundary conditions that 


b 
lim |u(r, z)| < oo, OM) = 0, -L«z«L, 
r—0 Ór 


and 


What should they have found? 


18. Solve!? 


Ou lôu O?u 
—0, 0< b, 0 L 
br? | ror J2 ; £< 0; <z<, 


with the boundary conditions 


lim |u(r, z)| < oo, ——— =0, 0<z<L, 
r—0 Ór 


and 


19. Solve 
0?u du, Ou E 
ðr? rr O2 — 


0, O<r<a, O0«z«h, 


18 Chang, D. B., S. M. Lewis, and A. C. Young, 1976: A theoretical discussion of diffusion and convection 
in the lung. Math. Biosci., 29, 331—349. 


19 See Keller, K. H., and T. R. Stein, 1967: A two-dimensional analysis of porous membrane transfer. 
Math. Biosci., 1, 421—437. 
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with 5 
wee?) (p.h) = 
and 
du(r, 0) — 1, 0 <r< To; 
Oz | \0, To « T <a. 
20. Solve E T E 
u u u 
| = < 
Bp ror Oz? LE n M c 
with 
Ou(1,z)  Ou(r,0) | 0 
ðr i et” 
and 
(rdi —1, O<r<a, b<r<l, 
WO) — 3 1/2 — a?) — 1, a«r «b. 
21. Solve”? 


Ou lôu u Ou 
| = <r<1 1. 
ðr? rór m pe 0, Osr<l, 0<z< 


Take for the boundary conditions either (a) 


lim |u(r,z)| <œ, u(1,z) =—-1, 0<z<1, 
r—0 
and 
uz(r,0) = u(r, 1) = 0, 0<r<l; 
or (b) 
lim Ju(r,z)| «oo, u(1,z) =0, 0cz«l, 
r—0 
and 
uz(r,0) =0, u(r,1)=r, 0«r «I. 
22. Solve ga 5 ga 
u lôu u u 
= < h 
a2 trop T pg 0, O<r<a, O0<z<h, 
with 
lim |u(r, z)| « oo, u(a,z) =0, 0<z<h, 
r—0 
and 
u(r,0) 20, uz(r,h)- Ar, O<r<a. 
23. Solve 


u 1du u i 3u u 
Oo? ` rðr vr? 8g 


0, O<r<a, 0«zc«1, 


?0 See Muite, B. K., 2004: The flow in a cylindrical container with a rotating end wall at small but finite 
Reynolds number. Phys. Fluids, 16, 3614-3626. 
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with 
Tim |u(r, z)«oo, u(a,z) =z, 0<z<1, 
and 
u(r,0) = u(r, 1) = 0, 0<r<a. 
24. Solve E 5 E 
u 1ldu u u 
' =i dx 0 h, 
ðr? rðr r? Oz? SUP ds 
with " 
lim |u(r, z)| < oo, BUG cdi 0<z<h, 
r—0 Or 
and 5 j 
u(r, 0) = 0, Hn Ps O<r<a. 
25. Solve 
u  10u u Puy argi LN 
Or? rr r2 Og ^ = i A 
with the boundary conditions 
1 
lim |u(r, z)| « oo, Bu = u(1, z), —a«z«a, 
r—0 Or 
Ou(r,—a) _ ðulr, a) 
u(r, —a u(r, a 
2 = : 0xr «I1. 
Oz Oz , t 
26. Solve?! E r E 
u  10u u 
20  D£rel, a L, 
ðr? rr Oz? 7 ile rere 
subject to the boundary conditions that 
lim ju(r,z)| <œ, ur(1,z) = —hu(l,z), 0<z< L, 
r> 


and 
u(r, 0) = Uo, u(r, L) = 0, 0 ars 1, 


using the Fourier-Bessel series 
u(r, z) = M. AnZn(z)Jo(knr) 
n=1 


where kn is the nth root of k Ji(k) + h Jo(k) = h Jo(k) — k Jı (k) = 0. 


419 


21 See Stripp, K. F., and A. R. Moore, 1955: The effects of junction shape and surface recombination on 


transistor current gain — Part II. Proc. IRE, 43, 856-866. 
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27. Solve?? Laplace's equation in cylindrical coordinates 


9?u l 1 ðu Pu 


| = < L 
Ər? rür pe 0, O<r<a, O«zc«L, 
subject to the boundary conditions that 
lim lu(r,z) «oo, —Du,(a,z) = Ku(a, z), 0<z<L, 
r> 
and 
u(r,0) = uo, uz(r,L)- 90, O0<r<a. 


28. Solve?’ the partial differential equation 


pe 18 a 
wa tage tage ee O0<r<l1, 0<z<1, 


subject to the boundary conditions that 
lim|u(r,z) «oo, u(1,z)=1, 0cz«l, 
r—0 


and 


u(r,0)—1, u(r,1)=1, O<r<l. 


1 
Hint: Break the problem into three parts: u(r,z) = u1(r, z) + us(r, z) + us(r, z), where 


3u 1 Qui 0?u, 2 
= ; O0<r<l1, 0«zc«l, 
pa "ye A ua: 2d ? 
subject to the boundary conditions that 
lim |ui(r,z)) <œ, 21(1,2) — 1, 0cz«l, 
r—0 


and 


9? 18 9? 
ae d d “2 = auz, 0<r<1, 0<z<1, 
Or? r Or 


subject to the boundary conditions that 


lim |wa(r, z)| «oo, uwa(l,z) =0, 0<z<l, 
T 


22 See Bischoff, K. B., 1966: Transverse diffusion in catalyst pores. Indust. Engng. Chem. Fund., 5, 
135-136. 


?3 See Gunn, D. J., 1967: Diffusion and chemical reaction in catalysis and absorption. Chem. Engng. 
Sci., 22, 1439-1455; Ho, T. C., and G. C. Hsiao, 1977: Estimation of the effectiveness factor for a cylindrical 
catalyst support: A singular perturbation approach. Chem. Engng. Sci., 32, 63-66. 
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and 
u»(r,0) — 1, ue(r,1) =0, 0<rx<l; 


ius o? 18 a? 
ua ua ua 
Or? T r Or = Oz? 


subject to the boundary conditions that 


= a" us, 0<r<1, 0<z<1, 


lim lus(r,z)| « oo, ug(1,z) =0, 0<z<l, 
rs 


and 
u3(r,0) =0,  us(r, 1) = 0, O0<r<l. 


Cylindrical Coordinates on Infinite and Semi-Infinite Domains 


29. Solve?4 Laplace's equation in cylindrical coordinates 


^u x 1 ðu m Oru 
ðr? rðr Oz — 


0, 0<r<a, —oo«z«oo, 


subject to the boundary conditions that 


l _f-v, |z| < d/2, 
imhnal<o waa) { Qi 42 


and 
lim u(r, z) > 0, O<r<a. 
|z|-oo 


Hint: Show that the solution can be written 


is hz knr 
uz)  -V + > An eon ( B5) a( E ) |z| < d/2, 


n=1 


and " : : 
nl nT 
u(r,z) — 2, Bn exp (-=£) Jo (=) , k| >d/2 
with the additional conditions that 
u(r, d~ /2) = u(r, d* /2) and u,(r,d- /2) = u,(r,d*/2, O<r<a, 
where kpn is the nth root of Jo(k) = 0. Then find A, and B,. 
30. Solve Laplace's equation in cylindrical coordinates 


LM 1du | Ou u 
ðr? rr Oz 


0, O<r<b, 0«z«oo, 


24 See Striffler, C. D., C. A. Kapetanakos, and R. C. Davidson, 1975: Equilibrium properties of a rotating 
nonneutral E layer in a coupled magnetic field. Phys. Fluids, 18, 1374-1382. 
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subject to the boundary conditions that 


Ou(b 
lim |u(r, z)| < oo; UL 0«z«oo, 
r0 Ór 
and 
, A, O<r<a, 
a SS sing { 0, a<r<b. 
31. Solve?° 


Ou | 10u | Ou du _4 (eed. Uses 
Or? rr O22) Oz ^ i * REN 


with the boundary conditions 


Ou(1 
lim |u(r, z)| < oo, PATER) = —Bu(1, z), 0«z, 
r—0 Ór 
and 
u(r,0) = 1, lim |u(r, z)| < oo, 0<r<l, 
ZOO 


where B is a constant. 


32. Solve? 
^u l 10u 8u 1 ðu 


Or? adu OF Həz 


with the boundary conditions 


0, O<r<b 0«z«oo, 


. Qu(b,z) - 
lim lu(r, z)| < oo, — 7 —hu(b, z), 0<z, 
lim [u(r, z)| < oo, O0<r<b, 


ZOO 


and (r,0) 
u(r, 0 _ JQ, 0<r<a, 
H -u9) - 1$ a € r «b, 


where b > a. 
Mixed Boundary- Value Problems 
33. Solve?" 


0?u ,10u, u 
Ər? r Or O2 


0, O<r<w, 0<z<%, 


?5 See Kern, J., and J. O. Hansen, 1976: Transient heat conduction in cylindrical systems with an axially 
moving boundary. Int. J. Heat Mass Transfer, 19, 707—714. 


?6 See Smirnova, E. V., and I. A. Krinberg, 1970: Spatial distribution of the atoms of an impurity element 
in an arc discharge. I. J. Appl. Spectroscopy, 13, 859—864. 


27 See Fleischmann, M., and S. Pons, 1987: The behavior of microdisk and microring electrodes. J. 
Electroanal. Chem., 222, 107-115. 
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subject to the boundary conditions 


lim |u(r, z)| < oo; lim |u(r, z)| < oo, 0«z«oo, 
T—0 T—O00 

lim u(r, z) — Uso; 0xr«oo, 

z= 00 


and the mixed boundary condition 


where u44 and Au are constants. 


Step 1: Show that 
u(r, z) = Uso -f A(k)e7** Jo(kr) dk 
0 


satisfies the partial differential equation and the boundary conditions as r > 0, r > oo, 
and z — oo. 


Step 2: Show that 
if kA(k)Jo(kr) dk = 0, a «r«oo. 
0 


Step 3: Using the relationship?’ 


2 (a? — r2)-8 r<a 
sin(ka)Jo(kr) dk = 3 ] 
0 0, r>a. 


show that kA(k) = C sin(ka). 


Step 4: Using the relationship?? 


BM dk — 1/2, r «a, 
f sin(ka)Jo(kr) T~ { ia eh, l 


sin r>a 
show that T m 
u(r, z) = Uso — ies e^^* sin(ka)Jo(kr) —. 
"T Jo 
34. Solve Laplace's equation?? 
Oru u 
ie op —oo«mz«oo, Ü«yc«oo, 


28 Gradshteyn and Ryzhik, op. cit., Section 6.671, Formula 7. 
?9 [bid., Section 6.693, Formula 1 with v — 0. 


30 See Yang, F.-Q., and J. C. M. Li, 1995: Impression and diffusion creep of anisotropic media. J. Appl. 
Phys., 77, 110-117. See also Shindo, Y., H. Tamura, and Y. Atobe, 1990: Transient singular stresses 
of a finite crack in an elastic conductor under electromagnetic force (in Japanese). Nihon Kikai Gakkai 
Rombunshu (Trans. Japan Soc. Mech. Engrs.), Ser. A, 56, 278-282. 
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Problem 34 
with the boundary conditions 
lim u(z, y) > 0, 0«y«oo, 
|z| oo 
lim u(z,y) > 0, —oo«mz «oo, 
yoo 


and the mixed boundary condition 


uy(z,0) = 1, 0 € |z| « 1, (1) 
u(x,0) = 0, 1 < |z| < oo. 


This problem would arise in finding the electrostatic potential inside the half-space y > 0 
when the boundary z = 0 is grounded for |x| > 1 and the vertical electric field equals one 
for |a| « 1. 


Step 1: Using separation of variables or transform methods, show that the general solution 
to the problem is 


u(z,y) = [ A(k)e~*¥ cos(ka) dk. 
Step 2: Using boundary condition (1), show that A(k) satisfies the dual integral equations 
i kA(h)cos(ke)dk=-%,  0<|a| <1, 
and i 


n Aye dem U. bela ete. 
0 


Step 3: Using integral tables,?! show that 
2 Td (k) 
2k 


A(k) — 
satisfies both integral equations given in Step 2. 


Step 4: Show that the solution to this problem is 
P dk 
u(z,y) = -f Ji (kje ™” cos(kz)-. 
0 


In particular, verify that u(z,0) = —V1 — z? if |z| < 1. 


31 Gradshteyn and Ryzhik, op. cit., Section 6.671, Formula 2 and Section 6.693, Formula 2 with v = 1. 
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Spherical Coordinates 


35. Find the steady-state temperature within a sphere of radius a if the temperature along 
its surface is maintained at the temperature u(a, 0) = 100[cos(0) — cos? (0)]. 


36. Find the steady-state temperature within a sphere if the upper half of the exterior 
surface at radius a is maintained at the temperature 100 while the lower half is maintained 
at the temperature 0. 


37. The surface of a sphere of radius a has a temperature of zero everywhere except in a 
spherical cap at the north pole (defined by the cone 0 = o), where it equals Tg. Find the 
steady-state temperature within the sphere. 


Poissons Integral Formula 


38. Using the relationship 


2n d 

p 27 

J = > [<1 
o l-bcos(y) vi- 


and Poisson’s integral formula, find the solution to Laplace’s equation within a unit disc if 
u(1,~) = F(p) = To, a constant. 


9.4 POISSON’S EQUATION ON A RECTANGLE 
Poisson’s equation?? is Laplace’s equation with a source term: 


Pu Bu 


daz t Bye — fes (9.4.1) 


It arises in such diverse areas as groundwater flow, electromagnetism, and potential theory. 
Let us solve it if u(0, y) = u(a, y) = u(x, 0) = u(z, b) = 0. 
We begin by solving a similar partial differential equation: 


u u 
ana ^s 0<a<a, 0«yc«b, (9.4.2) 


by separation of variables. If u(x, y) = X(x)Y (y), then 


X" y" 
— LA 9.4.3 
yty (9.4.3) 


Because we must satisfy the boundary conditions that X (0) = X (a) = Y (0) = Y (b) = 0, 
we have the following eigenfunction solutions: 


NTE ) 
, 


X.(z) = sin( 2 Ym (z£) = sin (28) (9.4.4) 


32 Poisson, S. D., 1813: Remarques sur une équation qui se présente dans la théorie des attractions des 
sphéroides. Nouv. Bull. Soc. Philomath. Paris, 3, 388—392. 
p $ ? 
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Siméon-Denis Poisson (1781-1840) was a product as well as a member of the French 
scientific establishment of his day. Educated at the École Polytechnique, he devoted 
his life to teaching, both in the classroom and with administrative duties, and to 
scientific research. Poisson's equation dates from 1813 when Poisson sought to extend 
Laplace's work on gravitational attraction. (Portrait courtesy of the Archives de 
l'Académie des sciences, Paris.) 


with Anm = —n?z?/a? — m?x?/b?; otherwise, we would only have trivial solutions. The 
corresponding particular solutions are 
nuc mm 
Unm = Anm sin( 777) sin( 72) ; (9.4.5) 
a 


where n = 1,2,3,..., and m = 1,2,3,.... 
For a fixed y, we can expand f(x,y) in the half-range Fourier sine series 


f(v,y) = » An(y) sin(“"*) (9.4.6) 
where se in 
An(y) = a f(x,y) sin(“*) dz. (9.4.7) 


However, we can also expand A,(y) in a half-range Fourier sine series 


Anly) = Y ties sin(“™) (9.4.8) 
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where 
2 b 4 b a 
dum = F An(y) sin( 2) dy = — f(x,y) sin(“**) sin( 772) dz dy, 
b 0 b ab 0 0 a b 
(9.4.9) 
and se da 
_ (NTE , mn) 
= am Sin( —— ) sin( —— ). 9.4.10 
f(x, y) 2258 sin( E ) sin( 5 ( ) 
In other words, we re-expressed f(x,y) in terms of a double Fourier series. 
Because Equation 9.4.2 must hold for each particular solution, 
a2 25 — mln = wn sin( 22) si(* 74) ; (9.4.11) 


if we now associate Equation 9.4.1 with Equation 9.4.2. Therefore, the solution to Poisson’s 
equation on a rectangle where the boundaries are held at zero is the double Fourier series 


cess Qnm . (NTEN . (mmy 
u(x, y) = -> 5 nan Ja? + mn O sin( 3 ) sin( 5 ) : (9.4.12) 
n=1m=1 
Problems 


1. The equation 

Pu Pu R b 

aa T "se We 
describes the hydraulic potential (elevation of the water table) u(x, y) within a rectangular 
island on which a recharging well is located at (0,0). Here R is the rate of recharging and 
T' is the product of the hydraulic conductivity and aquifer thickness. If the water table 
is at sea level around the island so that u(—a, y) = u(a, y) = u(z, —5) = u(x, b) = 0, find 
u(x, y) everywhere in the island. Hint: Use symmetry and redo the above analysis with the 
boundary conditions: u;(0,y) = u(a, y) = uy(z,0) = u(z, b) = 0. 


2. Let us apply the same approach that we used to find the solution of Poisson's equation 
on a rectangle to solve the axisymmetric Poisson equation inside a circular cylinder 


10 / Qu Oru 
| = < 
Por (a) 59 f(r,z), O<r<a, |z| «5, 


subject to the boundary conditions 
Hmju(z)«o» —w(mz-0  ld«b 


and 
u(r, —b) = u(r, b) = 0, O<r<a. 


Step 1: Replace the original problem with 


10 f Qu 0?u 
= < 
Op (C) 73 Au, O<r<a, |z| «5, 
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subject to the same boundary conditions. Use separation of variables to show that the 
solution to this new problem is 


1 
Unm(r, z) = AnmJo( kn~ ) zu ta) =| 


where kpn is the nth zero of Jo(k) = 0, n = 1,2,3,..., and m=0,1,2,.... 


Step 2: Show that f(r,z) can be expressed as 


J- 3 ` n cos +a) =] | 


n=1m=0 


1 
Qnm, = RTD «f, [5 f(r,z )Jo(k Dd r dr dz. 


Step 3: Show that the general solution is 


where 


Hu CE Jo r/a) co s[(m + £) 7z/b] 
2:22 au eme Dn] 


9.5 NUMERICAL SOLUTION OF LAPLACE'S EQUATION 


As in the case of the heat and wave equations, numerical methods can be used to solve 
elliptic partial differential equations when analytic techniques fail or are too cumbersome. 
They are also employed when the domain differs from simple geometries. 

'The numerical analysis of an elliptic partial differential equation begins by replacing the 
continuous partial derivatives by finite-difference formulas. Employing centered differencing, 


0?u Um --1 — 2u + um-1 
Loma m,n m-—1,n ^ 2 51 
"n s + O((Az], (9.5.1) 


and E 
u Um, n+1 — 2Um.n + Um n—1 j 3 
= —™ 5-1 4 Of(Ay)?], 9.5.2 
T AUS (Aw)? (9.5.2) 
where Um,» denotes the solution value at the grid point m,n. If Ax = Ay, Laplace's 
equation becomes the difference equation 


Um-+1,n + Um—1,n + Um,n+1 + Um,n—1 — Ag n = 0. (9.5.3) 


Thus, we must now solve a set of simultaneous linear equations that yield the value of the 
solution at each grid point. 

The solution of Equation 9.5.3 is best done using techniques developed by algebraists. 
Later on, in Chapter 3, we will show that a very popular method for directly solving sys- 
tems of linear equations is Gaussian elimination. However, for many grids at a reasonable 
resolution, the number of equations is generally in the tens of thousands. Because most 
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of the coefficients in the equations are zero, Gaussian elimination is unsuitable, both from 
the point of view of computational expense and accuracy. For this reason alternative meth- 
ods have been developed that generally use successive corrections or iterations. The most 
common of these point iterative methods are the Jacobi method, unextrapolated Liebmann 
or Gauss-Seidel method, and extrapolated Liebmann or successive over-relaxation (SOR). 
None of these approaches is completely satisfactory because of questions involving conver- 
gence and efficiency. Because of its simplicity we will focus on the Gauss-Seidel method. 


We may illustrate the Gauss-Seidel method by considering the system: 


10x +y +z = 39, (9.5.4) 
2x + 10y + z = 51, (9.5.5) 

and 
2x + 2y + 10z = 64. (9.5.6) 


An important aspect of this system is the dominance of the coefficient of x in the first 
equation of the set and that the coefficients of y and z are dominant in the second and third 
equations, respectively. 

The Gauss-Seidel method may be outlined as follows: 


e Assign an initial value for each unknown variable. If possible, make a good first guess. 
If not, any arbitrarily selected values may be chosen. The initial value will not affect the 
convergence but will affect the number of iterations until convergence. 


e Starting with Equation 9.5.4, solve that equation for a new value of the unknown which 
has the largest coefficient in that equation, using the assumed values for the other unknowns. 


e Go to Equation 9.5.5 and employ the same technique used in the previous step to compute 
the unknown that has the largest coefficient in that equation. Where possible, use the latest 
values. 


e Proceed to the remaining equations, always solving for the unknown having the largest 
coefficient in the particular equation and always using the most recently calculated values 
for the other unknowns in the equation. When the last equation, Equation 9.5.6, has been 
solved, you have completed a single iteration. 


e Iterate until the value of each unknown does not change within a predetermined value. 


Usually a compromise must be struck between the accuracy of the solution and the desired 
rate of convergence. The more accurate the solution is, the longer it will take for the solution 
to converge. 

To illustrate this method, let us solve our system Equation 9.5.4 through Equation 
9.5.6 with the initial guess x = y = z = 0. The first iteration yields x = 3.9, y = 4.32, and 
z = 4.756. The second iteration yields x = 2.9924, y = 4.02592, and z = 4.996336. As can 
be readily seen, the solution is converging to the correct solution of x = 3, y = 4, and z = 5. 


Applying these techniques to Equation 9.5.3, 


k+1 1 k k+1 k k+1 
Um,n = Ud (ub iin + Um-—1,n + Um n+1 + us x) $ (9.5.7) 
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where we assume that the calculations occur in order of increasing m and n. 
e Example 9.5.1 


To illustrate the numerical solution of Laplace’s equation, let us redo Example 9.3.1 
with the boundary condition along y = H simplified to u(x, H) = 1 + x/L. 

We begin by finite-differencing the boundary conditions. The condition u,(0,y) = 
Ux(L,y) = 0 leads to u1,, = u-1,4 and UM+1,n = uy 1,5 if we employ centered differences 
at m — 0 and m — M. Substituting these values in Equation 9.5.7, we have the following 
equations for the left and right boundaries: 


ut = + (2uf,, + uË n41 + ugha) (9.5.8) 


and 
k+1 k+1 k+1 
udi = (Quits, E ukth_1). (9.5.9) 


On the other hand, uy(#,0) = 0 yields um,ı = Um,—1, and 
m40 = i (uk iio F ius F 2uk 1) : (9.5.10) 
At the bottom corners, Equation 9.5.8 through Equation 9.5.10 simplify to 

usb. = 1 (ufo ubi) (9.5.11) 
and 


k k 
DA =3 onm un : (9.5.12) 


These equations along with Equation 9.5.7 were solved with the Gauss-Seidel method 
using the MATLAB script 


a 
tal 
Il 
o 
HB 
ba 
ll 
o 
g 
HG 
= 
I 


:dx:1; M = 1/dx+1; 4 M = number of x grid points 
dy = 0.1; y = O:dy:1; N = 1/dy+1; % N = number of y grid points 
X = x? * ones(1,N); Y = ones(M,1) * y; 
u = zeros(M,N); 4 create initial guess for the solution 
^ introduce boundary condition along y = H 
for m = 1:M; u(m,N) = 1 + x(m); end 
^ start Gauss-Seidel method for Laplace?s equation 
for iter = 1:256 
^4 do the interior first 
for n = 2:N-1; for m = 2:M-1; 
u(m,n) = (u(mti,n)+u(m-1,n)+u(m,nt1)+u(m,n-1)) / 4; 
end; end 
^4 now do the x = O and x = L sides 
for n = 2:N-1 
u(i,n) = (2xu( 2 ,n)tu(1,nti)+u(i,n-1)) / 4; 
u(M,n) = (2xu(M-1,n)*u(M,n*1)*u(M,n-1)) / 4; 
end 
^ now do the y = 0 side 
for m = 2:M-1 
u(m,1) = (u(mt+i,1)+u(m-1,1)+2*u(m,2)) / 4; 
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after 4 iterations after 16 iterations 
—0:6- 


0 0.5 1 0 0.5 1 
after 64 iterations after 256 iterations 
I 
> 0.5 
0 0.5 1 % 0.5 1 
X/L X/L 


Figure 9.5.1: The solution to Laplace's equation by the Gauss-Seidel method. The boundary conditions 
are uz (0, y) = uz(L, y) = uy(z, 0) = 0, and u(x, H) = 1 + z/L. 


end 
^ finally do the corners 
u(1,1) = (u(2,1)4u(1,2))/2; u(M,1) = (u(M-1,1)+u(M,2))/2; 
% plot the solution 
if (iter == 4) subplot(2,2,1), [cs,h] = contourf(X,Y,u); 
clabel(cs,h,[0.2 0.6 1 1.4],’Fontsize’ ,16) 
axis tight; title(’after 4 iterations’ ,’Fontsize’ ,20) ; 
ylabel(’Y/H’,’Fontsize’,20); end 
if (iter == 16) subplot(2,2,2), [cs,h] = contourf(X,Y,u); 
clabel(cs,h, ’Fontsize’ ,16) 
axis tight; title( after 16 iterations’, ’Fontsize’ ,20); 
ylabel(’Y/H’,’Fontsize’,20); end 
if (iter == 64) subplot(2,2,3), [cs,h] = contourf(X,Y,u); 
clabel(cs,h,’Fontsize’ ,16) 
axis tight; title(’after 64 iterations’, ’Fontsize’ ,20); 
xlabel(’X/L’,’Fontsize’ ,20); ylabel(’Y/H’,’Fontsize’ ,20) ; 
end 
if (iter == 256) subplot(2,2,4), [cs,h] = contourf(X,Y,u); 
clabel(cs,h,’Fontsize’ ,16) 
axis tight; title( after 256 iterations’, ’Fontsize’ ,20); 
xlabel(’X/L’,’Fontsize’ ,20); ylabel(’Y/H’, ’Fontsize’ ,20) ; 
end 
end 
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1000 


number of iterations 


Figure 9.5.2: The number of iterations required so that |Rmjn| < 10-? as a function of w during the 
iterative solution of the problem posed in the project. We used Ax = Ay = 0.01, and L = zo = 1. The 
iteration count for the boundary conditions stated in Step 1 is given by the solid line while the iteration 
count for the boundary conditions given in Step 2 is shown by the dotted line. The initial guess equaled 
zero. 


The initial guess everywhere except along the top boundary was zero. In Figure 9.5.1 
we illustrate the numerical solution after 4, 16, 64, and 256 iterations, where we have taken 
11 grid points in the x and y directions. 


Project: Successive Over-Relaxation 


The fundamental difficulty with relaxation methods used in solving Laplace’s equation 
is the rate of convergence. Assuming Az — Ay, the most popular method for accelerating 
convergence of these techniques is successive over-relazation (SOR): 


urs = (1 m lys oi + wEm,n, 


m,n 


where 
ERE! k k+1 k k+1 
Rm,n —4 (uyin F Um —1.n t Un n+1 T uos) ; 


Most numerical methods books dealing with partial differential equations discuss the the- 
oretical reasons behind this technique;? the optimum value always lies between one and 
two. In the present case, a theoretical analysis?^ gives 


4 
Wopt = 7— ————5: 
Pm eq wA 


= cos() + cos( 7) 
c= cN MJ' 


33 For example, Young, D. M., 1971: Iterative Solution of Large Linear Systems. Academic Press, 570 
pP. 


where 


34 Yang, S., and M. K. Gobbert, 2009: The optimal relaxation parameter for the SOR method applied 
to the Poisson equation in any space dimensions. Appl. Math. Letters, 22, 325-331. 
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and N and M are the number of mesh divisions on each side of the rectangular domain. 
Recently Yang and Gobbert?? generalized the analysis and found the optimal relaxation pa- 
rameter for the successive-overrelaxation method when it is applied to the Poisson equation 
in any space dimensions. 


Step 1: Write a MATLAB script that uses the Gauss-Seidel method to numerically solve 
Laplace's equation for 0 € x < L, 0 < y < zo with the following boundary conditions: 
u(x,0) = 0, u(x, zo) =14+2/L, u(0, y) = y/zo, and u(L,y) = 2y/zo. Because this solution 
will act as "truth" in this project, you should iterate until the solution does not change. 


Step 2: Now redo the calculation using successive over-relaxation. Count the number of 
iterations until |Rm,n| € 107? for all m and n. Plot the number of iterations as a function 
of w. How does the curve change with resolution Az? How does your answer compare to 
the theoretical value? See Figure 9.5.2. 


Step 3: Redo Steps 1 and 2 with the exception of u(0,y) — u(L,y) — 0. How has the 
convergence rate changed? Can you explain why? How sensitive are your results to the 
first guess? 


9.6 FINITE ELEMENT SOLUTION OF LAPLACE'S EQUATION 


In Section 6.6 we showed how the finite element method can be used to solve boundary- 
value problems. Here we extend this approach to two dimensions. The main difficulty will 
be the increase in the complexity of the “bookkeeping.” 


| Triangular elements 


Our first concern is with the element equations. Essentially there are two types: trian- 
gles and quadrilaterals. Triangular elements use the linear polynomial: 


u(z,y) = ao + a1 + ay, (9.6.1) 


where u(x,y) is the dependent variable, the a;'s are coefficients, and x and y are the 
independent variables. This function must pass through the values of u(x, y) at the triangle’s 
nodes (21,91); (x2, y2), and (x3, y3)- Therefore, 


uj = ag t A121 t a241, (9.6.2) 

ug = ag + 1X2 + 0292; (9.6.3) 
and 

ua = ao + 4123 + à2ya. (9.6.4) 


Solving for ag, a, and ag, we have that 
ag = [ui(xays — 1392) + uz(zay1 — z1ys) + ua(z1y2 — 2291)]/A; (9.6.5) 


a; = [u1 (y2 — ya) + uz(ys — yi) + us(vi — y2)I/A, (9.6.6) 


35 Ibid. 
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and 
Q2 — [us (x3 — x2) + us (z1 — x3) + U3 (xe — 21)]/A, (9.6.7) 


where A is the area of the triangular element 


A = 3[(x2y3 — £392) + (zay1 — z1ya) + (z1y2 — 223A )]. (9.6.8) 


To avoid a small A you should avoid elements with narrow geometries on the finite mesh. 
Equation 9.6.5 through Equation 9.6.7 can be substituted into Equation 9.6.1. Collecting 
terms, the result can be expressed as 


u(z,y) = Nui + Nu» + Nous, (9.6.9) 
where 
N, = [(129s — $392) + (y2 — yı )x + (xa — z2)y]/(2A), (9.6.10) 
No = [(£3y1 — 213) + (ys — yr)@ + (xı — 23)y]/(24); (9.6.11) 
and 
Ns = [(t1y2 — 2231) + (yı — y2)@ + (x2 — 21)y]/(24). (9.6.12) 


| Bilinear rectangular elements 


Bilinear rectangular elements use the interpolation formula: 
u(x, y) = Niu + N35u» + Naus + Nua. (9.6.13) 


We wish Equation 9.6.13 to be linear in both x and y. Applying the same procedure as 
before, we have that 


N, = (b — x)(a — y)/(4ab), No = (b + x)(a — y)/ (4ab), (9.6.14) 


= (b+ x)(a + y)/(4ab), and N4-(b— x)(a + y)/(4ab), (9.6.15) 
where 2b and 2a are the length and height of the element, respectively. 


| Finite element formulation 


Having introduced the two most common finite element representations, we are ready 
to apply them to Laplace’s equation. As in the one dimension we develop the finite element 
formulation using the variational formulation (weak formulation) of the boundary-value 
problem by considering the integral 


o?u 0?u Qu 
r= ff wi (5 T 8 a) dx dy — yp wlz y) dl, (9.6.16) 


e 


where w(x,y) is a weighting function and Pe is that portion of the boundary where the 
Dirichlet condition applies. Integrating Equation 9.6.16 by parts, 


Ow Ou Ow Ou du 
zx (Ext be” By 5) dedy + | w(z, y) dT, (9.6.17) 


n 
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where I» is that portion of the boundary where any Neumann condition exists. 
For a linear triangular element, we have 


Ow Ou Ow Ou 
w= ff. (Sot + opt Oy 25) da dy (9.6.18) 


“AILEY m 


where (2° is the element's domain. Upon substituting for Nj, N2, and N3 and carrying out 
the integrations, 


kı KE kag 
[K°] = | ka Ko. kos | , (9.6.20) 
k31 k32 k33 
where 
kii = [(£3 — 22)? + (ya — y3)?]/(4A), 9.6.21) 
ki» = [(@3 — v2) (x1 — 23) + (ya — ys)(ya — )]/(4A), 9.6.22) 
kis = [(£3 — £2)(£2 — 21) + (ye — ys)(Y1 — va)]/(4A). 9.6.23) 
ka = kiz 9.6.24) 
kao = [(£1 — 23)? + (ys — y1)?]/(4A), 9.6.25) 
ko3 = [(£1 — £3)(£2 — x1) + (ya — yi) (i — y2)]/ (4A), 9.6.26) 
kai = kis, 9.6.27) 
k32 = k23, 9.6.28) 
and 
k33 = [(£2 — 21)" + (yı — y2)]/(4A). 9.6.29) 


In the case of a bilinear element, we now have 


VECE OOO 


Carrying out the integration, we obtain 


K= (9.6.31) 


where Jj ; " 2 
b +a b^ — 2a 
kı = kio = ———— 9.6.32 
17 3ab ^ zs Gab ( ) 
b? + a? a? — 2b? 
E ky, = ———— .6.33 
kia Gab ^ 14 Gab (9.6.33) 


k22 = kii, k23 = kia, koa = Kis, (9.6.34) 


436 Advanced Engineering Mathematics with MATLAB 


u(x,2) 2cos(mx/6) 


insulated 


insulated 


Figure 9.6.1: The (a) domain and (b) layout of nodes for Example 9.6.1. The numbers within the circles 
give the number for each triangular element. 


and 
k33 = kı, k34 = hia, kaa = ky. (9.6.35) 


e Example 9.6.1 


Let us illustrate the finite element method by finding the solution to Laplace’s equation 
in a rectangular domain shown in Figure 9.6.1(a). The origin is taken at the lower left corner. 
The boundaries x = 0 and y = 0 are insulated, the boundary x = 3 is maintained at zero, 
and the boundary y = 2 is described by the equation u(x, 2) = cos(mx/6). 

If we use 12 nodes, Figure 9.6.1(b) gives the global node number, element number, 
and element node numbers. The global node and element numberings are arbitrary. If we 
used some software package we would have to take care that we follow their convention. 
The element node numbering scheme used here is consistent with the element interpolation 
function, a counterclockwise system. According to the element node numbering scheme 
shown in Figure 9.6.1(b), all elements in the mesh fall into one of two geometric shapes: 
one with its base at the bottom of the element and another with its base at the top of the 
element. Our choice results in all of the elements having a common geometric shape, and 
thus the element coefficients must be computed only for a single element. 

For a typical element of the mesh of triangles in Figure 9.6.1(b), the element coefficient 
matrix is 


KK] = 5 -1 NE NE (9.6.36) 


The element matrix is independent of the size of the element, as long as the element is a 
right-angled triangle where the base equals height. 
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u(x,2) = cos(Tx/6) 


insulated 


insulated 


Figure 9.6.2: The (a) domain and (b) layout of nodes for Example 9.6.2. The numbers within the circles 
give the number for each rectangular element. 


The boundary conditions require that ug = ug = uj» = 0, ug = 1, uo = V3/2, 
ui; = 1/2, and there is no heat flow at the insulated boundary. Consequently the line 
integral vanishes and [K*]u* = 0. 

We first write the six finite element equations for the six unknown primary variables. 
These equations come from nodes 1, 2, 3, 5, 6, and 7: 


2 i 46 x xa Ex yu 0 
a bo Sx 0] [pus 0 
pod ow €x la, || 6 
a 3b 0 4 2 Oe |} a (bi) 
[js de 2X & og lay V3 
Dodd c2 0 2 08 Vus 1 


The solution to these equations is uj = 0.6362, ug = 0.5510, ug = 0.3181, us = 0.7214, 
ug = 0.6248, and u; = 0.3607. The exact solution is uf**! = 0.6249, u$"**' = 0.5412, 
up ec = 0.3124, apres = 0.7125, ae = 0.6171, and qe 0.3563. 


5 


e Example 9.6.2 


Let us redo Example 9.6.3 using rectangular elements. Figure 9.6.2 illustrates the grid 
now. The stiffness matrix for an element now becomes 


4 —1 —2 -1 


ıl- A _ 
2 JN Ae (9.6.38) 
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45 44 43 42 4] 40 39 38 37 36 35 34 33 32 3l 


Figure 9.6.3: One of the possible nodal maps for the project. 


If we consider only the number of nodes, the present mesh, shown in Figure 9.6.2(b), is 
the same as the triangular mesh that we used in Example 9.6.1. Hence the boundary condi- 
tions are unchanged here. The six finite elements equations for the unknown temperatures 
U1, U2, UZ, U5, ug, and uy are 


4 -1 0 —1 o» 0 uj 0 

-1 8 1 es —2 29 uz 0 

0 -1 8 0 2 +9 uz | _ 0 

-1 -2 0 8 -2 0 us | | 1+v3 eee) 
2 aed 9 x0 qs 3 ug 34 3 

0 -2 -2 0 -2 16 m 1 +4/3 


The solution to Equation 9.6.38 is u; = 0.6128, u2 = 0.5307, u3 = 0.3064, us = 0.7030, 
ug = 0.6088, and u; = 0.3515. Note that our present results are not as accurate as those 
in the previous example. This occurs because there are only one-half of the elements as in 
the triangular element formulation. 


Project: Solving Laplace’s Equation Using Finite Elements 


In this project you will use finite elements to solve Laplace’s equation. The domain 
is square with a quarter circle removed from the lower right side of the square. Along the 
quarter circle, the solution equals one while along the straight edges the solution equals 
Zero. 

Although you could write your own MATLAB code, there already exist several codes? 
that are available online. Consequently your principal task will be to construct the tri- 
angular and quadrilateral elements by choosing reasonable nodal positions. With those 


36 See, for example, Alberty, J., C. Carstensen, and S. A. Funken, 1999: Remarks about 50 lines of 
Matlab: Short finite element implementation. Numer. Algorithms, 20, 117-137. This paper includes the 
MATLAB code that you will need. 


37 For example, http: //people.sc.fsu.edu/^burkardt/m src/fem 50/fem.50.html 
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Finite Element Solution to Laplace's Equation 


Figure 9.6.4: The finite element solution of Laplace's equation where the solution equals 0 along the 
straight edges and 1 along the quarter circle. 


in place, the coding of the Dirichlet boundary conditions is straightforward. Figure 9.6.3 
shows one possible nodal configuration. Figure 9.6.4 illustrates how your solution should 
appear. 


Further Reading 


Duffy, D. G., 2008: Mixed Boundary Value Problems. Chapman & Hall, 467 pp. This book 
gives a detailed account of mixed boundary value problems. 


Koshlyakov, N. S., M. M. Smirnov, and E. B. Gliner, 1964: Differential Equations of Math- 
ematical Physics. North-Holland Publishing, 701 pp. See Part II. Detailed presentation of 
mathematical techniques. 


Morse, P. M., and H. Feshback, 1953: Methods of Theoretical Physics. McGraw-Hill Book 
Co., 997 pp. Chapter 10 is devoted to solving both Laplace's and Poisson's equations. 


Taylor & Francis 


Taylor & Francis Group 


http://taylorandfrancis.com 


Chapter 10 


Complex Variables 


'The theory of complex variables was originally developed by mathematicians as an aid 
in understanding functions. Functions of a complex variable enjoy many powerful properties 
that their real counterparts do not. That is not why we will study them. For us they provide 
the keys for the complete mastery of transform methods and differential equations. 

In this chapter all of our work points to one objective: integration on the complex 
plane by the method of residues. For this reason we minimize discussions of limits and 
continuity, which play such an important role in conventional complex variables, in favor 
of the computational aspects. We begin by introducing some simple facts about complex 
variables. Then we progress to differential and integral calculus on the complex plane. 


10.1 COMPLEX NUMBERS 


A complex number is any number of the form a+bi, where a and b are real and i = 4/—1. 
We denote any member of a set of complex numbers by the complex variable z = x + ty. 
The real part of z, usually denoted by 9t(z), is x while the imaginary part of z, S(z), is y. 
The complex conjugate, Z or z*, of the complex number a + bi is a — bi. 

Complex numbers obey the fundamental rules of algebra. Thus, two complex numbers 
a+ bi and c+ di are equal if and only if a = c and b = d. Just as real numbers have 
the fundamental operations of addition, subtraction, multiplication, and division, so too do 
complex numbers. These operations are defined: 


Addition 
(a+ bi) + (c+ di) = (a -- c) 4- (b - d)i (10.1.1) 


Subtraction 


(a + bi) — (c+ di) = (a — c) + (b — d)i (10.1.2) 
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Multiplication 
(a+ bi)(c + di) = ac + bci + adi + i?bd = (ac — bd) + (ad + bc)i (10.1.3) 
Division 
a+bi at+bic—di | ac— adi -- bci — bdi? ac+bd+ (bc — ad)i 


= E = : 10.1.4 
c+di  c- di c— di c? + d? c? + d? ) 


The absolute value or modulus of a complex number a + bi, written |a + bi|, equals 
va? + b?. Additional properties include: 


[212223 +- Zn| = |zillzel|za| e enl 10.1.5) 
|z1 + z2 +23 dz < [zl + [z2] + les] +-+ len! 10.1.7) 

and 
EZ + zal > [zi = [23]. 10.1.8) 


The use of inequalities with complex variables has meaning only when they involve absolute 
values. 

It is often useful to plot the complex number x + iy as a point (x,y) in the zy-plane, 
now called the complex plane. Figure 10.1.1 illustrates this representation. 

'This geometrical interpretation of a complex number suggests an alternative method 
of expressing a complex number: the polar form. From the polar representation of x and y, 


z-—rcos(Ü0ü and y-rsin(0), (10.1.9) 


where r = \/x?2 + y? is the modulus, amplitude, or absolute value of z and 0 is the argument 
or phase, we have that 
z = x + iy = r|cos(0) + isin(0)]. (10.1.10) 


However, from the Taylor expansion of the exponential in the real case, 


io No (i)* 
ea T (10.1.11) 
k=0 
Expanding Equation 10.1.11, 
; U^ gg o 09 05 F 
d o. f ree ae} | — 
= cos(0) + isin(0). (10.1.13) 


Equation 10.1.13 is Euler's formula. Consequently, we may express Equation 10.1.10 as 
z re, (10.1.14) 
which is the polar form of a complex number. Furthermore, because 


Pare (10.1.15) 
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Figure 10.1.1: The complex plane. 


by the law of exponents, 
z” = r” [cos(n0) + i sin(n6)]. (10.1.16) 


Equation 10.1.16 is De Moivre’s theorem. 
e Example 10.1.1 


Let us simplify the following complex number: 


Lad Ba 1 Po 3- 3i+2i +2?  —5—i 5 i (10.1.17) 
iuo —l+i Leg 141 2 2, 2 


e Example 10.1.2 


Let us reexpress the complex number — V6 — i/2 in polar form. From Equation 10.1.9 
r= /6+2 and 0 = tan-! (b/a) = tan! (1/3) = 7/6 or 71/6. Because — v6 — i2 lies in 
the third quadrant of the complex plane, 0 = 77/6 and 


—V6 — iV2 = 2 /2e 71/6. (10.1.18) 


Note that Equation 10.1.18 is not a unique representation because +2n7a may be added to 
7/6 and we still have the same complex number since 


e(02n7) — cos(0 + 2nz) + isin(0 + 2nr) = cos(0) + isin(0) = e”. (10.1.19) 


For uniqueness we often choose n — 0 and define this choice as the principal branch. Other 
branches correspond to different values of n. 


e Example 10.1.3 


Find the curve described by the equation |z — zo| — a. 
From the definition of the absolute value, 


V(x — zo)? + (y — yo)? =a (10.1.20) 
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Or 
(x — zo)? + (y — yo)? = 0°. 


(10.1.21) 


Equation 10.1.21, and hence |z — zo| = a, describes a circle of radius a with its center located 
at (zo, yo). Later on, we shall use equations such as this to describe curves in the complex 


plane. 
e Example 10.1.4 
As an example in manipulating complex numbers, let us show that 


at bi 
b 4- ai 


We begin by simplifying 


a+ bi a+bi b-—ai 2ab b -—2a?. 


Qu 7 peus" Doa Rue i muet 


Therefore, 
at bil | 4a? b2 bt — 2a?b? tia? at + 2a?b? +b — i 
b+ail \ Fe (a? + b2}? (@@+e2)2 


MATLAB can also be used to solve this problem. Typing the commands 


>> syms a b real 
>> abs((atb*i) /(b+a*i) ) 


yields 


ans = 
1 


Note that you must declare a and b real in order to get the final result. 


Problems 


(10.1.22) 


(10.1.23) 


(10.1.24) 


Simplify the following complex numbers. Represent the solution in the Cartesian form a+bi. 


Check your answers using MATLAB. 


5i 5 + 52 20 1-24 2-—i 
/2-ci |3-4i 443i | 3-4 5i 
4. (1— i) 5. i(1 — iv3) (v3 + i) 

Represent the following complex numbers in polar form: 
6. —i 7. —4 8. 24+2V3i 
9. —5- 5i 10. 2— 2i 11. -14 V3i 


12. By the law of exponents, e(¢+9) = etae, Use Euler's formula to obtain expressions 


for cos(a + B) and sin(a + B) in terms of sines and cosines of o and f. 
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13. Using the property that gear q” = (1 — qXt")/(1 — q) and the geometric series 
aa e'"*. obtain the following sums of trigonometric functions: 


ae f NV sin[(N + 1)t/2] an (| NUN sin[(N + 1)t/2] 
5 cos(nt) = cos( 5 ) sin(t/2) and p» sin(nt) — sin( 5 ) sin(t/2) c 


n=0 n=1 


These results are often called Lagrange’s trigonometric identities. 


14. (a) Using the property that o d" = 1/(1— q), if |g| < 1, and the geometric series 
aso ene’, e| < 1, show that 


oe 1 — e cos(t) > . e sin(t) 
ye t) = d ” sin(nt) = i 
ai Eds 1 + €? — 2e cos(t) S 2. PIN 14 €? — 2e cos(t) 


(b) Let e = e^, where a > 0. Show that 


sin(t) 
cosh(a) — cos(t) ` 


2 5 e "^ sin(nt) = 
n=1 


10.2 FINDING ROOTS 


The concept of finding roots of a number, which is rather straightforward in the case of 
real numbers, becomes more difficult in the case of complex numbers. By finding the roots 
of a complex number, we wish to find all of the solutions w of the equation w” = z, where 
n is a positive integer for a given z. 

We begin by writing z in the polar form: 


z — re, (10.2.1) 
while we write 
v = Re? (10.2.2) 
for the unknown. Consequently, 
w” = Rhein? = re? =z, (10.2.3) 
We satisfy Equation 10.2.3 if 
R” =r and n®@=y4+2kn, k=0,+1,+2,..., (10.2.4) 


because the addition of any multiple of 27 to the argument is also a solution. Thus, 
R=r'/", where R is the uniquely determined real positive root, and 


2nk 
(4,254775, RETER yo (10.2.5) 


n n 


Because Wk = Wk+n, it is sufficient to take k = 0,1,2,...,n— 1. Therefore, there are exactly 
n solutions: 


2nk 
wy = Re?* = rl" exp I € + 2] (10.2.6) 
n n 
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Z, ty 


Figure 10.2.1: The zeros of z? = —32. 


with k — 0,1,2,...,n — 1. They are the n roots of z. Geometrically we can locate these 
solutions uw; on a circle, centered at the point (0,0), with radius R and separated from each 
other by 27/n radians. These roots also form the vertices of a regular polygon of n sides 
inscribed inside of a circle of radius R. (See Example 10.2.1.) 

In summary, the method for finding the n roots of a complex number Zo is as follows. 
First, write zp in its polar form: z = re^?. Then multiply the polar form by e?/7^. Using 
the law of exponents, take the 1/n power of both sides of the equation. Finally, using 
Euler's formula, evaluate the roots for k = 0,1,...,n — 1. 


e Example 10.2.1 


Let us find all of the values of z for which zë = —32 and locate these values on the 
complex plane. 
Because l 
=32 = 222" = e"f, (10.2.7) 
| — 2nik 
dies zop T + T ) "E ti (10.2.8) 
Or 
zo = few (£.) —2 [cos( =) + isim(2)] ; (10.2.9) 
PER |) Soles = ean. (10.2.10) 
5 5 5 
23 = 2e™ = —2, (10.2.11) 
genes ales” poen M (10.2.12) 
5 5 5 
and 


ge zep T) = les (ar) T isin()| . (10.2.13) 


Figure 10.2.1 shows the location of these roots in the complex plane. 
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Figure 10.2.2: The zeros of z? = —1 + i. 
e Example 10.2.2 


Let us find the cube roots of —1 + à and locate them graphically. 
Because —1 + i = V2 exp(37i/4), 


j^ Qimk 
zy = 21/6 ew ( 7 Ji 3 ) A k=0,1,2, (10.2.14) 
Or : 
zo = 21/6 exp( ™ | = 21/6 [cos(=) +isin(7)| (10.2.15) 
4 4 4 

liri 11 11 
zı = 21/6 se (55) = 21/6 cos " isin( =| , (10.2.16) 

s 197i 19 19 
z = 21/6 exp( 3) = 21/6 feos( =| isin( =) (10.2.17) 


Figure 10.2.2 gives the location of these zeros on the complex plane. 


e Example 10.2.3 


The routine solve in MATLAB can also be used to compute the roots of complex 
numbers. For example, let us find all of the roots of z4 = —a?*. 
The MATLAB commands are as follows: 


>> syms a Z 
>> solve(z^4*a^4) 
'This yields the solution 


ans- 

[ (1/2*27 (1/2) *1/2*i*2^ (1/2)) *a] 
[ (-1/2*2^ (1/2) *1/2*i*2^ (1/2))*a] 
[ (1/2*27 (1/2) -1/2*i*2^ (1/2)) *a] 
[ (-1/2*2^ (1/2) -1/2*i*2^ (1/2))*a] 
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z-plane w-plane 


Figure 10.3.1: The complex function w — z?. 


Problems 


Extract all of the possible roots of the following complex numbers. Verify your answer using 
MATLAB. 


1. 81/6 2. (=1)1⁄3 3. (=i)! 4. (—27i)!/6 
5. Find algebraic expressions for the square roots of a — bi, where a > 0 and b > 0. 


6. Find all of the roots for the algebraic equation z^ — 3iz?— 2 = 0. Then check your answer 
using solve in MATLAB. 


7. Find all of the roots for the algebraic equation z+ + 6iz? + 16 = 0. Then check your 
answer using solve in MATLAB. 


10.3 THE DERIVATIVE IN THE COMPLEX PLANE: THE CAUCHY-RIEMANN EQUATIONS 


In the previous two sections, we introduced complex arithmetic. We are now ready for 
the concept of function as it applies to complex variables. 

We already defined the complex variable z = x+iy, where x and y are variable. We now 
introduce another complex variable w = u + iv so that for each value of z there corresponds 
a value of w = f(z). From all of the possible complex functions that we might invent, we 
focus on those functions where for each z there is one, and only one, value of w. These 
functions are single-valued. They differ from functions such as the square root, logarithm, 
and inverse sine and cosine, where there are multiple answers for each z. These multivalued 
functions do arise in various problems. However, they are beyond the scope of this book 
and we shall always assume that we are dealing with single-valued functions. 

A popular method for representing a complex function involves drawing some closed 
domain in the z-plane and then showing the corresponding domain in the w-plane. This 
procedure is called mapping and the z-plane illustrates the domain of the function while 
the w-plane illustrates its image or range. Figure 10.3.1 shows the z-plane and w-plane for 
w = z?; a pie-shaped wedge in the z-plane maps into a semicircle on the w-plane. 


e Example 10.3.1 


zZ 


Given the complex function w = e~*’, let us find the corresponding u(x, y) and v(x, y). 


From Euler's formula, 


w = e77 =e (ete)? — uuo 9 tiny — uus? [cos(2xy) — isin(2xy)]. (10.3.1) 
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Therefore, by inspection, 
u(z,y) = eva" cos(2xy), and v(z,y) = 2e sin(2xy). (10.3.2) 


Note that there is no i in the expression for u(x, y). The function w = f(z) is single-valued 
because for each distinct value of z, there is a unique value of u(z, y) and v(z, y). 


e Example 10.3.2 


As counterpoint, let us show that w = /z is a multivalued function. 
We begin by writing z = re/?*?7!*, where r = \/x? + y? and 0 = tan! (y/x). Then, 


wy = Vrei?/?*rik — 4 = 9,1, 10.3.3) 
or 
wo = vyr [cos(0/2) + isin(0/2)] and WU; = —Wo. 10.3.4) 
Therefore, 
tiga.) = / 0050/2), vn (as 4) = o/Fsin( 0/2), 10.3.5) 
and 
u(x, y) = — vyr cos(0/2), vi(r,y) = — Vr sin(0/2). 10.3.6) 


Each solution wo or wy is a branch of the multivalued function yz. We can make yz single- 
valued by restricting ourselves to a single branch, say wo. In that case, the R(w) > 0 if we 
restrict —7 < 0 < v. Although this is not the only choice that we could have made, it is a 
popular one. For example, most digital computers use this definition in their complex square 
root function. The point here is our ability to make a multivalued function single-valued 
by defining a particular branch. 


Although the requirement that a complex function be single-valued is important, it is 
still too general and would cover all functions of two real variables. To have a useful theory, 
we must introduce additional constraints. Because an important property associated with 
most functions is the ability to take their derivative, let us examine the derivative in the 
complex plane. 

Following the definition of a derivative for a single real variable, the derivative of a 
complex function w — f(z) is defined as 


dw . Aw 2 f(z - Az) — f(z) 
— = lim — = lim s 
dz Az—0 Az Az—0 Az 


(10.3.7) 


A function of a complex variable that has a derivative at every point within a region of the 
complex plane is said to be analytic (or regular or holomorphic) over that region. If the 
function is analytic everywhere in the complex plane, it is entire. 

Because the derivative is defined as a limit and limits are well behaved with respect 
to elementary algebraic operations, the following operations carry over from elementary 
calculus: 


E E — cf'(z), c a constant (10.3.8) 
i [roa = n9 sta (10.3.9) 


450 Advanced Engineering Mathematics with MATLAB 


Eloo] = ta) + fev) 10.3.10) 

4 [16] _ att) - taf) 
ds Fal = ZE) 10.3.11) 
Sron = f'[g(z)]g' (2), the chain rule. 10.3.12) 


Another important property that carries over from real variables is l'Hóspital's rule: 
Let f(z) and g(z) be analytic at zo, where f(z) has a zero! of order m and g(z) has a zero 
of order n. Then, if m » n, 


P og (10.3.13) 
z— zo g(z) 
ifm=n, 
(m) 
lim J) _ £69). (10.3.14) 
220 g(z) — g("?» (zo) 
and if m « n, 
LC (10.3.15) 
2420 g(z) 
e Example 10.3.3 
Let us evaluate lim; ,;(z1? + 1)/(29 +1). From l'Hóspital's rule, 
EU 9 
. +1 . 10z 5. a 5 
mI cms = gta g 108.10 


So far, we introduced the derivative and some of its properties. But how do we actually 
know whether a function is analytic or how do we compute its derivative? At this point we 
must develop some relationships involving the known quantities u(x, y) and v(x, y). 

We begin by returning to the definition of the derivative. Because Az = Az + iAy, 
there is an infinite number of different ways of approaching the limit Az — 0. Uniqueness 
of that limit requires that Equation 10.3.7 must be independent of the manner in which 
Az approaches zero. A simple example is to take Az in the x-direction so that Az = Az; 
another is to take Az in the y-direction so that Az = iAy. These examples yield 


dw Aw Au--i^v ôu Ov 


= ]i = ]i = j 10.3.17 
dz Az0 Az Aso Ag Ox i E ( ) 
idi d ^ AutiAv ðv ð 
Ca e a a (10.3.18) 
dz Azs0 Az Ayso iAy Oy | Oy 
1 An analytic function f(z) has a zero of order m at zo if and only if f(zo) = f'(zo) 2. = f^-U(zg) = 


0 and f° (zo) £0. 
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Although educated as an engineer, Augustin-Louis Cauchy (1789-1857) would become a mathe- 
matician's mathematician, publishing 789 papers and 7 books in the fields of pure and applied 
mathematics. His greatest writings established the discipline of mathematical analysis as he refined 
the notions of limit, continuity, function, and convergence. It was this work on analysis that led him 
to develop complex function theory via the concept of residues. (Portrait courtesy of the Archives 
de l'Académie des sciences, Paris.) 


In both cases we are approaching zero from the positive side. For the limit to be unique 
and independent of path, Equation 10.3.17 must equal Equation 10.3.18, or 


Qu. v nd ON OU. (10.3.19) 


ðr dy Oy Ox 


These equations that u and v must both satisfy are the Cauchy-Riemann equations. 
They are necessary but not sufficient to ensure that a function is differentiable. The follow- 
ing example illustrates this. 


e Example 10.3.4 


Consider the complex function 


z5/|zļł, S 
= 3 10.3.20 
g { 0, z=0. ( ) 


The derivative at z = 0 is given by 


5 4 4 
dw lim (Az)?/|Az|*-—0 _ ji (Az) 


dw _ = taz) 10.3.21 
dz  Az>0 Az Az—0 |Az|*’ 0925) 
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Despite his short life, (Georg Friedrich) Bernhard Riemann's (1826-1866) mathematical work con- 
tained many imaginative and profound concepts. It was in his doctoral thesis on complex function 
theory (1851) that he introduced the Cauchy-Riemann differential equations. Riemann's later work 
dealt with the definition of the integral and the foundations of geometry and non-Euclidean (elliptic) 
geometry. (Portrait courtesy of Photo AKG, London, with permission.) 


provided that this limit exists. However, this limit does not exist because, in general, the 
numerator depends upon the path used to approach zero. For example, if Az = re™’/4 with 
r — 0, dw/dz = —1. On the other hand, if Az = re™/? with r > 0, dw/dz = 1. 

Are the Cauchy-Riemann equations satisfied in this case? To check this, we first com- 
pute 


Az \* i^y V. 
uz (0,0) = Aum, (s) =n; vy(0,0) = lim (e) =l; (10.3.22) 


u„(0,0) = lim "| GA)" | =0, and v,(0,0)— lim «( 


4 
Jim RA ) | — 0. (10.3.23) 


[Az] 
Hence, the Cauchy-Riemann equations are satisfied at the origin. Thus, even though the 
derivative is not uniquely defined, Equation 10.3.21 happens to have the same value for 
paths taken along the coordinate axes so that the Cauchy-Riemann equations are satisfied. 


In summary, if a function is differentiable at a point, the Cauchy-Riemann equations 
hold. Similarly, if the Cauchy-Riemann equations are not satisfied at a point, then the 
function is not differentiable at that point. This is one of the important uses of the Cauchy- 
Riemann equations: the location of nonanalytic points. Isolated nonanalytic points of an 
otherwise analytic function are called isolated singularities. Functions that contain isolated 
singularities are called meromorphic. 

The Cauchy-Riemann condition can be modified so that it is sufficient for the derivative 
to exist. Let us require that uz, uy, Uz, and vy be continuous in some region surrounding a 
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point z and satisfy the Cauchy-Riemann equations there. Then 


f(z) — f(2o) = [u(2) — u(2o)] + ilv(2) — v(zo)] (10.3.24) 
= [us (2o)(z — zo) + uy(zo)(y — vo) + &1(x — Zo) + €2(y — vo)] 
+ i[vs(zo)(x — xo) + vy(zo)(y — vo) + es(z — zo) + ea(y — vo)) (10.3.25) 
= [us (20) + ivs (2o))(z — 20) + (€1 + iex)(x — xo) 
+ (€ + iea)(y — yo), (10.3.26) 


where we used the Cauchy-Riemann equations and €1, €2, €3, €4 — 0 as Ax, Ay > 0. Hence, 


An ca hae 20) (10.3.27) 


because |Az| € |Az| and |Ay| € |Az|. Using Equation 10.3.27 and the Cauchy-Riemann 
equations, we can obtain the derivative from any of the following formulas: 


dw Ou _ Ov Ov jou. (10.3.28) 
dz Ox Ox Oy Oy 

and 
dw _ Ov, Ov Ou Ou (10.3.29) 
dz Oy Ox Ox Oy 


Furthermore, f'(zo) is continuous because the partial derivatives are. 
e Example 10.3.5 


Let us show that sin(z) is an entire function. 


w — sin(z) 10.3.30) 
u + iv = sin(x + iy) = sin(x) cos(iy) + cos(x) sin(iy) 10.3.31) 
= sin(x) cosh(y) + i cos(x) sinh (y), 10.3.32) 

because 
cos(iy) = 3 [e + e-690] = Ile" + e7v] = cosh(y), 10.3.33) 

and 

sin(iy) = 4 [et — e-:6:0] = — 4 [e” — e-v] = isinh(y), 10.3.34) 

so that 
u(x, y) = sin(r)cosh(y) and v(z,y)= cos(x) sinh(y). 10.3.35) 


Differentiating both u(x, y) and v(x, y) with respect to x and y, we have that 


Ou 


a = cos(x) cosh(y), p sin(x) sinh(y), (10.3.36) 
y 


Ox 
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o o 

CU = —sin(x)sinh(y),  < = cos(x) cosh(y), (10.3.37) 
Ox Oy 

and u(x,y) and v(x,y) satisfy the Cauchy-Riemann equations for all values of x and y. 
Furthermore, uz, Uy, Vz, and vy are continuous for all r and y. Therefore, the function 
w = sin(z) is an entire function. 


e Example 10.3.6 


Consider the function w = 1/z. Then 


; 1 x iy 
z x - ; 10.3.38 
w=utiv pL tua I ( ) 
Therefore, 
z y 
= d — ieee) 10.3.39 
u(x, y) T? +y?’ an v(x, y) x2 +y? ) 
Now ð 2 2 2 2 2 
—29 L 
LEM Cur par MN i M 10.3.40) 
Ox (x? + y?) (a? + y?)? 
fa) 2 2\ . 2 2 Doe Ad ð 
Os Cur Deni MNA Uc ANN | 10.3.41) 
Oy (a? + y?)? (x? + y?)? ôx 
ð 0-2 2 
“= NEL NE 10.3.42) 
Ox (£? +y?) (x? +y?) 
is du 0-2 2 a 
cupa TC ui JEN d 10.3.43) 


óp QUxVYy Wye Oe 


The function is analytic at all points except the origin because the function itself ceases to 
exist when both x and y are zero and the modulus of w becomes infinite. 


e Example 10.3.7 


Let us find the derivative of sin(z). 
Using Equation 10.3.28 and Equation 10.3.32, 


HE E = gu + (D = cos(x) cosh(y) — isin(x) sinh(y) = cos(x + iy) = cos(z). 


dz Ox Ox 
(10.3.44) 
Similarly, 
d f1 y — x 2ixy 1 1 
= = = ; 10.3.45 
su eren ey eee RS) 


The results in the above examples are identical to those for z real. As we showed 
earlier, the fundamental rules of elementary calculus apply to complex differentiation. Con- 
sequently, it is usually simpler to apply those rules to find the derivative rather than breaking 
f(z) down into its real and imaginary parts, applying either Equation 10.3.28 or Equation 
10.3.29, and then putting everything back together. 
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An additional property of analytic functions follows by cross differentiating the Cauchy- 
Riemann equations, or 


0?u 0?v Oru u du 
= = BU p E 10.3.4 
Ox? Əxðy Oy?’ 7 Oa? T Oy? 0, ( 9) 
and 2 2 2 2 
2 
jg UE BE nuca hen (10.3.47) 


0x? | OxOy H Oy?’ Ox? Oy? 5 
Any function that has continuous partial derivatives of second order and satisfies Laplace’s 
equation, Equation 10.3.46 or Equation 10.3.47, is called a harmonic function. Because 
both u(x, y) and v(z, y) satisfy Laplace's equation if f(z) = u + iv is analytic, u(x, y) and 
v(x, y) are called conjugate harmonic functions. 


e Example 10.3.8 


Given that u(x, y) = e *[rsin(y) — y cos(y)], let us show that u is harmonic and find 
a conjugate harmonic function v(x, y) such that f(z) = u + iv is analytic. 


Because 
Oru x NC = 
ggz 7 2€ "sin(y) + ve~? sin(y) — ye * cos(y), (10.3.48) 
and 
O7u v EN 2. 
Bae EE ERE) Vect D per con), (10.3.49) 


it follows that Ure + uy, = 0. Therefore, u(z, y) is harmonic. From the Cauchy-Riemann 
equations, 


Ou = gy =e "sin(y) — xe *sin(y) + ye ? cos(y), (10.3.50) 
Oy Ox 
and 
9v = E =e *cos(y) — ze * cos(y) — ye " sin(y). (10.3.51) 
Ox Oy 


Integrating Equation 10.3.50 with respect to y, 
v(x, y) = ye * sin(y) + ze * cos(y) + g(x). (10.3.52) 
Using Equation 10.3.51, 


v, = —ye ” sin(y) — ze * cos(y) + e * cos(y) + g'(x) 
=e *cos(y) — xe * cos(y) — ye ? sin(x). (10.3.53 


Therefore, g'(z) = 0 or g(x) = constant. Consequently, 
v(z,y) =e “[ysin(y) + x cos(y)] + constant. (10.3.54) 


Hence, for our real harmonic function u(x, y), there are infinitely many harmonic conjugates 
v(z, y), which differ from each other by an additive constant. 
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Problems 


Show that the following functions are entire: 
1. f(z) 2 iz 4-2 2. f(z) 2e* 3. f(z) = 28 4. f(z) = cosh(z) 


Find the derivative of the following functions: 

5. f(z) = (1 + 22)8/2 6. f(z) = (z + 221/2)1/8 7. f(z) = (l + 4i)? — 3z - 2 
8. f(z) = (2z — i)/(z+ 2i) 9. f(z)= (iz - 1) 

Evaluate the following limits: 


z? — 2iz— 1 z — sin(z) 
10. lim — ———— 11. lim — ——— 
9 25 zt +2z2+1 250 z3 


12. Show that the function f(z) = z* is nowhere differentiable. 


For each of the following u(x, y), show that it is harmonic and then find a corresponding 
v(z, y) such that f(z) = u + iv is analytic. 


13. u(x, y) = x? — 3? 14. u(x, y) = zt — 6z?y? + yt +r 


x 


15. u(x, y) = xzcos(z)e " — ysin(z)e * 16. u(x, y) = (x? — y?) cos(y)e* — 2xy sin(y)e 
10.4 LINE INTEGRALS 


So far, we discussed complex numbers, complex functions, and complex differentiation. 
We are now ready for integration. 

Just as we have integrals involving real variables, we can define an integral that involves 
complex variables. Because the z-plane is two-dimensional, there is clearly greater freedom 
in what we mean by a complex integral. For example, we might ask whether the integral of 
some function between points A and B depends upon the curve along which we integrate. 
(In general it does.) Consequently, an important ingredient in any complex integration is 
the contour that we follow during the integration. 

The result of a line integral is a complex number or expression. Unlike its counterpart 
in real variables, there is no physical interpretation for this quantity, such as area under 
a curve. Generally, integration in the complex plane is an intermediate process with a 
physically realizable quantity occurring only after we take its real or imaginary part. For 
example, in potential fluid flow, the lift and drag are found by taking the real and imaginary 
part of a complex integral, respectively. 

How do we compute fo f(z) dz? Let us deal with the definition; we illustrate the actual 
method by examples. 

A popular method for evaluating complex line integrals consists of breaking everything 
up into real and imaginary parts. This reduces the integral to line integrals of real-valued 
functions, which we know how to handle. Thus, we write f(z) = u(z, y) + iv(z, y) as usual, 
and because z = x + iy, formally dz = dx + i dy. Therefore, 


J rev [ioa + iole faz + idy (10.4.1) 
Ü e 


sj (x, y) de — viz.) dy i | u(x, y) dx + u(x, y) dy. (10.4.2) 
C C 
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2. > > 


Figure 10.4.1: Contour used in Example 10.4.1. 
The exact method used to evaluate Equation 10.4.2 depends upon the exact path specified. 


From the definition of the line integral, we have the following self-evident properties: 
f f(zdz—-] f(z)dz, (10.4.3) 
C e 


where C" is the contour C taken in the opposite direction of C and 


| f(z)dz= f(z)dz+ f(z) dz. (10.4.4) 
Cı +C2 Ci C2 
e Example 10.4.1 

Let us evaluate f. c 4 dz from z = 0 to z = 4+ 2i along two different contours. The first 
consists of the parametric equation z = t? + it. The second consists of two “dog legs": the 
first leg runs along the imaginary axis from z — 0 to z — 2i and then along a line parallel 
to the z-axis from z = 2i to z = 4 + 2i. See Figure 10.4.1. 


For the first case, the points z = 0 and z = 4+ 2i on C correspond to t = 0 and t = 2, 
respectively. T'hen the line integral equals 


2 2 
f, z*dz = f (é? + it)* d(t? + it) = n (20 — i? + t) dt = 10 — €. (10.4.5) 
1 


The line integral for the second contour C5 equals 


| ede f td f z*dz, (10.4.6) 
C2 Cra Cop 


where C2, denotes the integration from z = 0 to z = 27 while C», denotes the integration 
from z = 2i to z = 4+ 2i. For the first integral, 


2 
f z'dz = / (x — iy)(dz + idy) = n ydy — 2, (10.4.7) 
C2a C2a 0 


because x = 0 and dx = 0 along C5,. On the other hand, along C2, y = 2 and dy = 0 so 
that 


4 4 
f z*dz = (x — iy)(dz + idy) = | vie +i f —2dz = 8 — 8i. (10.4.8) 
Cop [0777 0 0 
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Figure 10.4.2: Contour used in Example 10.4.2. 


Thus the value of the entire C3 contour integral equals the sum of the two parts, or 10 — 8i. 

'The point here is that integration along two different paths has given us different results 
even though we integrated from z = 0 to z = 4+ 2i both times. This result foreshadows 
a general result that is extremely important. Because the integrand contains nonanalytic 
points along and inside the region enclosed by our two curves, as shown by the Cauchy- 
Riemann equations, the results depend upon the path taken. Since complex integrations 
often involve integrands that have nonanalytic points, many line integrations depend upon 
the contour taken. 


e Example 10.4.2 


Let us integrate the entire function f(z) = z? along the two paths from z = 0 to 
z = 2 + i shown in Figure 10.4.2. For the first integration, x = 2y, while along the second 
path we have two straight paths: z = 0 to z = 2 and z = 2 to z = 2 + i. 

For the first contour integration, 


1 1 
/ zdz = f (2y + iy)” (2 dy + idy) = f (3y? + Ay?i)(2 dy + i dy) (10.4.9) 
e 0 0 


1 1 
= I 6? dy + 8y°i dy + 3y^idy — 4y? dy = n 24° dy +11y°idy — (10.4.10) 
0 0 


= 2y8|b + Higyj = 2 4 Ue, (10.4.11) 


For our second integration, 


| z? dz = / z^ dz +f z* dz. (10.4.12) 
C2 Cra [0777 
Along C», we find that y = dy = 0 so that 
2 
I z? dz =|} a dx = įr’ = 8, (10.4.13) 
Coa 0 


and 


1 


L-44—2—£ 


i (10.4.14) 
0 


1 3 
| z? d= f (2+ iy)*idy =i (w+ — £) 
Cab 0 3 


because z = 2 and dx = 0. Consequently, 


2 lli 
/ zd $4. (10.4.15) 
T 3773 
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Figure 10.4.3: Contour used in Example 10.4.3. 


In this problem we obtained the same results from two different contours of integration. 
Exploring other contours, we would find that the results are always the same; the integration 
is path-independent. But what makes these results path-independent while the integration 
in Example 10.4.1 was not? Perhaps it is the fact that the integrand is analytic everywhere 
on the complex plane and there are no nonanalytic points. We will explore this later. 


Finally, an important class of line integrals involves closed contours. We denote this 
special subclass of line integrals by placing a circle on the integral sign: ¢. Consider now 
the following examples: 


e Example 10.4.3 


Let us integrate f(z) = z around the closed contour shown in Figure 10.4.3. 
From Figure 10.4.3, 


f zaz= f ida | idi | z dz. (10.4.16) 
C Ci C3 Cs 


Now 
0 0 y? 0 1 
f zdz= f iy (idy) =~ | ydy =- | =-, (10.4.17) 
Cı 1 1 2 1 2 
zx 2|-1 
1 
f zdz= | rdr= 2] mes, (10.4.18) 
Co 0 2 0 2 
and 
7/2. E v/2 
/ «ds = f eje dé = —| =-1, (10.4.19) 
C3 -T 2 -r 


where we used z = e? 


integral equals zero. 


around the portion of the unit circle. Therefore, the closed line 


e Example 10.4.4 


Let us integrate f(z) = 1/(z — a) around any circle centered on z = a. The Cauchy- 
Riemann equations show that f(z) is a meromorphic function. It is analytic everywhere 
except at the isolated singularity z = a. 
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If we introduce polar coordinates by letting z — a = re?! and dz = ire?'d0, 


2m ;.. 0i 2c 
gs =l ne ao =i | dO = 2ni. (10.4.20) 
0 0 


cz-a ree 


Note that the integrand becomes undefined at z = a. Furthermore, the answer is indepen- 
dent of the size of the circle. Our example suggests that when we have a closed contour 
integration it is the behavior of the function within the contour rather than the exact shape 
of the closed contour that is of importance. We will return to this point in later sections. 


Problems 
1. Evaluate $5(2*)? dz around the circle |z| = 1 taken in the counterclockwise direction. 


2. Evaluate $5 |z|? dz around the square with vertices at (0,0), (1,0), (1,1), and (0,1) taken 
in the counterclockwise direction. 


3. Evaluate f; |z| dz along the right half of the circle |z| = 1 from z = —i to z =i. 
4. Evaluate fo e” dz along the line y = x from (—1, —1) to (1, 1). 
5. Evaluate f, (2*)? dz along the line y = x? from (0,0) to (1, 1). 


6. Evaluate fc, z~1/? dz, where C is (a) the upper semicircle |z| = 1 and (b) the lower semi- 


0i 


circle |z| = 1. If z = re”, restrict —r < 0 < v. Take both contours in the counterclockwise 


direction. 
10.5 THE CAUCHY-GOURSAT THEOREM 


In the previous section we showed how to evaluate line integrations by brute-force 
reduction to real-valued integrals. In general, this direct approach is quite difficult and we 
would like to apply some of the deeper properties of complex analysis to work smarter. In 
the remaining portions of this chapter we introduce several theorems that will do just that. 

If we scan over the examples worked in the previous section, we see considerable differ- 
ences when the function was analytic inside and on the contour and when it was not. We 
may formalize this anecdotal evidence into the following theorem: 


Cauchy-Goursat theorem:? Let f(z) be analytic in a domain D and let C be a simple 
Jordan curve? inside D so that f(z) is analytic on and inside of C. Then $o f(z) dz — 0. 


Proof: Let C denote the contour around which we will integrate w = f(z). We divide the 
region within C into a series of infinitesimal rectangles. See Figure 10.5.1. The integration 


2 Goursat, E., 1900: Sur la définition générale des fonctions analytiques, d'aprés Cauchy. Trans. Am. 
Math. Soc., 1, 14-16. 


? A Jordan curve is a simply closed curve. It looks like a closed loop that does not cross itself. See 
Figure 10.5.2. 
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Figure 10.5.1: Diagram used in proving the Cauchy-Goursat theorem. 


around each rectangle equals the product of the average value of w on each side and its 
length, 


etes [e tee a 
[et Se + ag 469] (H+ nt ai ta] c 
= E = e) Gasa (10.5.1) 


Substituting w = u + iv into Equation 10.5.1, 


T EU (10.5.2) 
1 Əz 1 ay , oO. 


Because the function is analytic, the right side of Equation 10.5.1 and Equation 10.5.2 
equals zero. Thus, the integration around each of these rectangles also equals zero. 

We note next that in integrating around adjoining rectangles we transverse each side 
in opposite directions, the net result being equivalent to integrating around the outer curve 
C. We therefore arrive at the result $o f(z) dz = 0, where f(z) is analytic within and on 
the closed contour. 


ðw Ow (Ou dv l 
dx iðy \ðr yj 


The Cauchy-Goursat theorem has several useful implications. Suppose that we have a 
domain where f(z) is analytic. Within this domain let us evaluate a line integral from point 
A to B along two different contours C4 and C2. Then, the integral around the closed contour 
formed by integrating along C1 and then back along C5, only in the opposite direction, is 


j f(z)dz = f(z)dz — f(z)dz-0 (10.5.3) 
C Ci C3 


f(z)dz = f(z) dz. (10.5.4) 
Ci C2 
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3i (a) (b) 


Figure 10.5.2: Examples of a (a) simply closed curve and (b) not simply closed curve. 


Because Cı and C2 are completely arbitrary, we have the result that if, in a domain, f(z) 
is analytic, the integral between any two points within the domain is path independent. 

One obvious advantage of path independence is the ability to choose the contour so 
that the computations are made easier. This obvious choice immediately leads to 


The principle of deformation of contours: The value of a line integral of an analytic 
function around any simple closed contour remains unchanged if we deform the contour in 


such a manner that we do not pass over a nonanalytic point. 


e Example 10.5.1 
Let us integrate f(z) = z7! around the closed contour C in the counterclockwise 
direction. This contour consists of a square, centered on the origin, with vertices at (1, 1), 
(1, 21), (—1, 1), and (-1, - 1). 

The direct integration of foc z-ldz around the original contour is very cumbersome. 
However, because the integrand is analytic everywhere except at the origin, we may deform 
the origin contour into a circle of radius r, centered on the origin. Then, z = re” and 


dz = rie™d so that A " > 
$ de =| “-d9=i | dO =2ni. (10.5.5) 
cz o Pes 0 


The point here is that no matter how bizarre the contour is, as long as it encircles the origin 
and is a simply closed contour, we can deform it into a circle and we get the same answer 
for the contour integral. This suggests that it is not the shape of the closed contour that 
makes the difference but whether we enclose any singularities (points where f(z) becomes 
undefined) that matters. We shall return to this idea many times in the next few sections. 


Finally, suppose that we have a function f(z) such that f(z) is analytic in some domain. 
Furthermore, let us introduce the analytic function F(z) such that f(z) = F'(z). We would 
like to evaluate ie f(z) dz in terms of F(z). 

We begin by noting that we can represent F, f as F(z) =U +iV and f(z) = u + iv. 
From Example 10.3.28 we have that u = U; and v = Vz. Therefore, 


b b b b 
f (9e [ (ue ioydz +idy) = | Us de — Vady +i | V,dz+U,dy (10.5.6) 


a a 
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b b b b 
al Ur dz +Uydy +i | Vode + Vy dy = | a +i f dV = F(b) — F(a) 
(10.5.7) 


or 
b 
f f(z)dz = F(b) — F(a). (10.5.8) 


Equation 10.5.8 is the complex variable form of the fundamental theorem of calculus. Thus, 
if we can find the antiderivative of a function f(z) that is analytic within a specific region, 
we can evaluate the integral by evaluating the antiderivative at the endpoints for any curves 
within that region. 


e Example 10.5.2 
Let us evaluate jm z sin(2?) dz. 


The integrand f(z) = z sin(z?) is an entire function and its antiderivative equals 
—i cos(z?). Therefore, 


Problems 


For the following integrals, show that they are path independent and determine the value 
of the integral: 


2+377% 27 T 2i 
1. f e7” dz 2. I [e? — cos(z)] dz 3. J sin? (z) dz 4. I (z+ 1) dz 
1 0 0 


10.6 CAUCHY'S INTEGRAL FORMULA 


In the previous section, our examples suggested that the presence of a singularity 
within a contour really determines the value of a closed contour integral. Continuing with 
this idea, let us consider a class of closed contour integrals that explicitly contain a single 
singularity within the contour, namely $5 g(z) dz, where g(z) = f(z)/(z — zo), and f(z) is 
analytic within and on the contour C. We closed the contour in the positive sense where 
the enclosed area lies to your left as you move along the contour. 

We begin by examining a closed contour integral where the closed contour consists of 
the C1, C5, C3, and C4 as shown in Figure 10.6.1. The gap or cut between C2 and C4 is 
very small. Because g(z) is analytic within and on the closed integral, we have that 


JU pp] SJ na CR CE (10.6.1) 


Cı Z T 40 C Z — Z0 C3 Z — £0 Ca Z — 40 


It can be shown that the contribution to the integral from the path C2 going into the 
singularity cancels the contribution from the path C4 going away from the singularity as 
the gap between them vanishes. Because f(z) is analytic at zo, we can approximate its 
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y C1 


Figure 10.6.1: Diagram used to prove Cauchy's integral formula. 


value on C3 by f(z) = f(zo) 4- ó(z), where 6 is a small quantity. Substituting into Equation 


10.6.1, 
a dz = -f() | : dz a 308 dz. (10.6.2) 


Cı Z7 40 3 Z7 £0 3 ~ — Z0 


Consequently, as the gap between C2 and C4 vanishes, the contour C4 becomes the closed 
contour C so that Equation 10.6.2 may be written 


Ki e 2zi f (zo) +i d dð, (10.6.3) 


C Z7 40 0 
where we set z — zo = ce” and dz = ice®dð. 
Let M denote the value of the integral on the right side of Equation 10.6.3 and A equal 
the greatest value of the modulus of ô along the circle. Then 


2n 


27 
IMI <f pds] Ado =2nA. (10.6.4) 
0 0 


As the radius of the circle diminishes to zero, A also diminishes to zero. Therefore, |M], 
which is positive, becomes less than any finite quantity, however small, and M itself equals 
zero. Thus, we have that 


f(z) = = : un dz. (10.6.5) 


This equation is Cauchy’s integral formula. By taking n derivatives of Equation 10.6.5, we 
can extend Cauchy's integral formula* to 


fO (ay) = D fa F gz (10.6.6) 


2ri z — zo)? t1 


4 See Carrier, G. F., M. Krook, and C. E. Pearson, 1966: Functions of a Complex Variable: Theory 
and Technique. McGraw-Hill, pp. 39—40 for the proof. 
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for n — 1,2,3,.... For computing integrals, it is convenient to rewrite Equation 10.6.6 as 
f(z) Jri 
dz = (z0). 10.6.7 


e Example 10.6.1 


Let us find the value of the integral 


j ca ERR o (10.6.8) 
c (z - 1)(z - 2) 
where C is the circle |z| = 5. Using partial fractions, 
! zd i (10.6.9) 
(z-1)(2-2) 2-2 z-1 ae 
and 
¢ cos(7z) quc f cos(7z) re ¢ cos(7z) dz. (10.6.10) 
c (z — 1)(z — 2) c 2—2 c 2-1 
By Cauchy's integral formula with zp = 2 and zo = 1, 
j SESS do ecd team dim (10.6.11) 
C z—2 
and 
f aE esed (10.6.12) 
C27 1 


because zg = 1 and zo = 2 lie inside C and cos(mz) is analytic there. Thus the required 
integral has the value 


. Cos(mz) og agi 
(Sey! Ant. (10.6.13) 


e Example 10.6.2 


Let us use Cauchy’s integral formula to evaluate 


I= f G-196-3) dz. (10.6.14) 


We need to convert Equation 10.6.14 into the form Equation 10.6.7. To do this, we 
rewrite Equation 10.6.14 as 


— — M 
f. (z — 1)?(z — 3) g f (z—1) dz. (10.6.15) 


Therefore, f(z) = e*/(z — 3), n = 1, and z = 1. The function f(z) is analytic within 
the closed contour because the point z — 3 lies outside of the contour. Applying Cauchy's 
integral formula, 


j e? d |. 2ni d( € 
ls-2 (2 — D? (z — 3) PT dziz-3 


m" e” D e? = | mie 
Lern z—3 (z—-3y "o 
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Project: Computing Derivatives of Any Order of a Complex or Real Function 


'The most common technique for computing a derivative is finite differencing. Recently 
Mahajerin and Burgess? showed how Cauchy’s integral formula can be used to compute the 
derivatives of any order of a complex or real function via numerical quadrature. In this 
project you will derive the algorithm, write code implementing it, and finally test it. 


Step 1: Consider the complex function f(z) = u + iv, which is analytic inside the closed 
circular contour C of radius R centered at zg. Using Cauchy’s integral formula, show that 


n! 


fO? (zo) = PET ‘| "fits y) + iv(z, y)|[cos(n@) — isin(né)] dé, 


where x = zo + Rcos(0), and y = yo + Rsin(6). 


Step 2: Using five-point Gaussian quadrature, write code to implement the results from 
Step 1. 


Step 3: Test out this scheme by finding the first, sixth, and eleventh derivative of f(x) = 
Sr/(x? + 4) for x = 2. The exact answers are 0, 2.8125, and 1218.164, respectively. What 
is the maximum value of R? How does the accuracy vary with the number of subdivisions 
used in the numerical integration? Is the algorithm sensitive to the value of R and the 
number of subdivisions? For a fixed number of subdivisions, is there an optimal R? 


Problems 


Use Cauchy’s integral formula to evaluate the following integrals. Assume all of the contours 
are in the positive sense. 


sin? (z) sin? (z) 1 
* NEST " f. (z — 1/6) á E f z(z? + 4) id 


tan(z) j 1 f exp(z?) 
4. j dz 5. — —— ám dz 6. ——ÀÁ—- dz 
z=1 ^4 z—1|-1/2 (z — Dr — 2) "E 


10.7 TAYLOR AND LAURENT EXPANSIONS AND SINGULARITIES 


In the previous section we showed what a crucial role singularities play in complex 
integration. Before we can find the most general way of computing a closed complex integral, 
our understanding of singularities must deepen. For this, we employ power series. 


5 Mahajerin, E., and G. Burgess, 1993: An algorithm for computing derivatives of any order of a complex 
or real function. Computers & Struct., A9, 385-387. 
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One reason why power series are so important is their ability to provide locally a general 
representation of a function even when its arguments are complex. For example, when we 
were introduced to trigonometric functions in high school, it was in the context of a right 
triangle and a real angle. However, when the argument becomes complex, this geometrical 
description disappears and power series provide a formalism for defining the trigonometric 
functions, regardless of the nature of the argument. 

Let us begin our analysis by considering the complex function f(z), which is analytic 
everywhere on the boundary, and the interior of a circle whose center is at z = zp. Then, if 
z denotes any point within the circle, we have from Cauchy’s integral formula that 


_1 FO p_ d FO) 1 
ORT ELE = F TIN EIL: (10.7.1) 


where C denotes the closed contour. Expanding the bracketed term as a geometric series, 
we find that 


f)- 1 n f(Q) dixe aa) f AO dee 4G AY $ a det | 


271 G — Z0 = Zo) C (C 
(10.7.2) 
Applying Cauchy's integral formula to each integral in Equation 10.7.2, we finally obtain 
Z—4Z z — zo)” 
f(z) = f(zo) + (9) frag) Ass 22 As = 20)" F009 (29) ahi. (10.7.3) 


or the familiar formula for a Taylor expansion. Consequently, we can expand any analytic 
function into a Taylor series. Interestingly, the radius of convergence? of this series may be 
shown to be the distance between zp and the nearest nonanalytic point of f(z). 


e Example 10.7.1 
Let us find the expansion of f(z) = sin(z) about the point zp = 0. 
Because f(z) is an entire function, we can construct a Taylor expansion anywhere on 
the complex plane. For zo = 0, 
f(z) = f(0) + EF Oz+ gh O2 + 1f"(0)23 4... (10.7.4) 


Because f(0) = 0, f’(0) = 1, f"(0) 2 0, f” (0) = —1 and so forth, 


f(z) =z | Een (10.7.5) 


Because sin(z) is an entire function, the radius of convergence is |z — 0| < oo, i.e., all z. 


e Example 10.7.2 


Let us find the expansion of f(z) = 1/(1 — z) about the point zg = 0. 
From the formula for a Taylor expansion, 


f(z) = f(0) + 1/(0)z + Af" Oz + 1" (0)23 4 --.. (10.7.6) 


6 A positive number h such that the series diverges for |z — zo| > h but converges absolutely for 
|z — zo| < h. 
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Figure 10.7.1: Contour used in deriving the Laurent expansion. 


Because f? (0) = n!, we find that 
1 
$e =e pa eee (10.7.7) 
-z 
Equation 10.7.7 is the familiar result for a geometric series. Because the only nonanalytic 
point is at z = 1, the radius of convergence is |z — 0| < 1, the unit circle centered at z = 0. 


Consider now the situation where we draw two concentric circles about some arbitrary 
point zo; we denote the outer circle by C while we denote the inner circle by C1. See Figure 
10.7.1. Let us assume that f(z) is analytic inside the annulus between the two circles. 
Outside of this area, the function may or may not be analytic. Within the annulus we pick 
a point z and construct a small circle around it, denoting the circle by C5. As the gap or 
cut in the annulus becomes infinitesimally small, the line integrals that connect the circle 
C2 to Cı and C sum to zero, leaving 


I 2 .T JO FO 
fix- o E (10.7.8) 
Because f(¢) is analytic everywhere within C2, 
Tif (Zz) = FO 
2rif( )- f a (10.7.9) 
Using the relationship: 
j FCO) di = FO dé, (10.7.10) 
e 6-z a £—6 
Equation 10.7.8 becomes 
_ 1 FO) 1 O 
f= J f, ar d¢ + m ra (10.7.11) 
Now, 
a : E : (10.7.12) 


Ç=z 659—248 Ç= zol- (x— 29)/(6 — zo) 


— i | & — «0 ze) : "en e RU . eee 
E" | i =) G) d «(z) + 
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where |z — zo|/|C — zo| < 1 and 


z—( z—zo—(-^z29 z—2zol-—((-— zo)/(z — zo) (10.7.14) 
2 n 
= : k+ (E59) I (=) tt (2) teer, (10.7.15) 
sais Z= 20 zZ — zo Z — 2 


where |¢ — zo|/|z — zo| < 1. Upon substituting these expressions into Equation 10.7.11, 


ro | T FO d¢ 4 A f did 


21i Jo C — zo 2mi — 20) 


NC X $ : Jo dc 4 E 


1 1 
+ pHa + c cg HOG 20) dee 
z — zo 2ri (z — 29)? 2i Jo, 
1 1 
oor = nq ee 10.7.16 
toan f, Orta (10.7.16 
Or 
f(z)- 2 + 2 perp T e deiade tn o cd En 
z—z9 (z—zg) (z — z0)” 
(10.7. 17) 
Equation 10.7.17 is a Laurent expansion.” If f(z) is analytic at zo, then a1 = a» = ++: = 
an = ::: = 0 and the Laurent expansion reduces to a Taylor expansion. If zo is a singu- 


larity of f(z), then the Laurent expansion includes both positive and negative powers. The 
coefficient of the (z — 29)! term, a, is the residue, for reasons that will appear in the next 
section. 

Unlike the Taylor series, a Laurent series provides no straightforward method for ob- 
taining the coefficients. For the remaining portions of this section we illustrate their con- 
struction. These techniques include replacing a function by its appropriate power series, 
the use of geometric series to expand the denominator, and the use of algebraic tricks to 
assist in applying the first two methods. 


e Example 10.7.3 


Laurent expansions provide a formalism for the classification of singularities of a func- 
tion. Isolated singularities fall into three types; they are 


e Essential Singularity: Consider the function f(z) = cos(1/z). Using the expansion for 
cosine, 


1 1 1 1 
cos C) zm 912 + Ali glz dus (10.7.18) 


for 0 « |z| < co. Note that this series never truncates in the inverse powers of z. Essential 
singularities have Laurent expansions, which have an infinite number of inverse powers of 
z — zo. The value of the residue for this essential singularity at z = 0 is zero. 


T Laurent, M., 1843: Extension du théorème de M. Cauchy relatif à la convergence du développement 
d'une fonction suivant les puissances ascendantes de la variable x. C. R. l'Acad. Sci., 17, 938-942. 
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e Removable Singularity: Consider the function f(z) = sin(z)/z. This function has a 
singularity at z = 0. Upon applying the expansion for sine, 


PER Te] Fu | ... (10.7.19) 


sin(z) 1 A ae ME DP WE AE. 
= z af = 1 
z z 


for all z, if the division is permissible. We made f(z) analytic by defining it by Equation 
10.7.19 and, in the process, removed the singularity. The residue for a removable singularity 
always equals zero. 


e Pole of order n: Consider the function 


1 
—————À——. 10.7.20 
f()- G;—G41 40:720) 
This function has two singularities: one at z — 1 and the other at z — —1. We shall only 
consider the case z — 1. After a little algebra, 
1 1 1 1 1 
= = 10.7.21 
o= Gays 24-1) 2e- e-DA ( ) 
1 1 z—1 (z-1? (z-1)3 
= 1 H 7. 
DS 2 1 8 + (10.7.22) 
1 1 1 1 
= Eee (10.7.23) 


2(2—1)» 4(z—1)? 8(z—1) 16 

for 0 < |z — 1| « 2. Because the largest inverse (negative) power is three, the singularity 
at z — lis a third-order pole; the value of the residue is 1/8. Generally, we refer to a 
first-order pole as a simple pole. 


e Example 10.7.4 


Let us find the Laurent expansion for 


f(z)- G-1G-3) (10.7.24) 
about the point z — 1. 
We begin by rewriting f(z) as 
J 1-4 (z— 1) | 1 1-4 (z—1) 
ees (-1[-2-(:-10]  2(z-1)1-i(s- 1) Uic 
zu Io Dade to c2 vd (10.7.26) 
2 (z-1) 2 4 dn 
M ENE UK 3 ! 
Ez scs a (z — 1) 16 (z—1)5—-.- (10.7.27) 


provided 0 « |z — 1| « 2. Therefore we have a simple pole at z — 1 and the value of the 
residue is —1/2. A similar procedure would yield the Laurent expansion about z = 3. 
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e Example 10.7.5 


Let us find the Laurent expansion for 


P RENE ida M >0, n>0 (10.7.28) 
2 5 92 =I yeosh(a) +? eno enm ar 
about the point z = 0. 
We begin by rewriting f(z) as 
ze +27” 1 zZ +z grp 
= = : 10.7.29 
fe) (z— e9)(z—e-9)  2sinh(o) ( z — e? z— e72 ) ( ) 
Because 
— QUARO e^? (14 zet +z? eT? H.) (10.7.30) 
z— e? 1—ze-? zu 
if |z| < e“ and 
1 e? 22 
= = —e* - ze? 4 "p.e 10.7.31 
EIE TERT e? (1+ ze? + z?e?? + ---) (10.7.31) 


if |z| < e79, 


TE) E 


—Qo 


— 2sinh(a) 


if |z| < e~*. Clearly we have an nth-order pole at z = 0. The residue, the coefficient of all 
of the z^! terms in Equation 10.7.32, is found directly and equals 


sinh(no) 


Res[f (2);0] = 'sinh(a) ' 


(10.7.33) 


For complicated complex functions, it is very difficult to determine the nature of the 
singularities by finding the complete Laurent expansion, and we must try another method. 
We shall call it “a poor man's Laurent expansion." The idea behind this method is the 
fact that we generally need only the first few terms of the Laurent expansion to discover 
its nature. Consequently, we compute these terms through the application of power series 
where we retain only the leading terms. Consider the following example. 


e Example 10.7.6 


Let us discover the nature of the singularity at z — 0 of the function 


et 


f(z) = (10.7.34) 


zsinh(az)’ 


where a and ¢ are real. 
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We begin by replacing the exponential and hyperbolic sine by their Taylor expansion 
about z = 0. Then 
1+ iz-H022/24... 


f(2) = z(az + a323/6 +--+) 


(10.7.35) 
Factoring out az in the denominator, 


l-ctzT022/24... 
= : 10.7. 


Within the parentheses all of the terms except the leading one are small. Therefore, by long 
division, we formally have that 


1 


f(z) 2 —g (0 + tz c 8227/2 (1 — a?22/6 +-->) (10.7.37) 
az 
i 1 t 3-a? 
= — (1 + tz + t?2?/2—a?2? e. (10.7. 
"I z t?27/2—a?z7/64+---)= ae ae t (10.7.38) 


Thus, we have a second-order pole at z = 0 and the residue equals t/a. 
Problems 
1. Find the Taylor expansion of f(z) = (1 — z)~? about the point z = 0. 


2. Find the Taylor expansion of f(z) = (z — 1)e? about the point z = 1. (Hint: Don’t find 
the expansion by taking derivatives.) 


By constructing a Laurent expansion, describe the type of singularity and give the residue 
at zo for each of the following functions: 


3. f(z) = ze 7; zy =0 4. f(z) =z 3sin2(z); 2 =0 
5. f(z) = eh) - 1. x =0 6. f(z) = Gom 29-2 
7. f(z) = — ape 8. f(z) = d 2 = bi 
9. f(z) = mE zy =2 10. f(z) = pe), x =0 


10.8 THEORY OF RESIDUES 


Having shown that around any singularity we may construct a Laurent expansion, 
we now use this result in the integration of closed complex integrals. Consider a closed 
contour in which the function f(z) has a number of isolated singularities. As we did in the 
case of Cauchy’s integral formula, we introduce a new contour C" that excludes all of the 
singularities because they are isolated. See Figure 10.8.1. Therefore, 


fro) dz — NI z)dz —- je DLE fle): (10.8.1) 
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Figure 10.8.1: Contour used in deriving the residue theorem. 


Consider now the mth integral, where 1 < m < n. Constructing a Laurent expansion for 
the function f(z) at the isolated singularity z = Zm, this integral equals 


oo 1 oo 
z)dz — ak — dz bk Z — Zm)" dz. .8. 
fro 3 ZH CEPS D TA ) (10.8.2) 


Because (z — zm)“ is an entire function if k > 0, the integrals equal zero for each term in 
the second summation. We use Cauchy's integral formula to evaluate the remaining terms. 
The analytic function in the numerator is 1. Because d^-!(1)/dz^-! = 0 if k > 1, all of 
the terms vanish except for k = 1. In that case, the integral equals 27ia1, where a, is the 
value of the residue for that particular singularity. Applying this approach to each of the 
singularities, we obtain 


Cauchy’s residue theorem:? If f(z) is analytic inside and on a closed contour C (taken 


in the positive sense) except at points z1,23,..., za where f(z) has singularities, then 
TL 
f(z)dz =2ni X Res(f(2); z;], (10.8.3) 
C zm 
j=l 


where Res[f(z);z;|] denotes the residue of the jth isolated singularity of f(z) located at 
Z = BG. 


e Example 10.8.1 


Let us compute fiz=2 z?/(z + 1) dz by the residue theorem, assuming that we take the 
contour in the positive sense. 

Because the contour is a circle of radius 2, centered on the origin, the singularity at 
z = —] lies within the contour. If the singularity were not inside the contour, then the 


8 See Mitrinovié, D. S., and J. D. Keékié, 1984: The Cauchy Method of Residues: Theory and Ap- 
plications. D. Reidel Publishing, 361 pp. Section 10.3 gives the historical development of the residue 
theorem. 
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integrand would have been analytic inside and on the contour C. In this case, the answer 
would then be zero by the Cauchy-Goursat theorem. 

Returning to the original problem, we construct the Laurent expansion for the integrand 
around the point z — 1 by noting that 


a [(z+1) — 1]? 1 


= = 2 + 1). 10.8.4 
z+1 z+1 z+1 T ) ( ) 
The singularity at z = —1 is a simple pole and by inspection the value of the residue equals 
1. Therefore, 
z2 
j dz = 2ri. (10.8.5) 
ld-2 2+1 


As it presently stands, it would appear that we must always construct a Laurent expan- 
sion for each singularity if we wish to use the residue theorem. This becomes increasingly 
difficult as the structure of the integrand becomes more complicated. In the following 
paragraphs we show several techniques that avoid this problem in practice. 

We begin by noting that many functions which we will encounter consist of the ratio of 
two polynomials, i.e., rational functions: f(z) = g(z)/h(z). Generally, we can write h(z) as 
(z— 21)" (z — 22)" ---. Here we assumed that we divided out any common factors between 
g(z) and h(z) so that g(z) does not vanish at 21, zo, .... Clearly 2, z2,..., are singularities 
of f(z). Further analysis shows that the nature of the singularities are a pole of order m4 
at z = z1, a pole of order m» at z = z2, and so forth. 

Having found the nature and location of the singularity, we compute the residue as 
follows. Suppose that we have a pole of order n. Then we know that its Laurent expansion 
is 

a E 
iasg Lm "a LE eed bs bios ad Spes es (10.8.6) 


Multiplying both sides of Equation 10.8.6 by (z — 20)”, 


F(z) = (z— zo)” f(z) = an + Qn_1(z— 2o) +: -+bo(z — zo)" --bq(z — z0)" tt +--+. (10.8.7) 


Because F(z) is analytic at z = zo, it has the Taylor expansion 


F(z) = F(zo) + F'(zo)(z — zo) - + -— zo) 5 peris (10.8.8) 


Matching powers of z — zp in Equation 10.8.7 and Equation 10.8.8, the residue equals 


(10.8.9) 


Substituting in F(z) = (z — zo)" f(z), we can compute the residue of a pole of order n by 


dn-1 


li 
(n— Di 22; dzn-1 


Res[f(z); 2] = le sre) (10.8.10) 


Complex Variables 475 


For a simple pole, Equation 10.8.10 simplifies to 


Res[f(z);z;] = lim (z — z;)f (2). (10.8.11) 


Z> Zj 


Quite often, f(z) = p(z)/q(z). From l'Hóspital's rule, it follows that Equation 10.8.11 
becomes 


Res[f(z); z;] = plzz) l (10.8.12) 


Recall that these formulas work only for finite-order poles. For an essential singularity we 
must compute the residue from its Laurent expansion; however, essential singularities are 
very rare in applications. 


e Example 10.8.2 


Let us evaluate 


et 
co zt+a 
where C is any contour that includes both poles at z = +ai and is in the positive sense. 
From Cauchy’s residue theorem, 
ei? ei? ei? 
$ -z 3 d = 2mi Res| -3zi | +Res| — —5;-—ai ||. (10.8.14) 
c2 +a z? +a? 22+ a? 
The singularities at z = tai are simple poles. The corresponding residues are 
ei? ei? e 
Res| ———z;ai| = li j = 10.8.15 
(3 +a? 2 d a) (z—ai)(z+at) 2ia ( ) 
and 
e e” ea 
Res| 5——z;;-ai|-— li F ai = : 10.8.16 
(= +a?’ 2 UBL oH) (z — at)(z + ai) 2ia ( ) 
Consequently, 
ei? 2m m 27...) 
f, EET dz = 9s (e*—e*)- EDS sinh(a). (10.8.17) 
e Example 10.8.3 
Let us evaluate 
: a d 10.8.18 
xij aaa 5 (TOSS) 


where C includes all of the singularities and is in the positive sense. 
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The integrand has a second-order pole at z = 0 and two simple poles at z = —1 +1, 
which are the roots of z? + 2z + 2 = 0. Therefore, the residue at z = 0 is 
et 1 d et? 
———————;0 = li 0)? 10.8.19 
Res| em 220 nie ) E ( ) 
tet? (2z + 2)e'* t-1 
= li = j 10.8.20 
tin, | aS (22 +22 4 2)? 2 ( ) 
The residue at z = —1 + iis 
et e 
3 s-l+i}= 1 Fi 10.8.21 
Hes È (22 + 2242)’ i QE Ce la z?(z? + 22 4 2) ( ) 
z+1-1 
2 (im E =) ( lim 575713 5) (10.8.22) 
exp[(—1 + i)t] _ exp[(-1- 0t] 
= = 10.8.23 
2i(—1 + i)? 4 ( ) 
Similarly, the residue at z = —1— i is 
e et? 
;—l1—iļ = li 1-i 10.8.24 
Res | (72 3-22 3-2)! ] ue M E) ( ) 
NE ee : z+1+i 
5 (im z) ( im 23:12 08:28) 
exp[(-1- i) _ exp[(-1- dt] 
= = . 10.8.26 
(-2i)(—1 — i? 4 ( ) 


Then by the residue theorem, 


1 et? et tz 
dz —R ;0| +R cay 
xij. xA n Scr zer | Je ] 


z?(z? +224 3) 


t—1  expl(-1-i)] , exp[(—1—1)t] 
= 1 i 1 (10.8.28) 
= 4 [t — 1 + e^ *cos(t)]. (10.8.29) 


Problems 


Assuming that all of the following closed contours are in the positive sense, use the residue 
theorem to evaluate the following integrals: 


z+1 (z 4-4)? f 1 
l —— d 2. —— d 3. — — d 
f P — 223 * f z4 + 523 + 622 i |z|21 1 — e7 7 


2. 4 3 
4. j id zdz 5. j rn dz 6. j ze dz, n>0 
lz-2 (z — 1) ls22 27 1 [gest 
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Figure 10.9.1: Contour used in evaluating the integral, Equation 10.9.1. 


2+ 4cos(1z) 
7. j e'/? cos(1/z) dz 8. j — ~ dz 
iva qa) lgsa 2(2—1)? 


10.9 EVALUATION OF REAL DEFINITE INTEGRALS 


One of the important applications of the theory of residues consists in the evaluation of 
certain types of real definite integrals. Similar techniques apply when the integrand contains 
a sine or cosine. See Section 11.4. 


e Example 10.9.1 


Let us evaluate the integral 


? d 1[? d 
n Z = ] -h (10.9.1) 
o etl 2jJ_-əæœ%“ +1 
This integration occurs along the real axis. In terms of complex variables, we can rewrite 
Equation 10.9.1 as 
Wt d 1 d 
i Z = f =a (10.9.2) 


where the contour C is the line S(z) = 0. However, the use of the residue theorem requires 
an integration along a closed contour. Let us choose the one pictured in Figure 10.9.1. 'Then 


¢ dz = dz Ji dz (10.9.3) 
cz2+1 oa z2 +1 c, 2 +1’ Sa 


where C denotes the complete closed contour and C5 denotes the integration path along 
a semicircle at infinity. Clearly we want the second integral on the right side of Equation 
10.9.3 to vanish; otherwise, our choice of the contour C5 is poor. Because z = Re® and 


dz = iRe?! dé, 
dz 7 iRexp(6i) 
| A = J 1 L D2.--(905 DE np D2 4 0, (10.9.4) 
c, 25-1 o 1+ R? exp(201) mor" 


which tends to zero as R — oo. On the other hand, the residue theorem gives 


d 1 —i 1 
eod = 2ri Res -2milim A ——. = mi x — = m. (10.9.5) 
owt! 24+ ZFT zi z? +1 2i 
Therefore, 
? d 
n LT (10.9.6) 
0 ue + 1 2 


Note that we only evaluated the residue in the upper half-plane because it is the only one 
inside the contour. 
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This example illustrates the basic concepts of evaluating definite integrals by the residue 
theorem. We introduce a closed contour that includes the real axis and an additional 
contour. We must then evaluate the integral along this additional contour as well as the 
closed contour integral. If we properly choose our closed contour, this additional integral 
vanishes. For certain classes of general integrals, we shall now show that this additional 
contour is a circular arc at infinity. 


Theorem: /f, on a circular arc Cg with a radius R and center at the origin, zf (z) > 0 
uniformly with |z| € Cr and as R — œ, then 


Jm A f(z) dz =0. (10.9.7) 


The proof is as follows: If |zf(z)| < Mg, then |f(z)| € Mg/R. Because the length of 
Cg is aR, where a is the subtended angle, 


(z) dz 


M 
< — aR = oMg 0, (10.9.8) 
Cr R 


because Mg — 0 as R > oo. 


e Example 10.9.2 


A simple illustration of this theorem is the integral 


d L > id (10.9.9) 
cup ST 1. Jas ERI UO 


A quick check shows that z/(z? + z +1) tends to zero uniformly as R — oo. Therefore, if 
we use the contour pictured in Figure 10.9.1, 


E dx dz 1 " 
5 = = 277 Res $05 
-æT Ta c2 +z+1 z2? +z+1 


2mi li : zl (10.9.11) 
= LT im = ——. J. 
23-44 XS; N2z +1 V3 


+ 

S 
>. 

h 

T 
> 
e 
HM 
= 


e Example 10.9.3 


Let us evaluate 


? dx 
/ saat (10.9.12) 
0 
In place of an infinite semicircle in the upper half-plane, consider the following integral 
dz 
f, si (10.9.13) 


where we show the closed contour in Figure 10.9.2. We chose this contour for two reasons. 
First, we only have to evaluate one residue rather than the three enclosed in a traditional 
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Figure 10.9.2: Contour used in evaluating the integral, Equation 10.9.13. 


upper half-plane contour. Second, the contour integral along C3 simplifies to a particularly 
simple and useful form. 


Because the only enclosed singularity lies at z = e7*/°, 
d 1 . — pTi/6 
—— _ = Jri Res ;en/6| =2mi lim E (10.9.14) 
c 2841 z9 4-1 zoeti/6 2641] 
1 pu 
=2ri lim c. = = (10.9.15) 


Let us now evaluate Equation 10.9.12 along each of the legs of the contour: 


d oo 
f — =i Hm (10.9.16) 
c, 2 +1 o X91 
/ M E =0, (10.9.17) 
Co z + 1 
because of Equation 10.9.7 and 
d ° eri/3 q TOLAE 
i xe 8j ee gg ed (10.9.18) 
e, 21 as ness o «ec? +1 
since z = re™/3, 
Substituting into Equation 10.9.15, 
(1 = ent) i E e ae (10.9.19) 
v 44d 3 = 
or 
i? dr mi 2ie7i/6 u T OT (10.9.20) 
o z6+1 6 eri/6 (eri/6 — e=7i/6)  6sin(r/6) 3 UA 
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4 


y 

(—R,2ni) C3 (R,2ni) 
C; LIE C2 

(-R,0) C; (RO) ^ 


Figure 10.9.3: Rectangular closed contour used to obtain Equation 10.9.31. 
e Example 10.9.4 


Rectangular closed contours are best for the evaluation of integrals that involve hyper- 
bolic sines and cosines. To illustrate? this, let us evaluate the integral 


78 sin(ax) sinh(x) |. _ T sin(ax) sinh(x) 4 ff sinh(a)e!* 
0 


[b--cosh(z)]? ^ — J 4 [b+ cosh(2)? Los b+ cosh(z) ^ |" 


(10.9.21) 
where a > 0 and b > 1. 
We begin by determining the value of 


j sinh(z)e?** 

= dz 

c [b+ cosh(z)]? 

about the closed contour shown in Figure 10.9.3. Writing this contour integral in terms of 
the four line segments that constitute the closed contour, we have 


sinh(z)e/** sinh(z)e/** f sinh(z)e?** 
pecans i sa, liana ai MM ieee od sc, 
f. [b + cosh(z)] ^ [. [b + cosh(z)]? Rd c, [b+ cosh(z) ^^ 


sinh(z)e/4* sinh(z)e/4* 
— d ————— — dz. 10.9.22 
T A bro o [. b-Ecosh(z) ^ ( ) 


Because the integrand behaves as e7? as R — oo, the integrals along C5 and Cy vanish. 
On the other hand, 


sinh(z)e’** i sinh(x)e’*” 
———————— = ——— ————cs d 10.9.23 
J, [b + cosh(z)]? gs _oo [b+ cosh(x)]? K ( ) 
i mh(z)e' h(z)e' 
sinh(z)e'^* ?? sinh(z)e'** 
dz =—e ?re d 10.9.24 
i [b + cosh(z)]? T E J. [b + cosh(x)]? is ( ) 


because cosh(x + 277) = cosh(x) and sinh(x + 2ri) = sinh(x). 


9 This is a slight variation on a problem solved by Spyrou, K. J., B. Cotton, and B. Gurd, 2002: 
Analytical expressions of capsize boundary for a ship with roll bias in beam waves. J. Ship Res., 46, 
167-174. 
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Within the closed contour C, we have a single singularity where b + cosh(z,) = 0 or 
e* = —b— Vb? — 1 or zs = In(b+ Vb? — 1) + mi. To discover the nature of this singularity, 
we expand b + cosh(z) in a Taylor expansion and find that 


b + cosh(z) = sinh(z,)(z — zs) + cosh(z,)(z — z;)? +--+. (10.9.25) 


Therefore, we have a second-order pole at z = z,. Therefore, the value of the residue there 
is 


inh iaz inh iaz 
Rel d = lim | — EH (10.9.26) 
[b + cosh(z)] z325 dz | sinh" (z,) + sinh(z,) cosh(z,)(z — z,) +-+- 
u iae 7? : 2i 
E A exp[iacosh ^ (b)]. (10.9.27) 
Therefore, 
l. sinh(x)e’*” 2zaexp|-7a 4-aicosh '(b)] ma explaicosh™+(b)] 
£ = = 
—oo [b + cosh(x)]? (1 — e-?72) sinh(zs) vb? — 1 sinh(za) 
(10.9.28) 
because 
1 1 
sinh(z,) — b b? — 14 = b2 — 1. 10.9.29 
(o) = 3-0- VR LÁ (10.9.29) 
Substituting Equation 10.9.28 into Equation 10.9.21 yields 
l sin(ax) zur dy a 7’ sin[a cosh (b) (10.9.30) 
o [b+ cosh(z)] 20? — 1 sinh(za) 


e Example 10.9.5 


The method of residues is also useful in the evaluation of definite integrals of the form 


Ld Fsin(0), cos(0)] dð, where F is a quotient of polynomials in sin(0) and cos(0). For 
example, let us evaluate the integral!? 


2n 3 
I =| tio dó,  a»l. (10.9.31) 
0 =y 


We begin by introducing the complex variable z = e/?. This substitution yields the 


closed contour integral 
1 (z? + 1)? dz 
I= 10.9.32 
2i f, (22 + 1)? — 4a222 22’ ( ) 


where C is a circle of radius 1 taken in the positive sense. The integrand of Equation 10.9.32 
has five singularities: a second-order pole at z5 = 0 and simple poles located at 


zy = —a— va? — 1, 29 = —a + V a? — 1, (10.9.33) 


10 Simplified version of an integral presented by Jiang, Q. F., and R. B. Smith, 2000: V-waves, bow 
shocks, and wakes in supercritical hydrostatic flow. J. Fluid Mech., 406, 27-53. 
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23 —a— vVa?—1, and z4—a-wya?-]1. (10.9.34) 


Only the singularities z2, z3, and z5 lie within C. Consequently, the value of I equals 27; 
times the sum of the residues at these three singularities. The residues equal 


(22 SH Ly 5 
; H 1 
Red z?l(z? + 1)? — 4a?z?]' * " 
2 1 3 = 2—1 
AE m oco a (10.9.35) 
zca yai, 2 pa-aby gis + 1)? — 4a?2? 
: (2? + 1)? 
= l 10.9.3 
NE ert Az?(z? + 1— 2a?) ( 6) 
2-4 2_71)8 
Sa C Rule fa E ; (10.9.37) 
(2a? — 1 — 2aya? — 1 )(a? — 1 — ava? — 1) 
(2? zn i)? 
R : 2—1 
sl z?|(z2 + 1)? — 4a?z?]’ * s 
2 1 3 z 2. .1 
o dun 26 9 Dio up. a e (10.9.38) 
zca-Val-i 2?  sz3a—Vai-i (2? + 1)? — 4a? 2? 
2 1 3 
= lim C TU (10.9.39) 
zma- va? 429 (22 + 1 — 2a?) 
2(5.. 2—1) 
= N ED) , (10.9.40) 
(2a? — 1 — 2av a? — 1 )(a? — 1 — ava? — 1) 
and 
(r^ + 1)? 
uo { z?|(z? +1)? — 4a?z?]’ ? 
d (22 + 1) 
= li 10.9.41 
220 dz la +1)? — 4a?2? ) 
2 1.14 — 4a9222(22 1? 4 24 4)3 244 242 
vh 6z|(z? + 1) a*z*(z* + 1)*] — 4z(z FE a?) (10.9.42) 
z—0 [(z2 +4 1)? - 4a?z?]? 
zd (10.9.43) 
Summing the residues, we obtain 0. Therefore, 
2T 3 
n WE. dei ge, (10.9.44) 
o cos?(0)— a? 


Problems 


Use the residue theorem to verify the following integrals: 


Lf dx _ 7v2 bu dx T 
0 


21-1 4 oo (23244-2452 2 
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jos (z2-1)(z242z-2) 5 Eu 7-34 
. 0 (a? +1)? = 4 . o (zx? + 1) (£2 + 4)? = 288 
T. 
i me - i b>0 
oo @ FAR? Wary 7 
8. 
k e T a—h 
dt — 1 h 
J (t? + 1) [t2 (a/h +1) + (a/h — 1)] i | ea A 
9. 
a d = e 
= à a 
o  G-sin*(0) 2/a+a? 
10. " 
. 0 
Jj : :42 = : E b>a>0 
o  a?cos?(0)--0?sin^(0) ^ 2ab 
11. : 
T B 0 
/ sin“ (0) d= 7, (a "E p), ck 
o a+ bcos(0) b2 
12. 5 , 
eem ps 
d9 = 2r z 1 =0,1,2,... 
J 1 + 2rcos(0) + r? Tp exque »|r, 20,12, 
13. ] 
7 2x (2n)! 
: 2n 
0) dd = ——_— 
I x (2^1)? 
14. 
T = Va n 
/ SOOO) vane o SOE VOCUS e pu RES 
—4 cos(#) +a PUES 
Hint: 
2/o? —1 = 1 1 
Peel Copqqke WWE mda 
15. 2 (n) 
cos(n z 
dé = reor 0 0 
f cosh(a) — cos(0) sinha) , a#0, n2 


Hint: See Example 10.7.5. 


16. Show that 


| i x? T 
dz = ; 
o (l-r?) +a? 2|a| 
where a is real and not equal to zero. Hint: Show that the poles of 


z2 


(1 _ 22)? + a2z2 


f(z) = 
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are simple and equal 


4 (+V4—a? + |ali), if 0«]|a| <2, 


ui $ (lal a? — 4), if 2 < lal. 
If |a| = 2, we have second-order poles at zn = +i. 
17. Show that cates B 
J, cosh? (bæ) 7 —3]9smhasjQoy > 


Hint: Evaluate the closed contour integral 


elt 
j amus de 
c cosh“ (bz) 


where C is a rectangular contour with vertices at (oo, 0), (—00,0), (oo, 7/b), and (—oo, m/b). 


18. Show!! that 


a da: _ J 2a/sinh(a), if a#0, 
o cosh(x) cosh(z +a) - 2, if a=2. 


Hint: Evaluate the closed contour integral 


z 
c cosh(z)cosh(z--a) '' 
where C is a rectangular contour with vertices at (oo, 0), (—00,0), (00,7), and (—o0, 7). 


19. During an electromagnetic calculation, Strutt!? needed to prove that 
8 8 p 


per 9g x (ni T i |z| < m. 
Verify his proof by doing the following: 
Step 1: Using the residue theorem, show that 
1 sinh(rz) dz sinh(oz) N (—1)* sin [(n + §) x] 
2ni Soy T cosh(rz) z-o T cosh(or) £x c —i(nd i) i 
where Cy is a circular contour that includes the poles z = ø and z, = i (n + i), n 


0,1,2,...,N. 


11 See Yan, J. R., X. H. Yan, J. Q. You, and J. X. Zhong, 1993: On the interaction between two 
nonpropagating hydrodynamic solitons. Phys. Fluids A, 5, 1651-1656. 


12 Strutt, M. J. O., 1934: Berechnung des hochfrequenten Feldes einer Kreiszylinderspule in einer konzen- 
trischen leitenden Schirmhülle mit ebenen Deckeln. Hochfrequenztechn. Elecktroak., 43, 121—123. 


Complex Variables 485 


Step 2: Show that in the limit of N — oo, the contour integral vanishes. Hint: Examine 
the behavior of z sinh(zz)/[(z — e) cosh(»2z)] as |z| — oo. Use Equation 10.9.7 where Cr is 
the circular contour. 


Step 3: Break the infinite series in Step 1 into two parts and simplify. 


In the chapter on Fourier series, we shall show how we can obtain the same series by 
direct integration. 


10.10 CAUCHY'S PRINCIPAL VALUE INTEGRAL 


The conventional definition of the integral of a function f(x) of the real variable x over 
a finite interval a € x € b assumes that f(x) has a definite finite value at each point within 
the interval. We shall now extend this definition to cover cases when f(x) is infinite at a 
finite number of points within the interval. 

Consider the case when there is only one point c at which f(x) becomes infinite. If c 
is not an endpoint of the interval, we take two small positive numbers e and 7 and examine 
the expression 


Me Nes que (10.10.1) 


ccn 
If Equation 10.10.1 exists and tends to a unique limit as e and 7 tend to zero independently, 
we say that the improper integral of f(x) over the interval exists, its value being defined by 


b c—e b 
f f(x)dx = lim f f(a) dx + n I: f (x) da. (10.10.2) 


If, however, the expression does not tend to a limit as € and 7 tend to zero independently, 
it may still happen that 


lim ti f(x)dx + i f(x) a} (10.10.3) 


e€—0 mI 


exists. When this is the case, we call this limit the Cauchy principal value of the improper 
integral and denote it by 


PV A f (x) da. (10.10.4) 


Finally, if f(x) becomes infinite at an endpoint, say a, of the range of integration, we say 
that f(x) is integrable over a € x < b if 


b 
lim ee f (x) dx (10.10.5) 


e30* 


exists. 
e Example 10.10.1 


Consider the integral jen dx/x. This integral does not exist in the ordinary sense 
because of the strong singularity at the origin. However, the integral would exist if 
© dx ? dx 


lim — + lim — (10.10.6) 
«70 J_, X 020Js £ 
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Figure 10.10.1: Contour C used in Example 10.10.2. 


existed and had a unique value as € and 6 independently approach zero. Because this limit 
equals 
lim [In(e) + In(2) — In(6)] = lim [In(2) — In(d/e)], (10.10.7) 
€,6—0 €,0—0 
our integral would have the value of In(2) if 6 = e. This particular limit is the Cauchy 
principal value of the improper integral, which we express as 


pv f a = In(2). (10.10.8) 


We can extend these ideas to complex integrals used to determine the value or prin- 
cipal value of an improper integral by Cauchy’s residue theorem when the integrand has a 
singularity on the contour of integration. We avoid this difficulty by deleting from the area 
within the contour that portion which also lies within a small circle |z — c| = € and then 
integrate around the boundary of the remaining region. This process is called indenting the 
contour. 

The integral around the indented contour is calculated by the theorem of residues 
and then the radius of each indentation is made to tend to zero. This process gives the 
Cauchy principal value of the improper integral. T'he details of this method are shown in 
the following examples. 


e Example 10.10.2 


Let us show that 


py [ CE d (10.10.9) 
Lo, A? — x? a 
Consider the integral l 
$ ck (10.10.10) 
C af — z 


where the closed contour C consists of the real axis from —R to R and a semicircle in the 
upper half of the z-plane where this segment is its diameter. See Figure 10.10.1. Because 
the integrand has poles at z = +a, which lie on this contour, we modify C by making an 
indentation of radius e at a and another of radius 7 at —a. The integrand is now analytic 
within and on C and Equation 10.10.10 equals zero by the Cauchy-Goursat theorem. 
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Evaluating each part of the integral, Equation 10.10.10, we have that 
T jiR cos(0)— Rsin(0) iz iz 
€ E € € 
li -B = HB iRe" d0 + Ja lud dz + f. 2z dz 


—a-1) cit a—e eit R eit 
«f ait | da + I 5 dr —0, (10.10.11) 
x a 


-R ate dg d^ cq — x 


where C and C5 denote the integrals around the indentations at a and —a, respectively. 
The modulus of the first term on the left side of Equation 10.10.11 is less than tR/(R?—a?) 
so that this term tends to zero as R — oo. To evaluate C1, we observe that z = a + ce” 
along Ci, where 0 decreases from 7 to 0. Hence, 


gt 9 0i cic? 
> dz = li j jee”) —— d0 10.10.12 
J, gz 0 : exp (ia + iee’) —9aec8i — e2e20i d ( ) 
T i i miet! 
=li exp (ia + iee” - dó = . 10.10.13 
en 0 xP (ia rer ) 2a + celi 2a ( ) 
Similarly, 
e” jp ia 
f pe = -_, (10.10.14) 
Cy a^ — % 2a 


as 7 tends to zero. 
Upon letting R — oo, e > 0, and 7 — 0, we find that 


ee Ti ia ia 7 Sin(a) 
py [ Gud -—-» (e^ —e t^) = LI 5 (10.10.15) 
Finally, equating the real and imaginary parts, we obtain 
py f SD pp py [ EOS das (10.10.16) 
Loo Q9 — x? a Los Q2 — x? 
e Example 10.10.3 
Let us show that "D 
J c P (10.10.17) 
MEE 
Consider the integral 
j c du (10.10.18) 
C zZ 


where the closed contour C consists of the real axis from —R to R and a semicircle in the 
upper half of the z-plane where this segment is its diameter. Because the integrand has a 
pole at z — 0, which lies on the contour, we modify C by making an indentation of radius 
€ at z = 0. See Figure 10.10.2. Because e**/z is analytic along C, 


T B ee =ë eit ei R ei 
n aa dicm ido. f cds. f dis f — dz =0. (10.10.19) 
0 -R T e, * e T 
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4 


Figure 10.10.2: Contour C used in Example 10.10.3. 


Since e- Fsin(€) < eF? for 0 — 9 — m, 


T iRcos(9)—Rsin(6); " _ Re 1—e 7 
e id0| < e dà = —————, (10.10.20) 
0 0 


which tends to zero as R — oo. Therefore, 


=E ei oo e e^ 
/ — da + f dx = | dz. (10.10.21) 
eo x € x e zZ 


[Ze] Ej dz f $4 ri (10.10.22) 
€, 4 Có c, 2 


in the limit € > 0 because z = cet. Consequently, in the limit of € > 0, 


Now, 


py [ t de =n. (10.10.23) 
"n x 


Upon separating the real and imaginary parts, we obtain 


py f cos(x) dz = 0, " sin(z) dx = m. (10.10.24) 
o5 x —o0o0 T 
Problems 


1. Noting that 


0—e 
[ e dy MEN [1(8 4 ¢)] 
— n 
o cos(y) —cos(#)  sin(0) |sin|l(0— ¢)] 0 
and T 
T dp _ 1, |sn[3@+9)] 
e. cos(y) —cos(0) —sin(@) | sin [$(0 — y)] are 
show that 


P SS SSS ee s 
vf CORO 0, 0<0<T 
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2. Using f(z) = e/77/2/(2? — 1), show that 


iz cos(7a/2) d 


z2—1 


v= TT. 
—oo 


3. Show that 


co pax _ pbx 
[FEE t= sento) cotn), aene 


— oo 


Use a rectangular contour with vertices at (—R,0), (R,0), (—R,7), and (R,«) with a 
semicircle indentation at the origin. 


4. Showt? that 


? 1-eosDae(r--Q) , — T in 20 
Eo isst apart Oe aer) ee ea) 


e 2ae [(¢? — a”) cos(2aC) + 2a¢ sin(2ac)] h 


where a, a, and Ç are real. Use a semicircular contour of infinite radius with the real axis 
as its diameter. 


5. Using the complex function e’”*/(z — a) and a closed contour similar to that shown in 
Figure 10.10.2, show that 


rv f” coama) ————— dr = —m sin(ma), and py f” anne) —— dx = 7 cos(ma), 


eG L-a 
where m > 0 and a is real. 


6. Using a closed contour similar to that shown in Figure 10.10.2, except that we now have 
two small semicircles around the singularities on the real axis, show that 


= une i ge ni 
PV —— dr = -ri P dz = 3mi mi 
/ Sog Tt, and vf oes z à (6 gps 


— oo 


where m > 0. 


7. Redo Example 10.10.3 except the contour is now a rectangle with vertices at +R and 
+R + Ri indented at the origin. 


8. Let us show! that 
da lo| 2-1 


1 at 
Gla) = Pv | = o oe a|vo2 —1' , 
arra ca {| MET Jubet 


13 Ko, S. H., and A. H. Nuttall, 1991: Analytical evaluation of flush-mounted hydrophone array response 
to the Corcos turbulent wall pressure spectrum. J. Acoust. Soc. Am., 90, 579-588. 


14 Ott, E., T. M. Antonsen, and R. V. Lovelace, 1977: Theory of foil-less diode generation of intense 
relativistic electron beams. Phys. Fluids, 20, 1180-1184. 
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Step 1: Using the transformation 2iz = z — z^, show that 


G(a) = Pv $ as 


lei 2° + 2iag — 1? 


which has singularities at z = +V1— a? + ai. 


Step 2: To evaluate G(a) given in Step 1, the principal value can be evaluated using 


dz dz 
Gla) = eae ee ——— 
ites Z? + 2097 — 1 [sige E + 2iaz — 1 


where e — 0*. Use the residue theorem and evaluate these contour integrals, yielding the 
desired result. 


9. Let the function f(z) possess a simple pole with a residue Res[ f(z); c] on a simply closed 
contour C. If C is indented at c, show that the integral of f(z) around the indentation 
tends to —Res|f(z);c]ai as the radius of the indentation tends to zero, a being the internal 
angle between the two parts of C meeting at c. 


10.11 CONFORMAL MAPPING 


Conformal mapping is a powerful technique for finding solutions, or for simplifying 
the process of finding solutions, to Laplace’s differential equation in two dimensions. This 
method involves introducing two complex variables: z = x + iy and T = p + io. These 
two complex variables are related to each other via the mapping z = f(r). Under this 
mapping the Argand diagram for the z-variable is mapped into one for the r-variable. In 
certain cases, for example T = yZ, the complex z-plane may only map into a portion of the 
T-plane. In other cases, say T = z + 3i, the complete z-plane would be mapped into the 
complete 7-plane. 

Once we map the original domain into a simpler geometry (a half-plane, circle or 
square), how do we find the solution? There are several techniques available. One method, 
for example, recalls that the real and imaginary part of an analytic function satisfies La- 
place's equation. Therefore, if we could construct an analytic function whose real or imagi- 
nary part satisfies the boundary conditions in the new domain, we would have the solution 
in the 7-plane. Then we could use the transformation to obtain the solution in the original 
z-plane. 

What types of functions f(z) are useful? Consider an arbitrary point zo in the complex 
z-plane. Assuming that f’(zo) Z 0, a straightforward transformation yields 


u u j 2 (0v wv 
CI E TOM 


where u(x, y) and v(p, c) are solutions to Laplace's equation in the z and 7 planes, respec- 
tively. Thus, f(z) must be analytic. 
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Figure 10.11.1: The conformal mapping used to find the fields of a semi-infinite ring head with a finite gap 
of width a. The potential on the right pole face equals 1 while the potential of the left pole face equals —1. 
In the z-plane the point A is located at (0,00) while point B is located at (0, —oo). Because of symmetry 
the potential along the center of the gap AB equals 0. 


e Example 10.11.1 


In their study of magnetic recording, Curland and Judy!? modeled the ring heads as 
two semi-infinite regions located below the z-axis and running to the right of x = a/2 and 
to the left of x = —a/2. See Figure 10.11.1. 

From symmetry we need only consider the half-space x > 0. Consequently, the new 
boundary consists of the four line segments: AB, BC, CD and DE. If we require that the 
point D in the 7-plane lies at 7 = 1, we shall show in Example 10.11.7 that the desired 
conformal mapping is 


iv slog(— 7) a (10.11.2) 


A useful method for illustrating this conformal mapping is to draw lines of constant p and a 
in the z-plane. See Figure 10.11.2. This figure shows the local orthogonality between lines 
of constant p and o. 

The greatest difficulty in creating this figure was computing 7 for a given z. This was 
done using the Newton-Raphson method. Starting at the top of the domain, the first guess 
there was given by T = 1-- 727. Marching downward, the 7 from the previous grid point 
was used for the initial guess. The corresponding MATLAB script is: 


clear; delta = 0.01; 4 resolution of the grid 


for jj 1:201 

for ii 1:201 
XX(jj,ii) = delta*ii; YY(jj,ii) = delta*(jj-101); 
RHO(jj,ii) = NaN; SIGMA(jj,ii) = NaN; 

end; end 


^4 code for the domain z,y > 0 


15 Curland, N., and J. H. Judy, 1986: Calculation of exact ring head fields using conformal mapping. 
IEEE Trans. Magnet., MAG-22, 1901-1903. 
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x/a 


Figure 10.11.2: Lines of constant p (dashed lines) and o (solid lines) given by the conformal mapping 
expressed by Equation 10.11.2. 


for jj = 1:100 
y = 1 - delta*(jj-1); 
for ii - 1:201 
x = delta*ii; z = complex(x,y); 
if (jj == 1) tau = 1*pi*pi*z*z; else tau = TAU(ii); end 
for icount - 1:10 
tempi = sqrt(tau-1); 
temp2 = tempi - O.5*i*log(i-i*tempi) + 0.5*ixlog(1+i*temp1) ; 
ff = temp2/pi + 0.5 - z; deriv = tempi /(2*pixtau) ; 
temp3 - ff/deriv; tau - tau - temp3; ^ Newton-Raphson method 
end 
TAU(ii) = tau; RH0(202-jj,ii) = real(tau); 
SIGMA(202-jj,ii) = imag(tau); 
end; end 


^ code for the domain 0<a< i and y <0 


for jj = 1:101 
y = delta - delta*jj; 
for ii - 1:49 
x = delta*ii; z = complex(x,y); 
tau = TAU(ii); % first guess 
for icount - 1:10 
tempi = sqrt(tau-1); 
temp2 = tempi - O.5*i*log(i-i*tempi) + 0.5*ixlog(1+i*temp1) ; 
ff = temp2/pi + 0.5 - z; deriv = tempi /(2*pi*tau); 
temp3 = ff/deriv; tau = tau - temp3; % Newton-Raphson method 
end 
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y/a 


x/a 


Figure 10.11.3: The solution to Laplace’s equation when the left boundary is held at 0 while the left and 
top sides of the shaded rectangle are held at 1. This figure shows only a portion of the domain x > 0 and 
ly] < co. 


TAU (ii) = tau; RHO(102-jj,ii) = real(tau); 
SIGMA(102-jj,ii) = imag(tau); 
end; end 


^4 plot the conformal mapping Equation 10.11.2 


figure 

[C,h] = contour(XX,YY,SIGMA, [0.002,0.05,0.5,5,10,20,30],’k’); 
clabel(C,h,’FontSize’ ,10,’Color’,’k’,’Rotation’ ,0) 
xlabel(’x/a’,’FontSize’ ,20); ylabel(’y/a’, ’FontSize’ ,20) ; 
hold on 

v = [-8,-4,0,4,10,20,30]; 

[C,h] = contour(XX,YY,RHO,v,’--b’); 

clabel(C,h,’FontSize’ ,10,’Color’,’b’,’Rotation’ ,0) 


Now that we can transform between the z-plane and the 7-plane, and vice versa, let us 
turn our attention to finding the solution to Laplace’s equation in the 7-plane. There the 
solution equals 1 for p > 0 and 0 for p < 0 along o = 0. 

Consider now the analytic function (except at the branch point 7 = 0) 


f(r) =i—log(r)/r. (10.11.3) 


A quick check (using T = ret?) shows that the imaginary part of f(T), v(r,9) = 1 — 0/7, 
satisfies Laplace’s equation and the boundary conditions. Thus, constructing the solution 
is as follows: For a given x and y, we use our MATLAB code to compute T. Substituting 
that 7 into Equation 10.11.3 we compute f(r). Taking the imaginary part, we have the 
solution at x and y. Figure 10.11.3 illustrates the solution for the domain 0 < x < 2 and 
-l<y<l. 
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Figure 10.11.4: The conformal mapping between the z-plane and 7-plane achieved by the conformal 


mapping T = Vz? + a2. 


In summary, conformal mapping allowed us to transform the original domain into one 
(an upper half-plane) where we could construct another analytic function whose imaginary 
part satisfied Laplace’s equation and the boundary conditions. A natural question is what 
do we do if we cannot find this analytic function in the 7-plane? The next example shows 
an alternative approach. 


e Example 10.11.2 


For our second example of conformal mapping, consider 7 = Vz? 4- a2. To illustrate 
this mapping we have constructed two Argand diagrams; one is for the z-plane while the 
second is for the 7-plane. Figure 10.11.4 shows how a particular boundary in the z-plane 
maps into the 7-plane. The advantage here is that the infinitely thin filament or peg located 
at z — 0 is completely eliminated in the 7-plane. 

One source of concern is the presence of the square root; for any value of z we would 
have two possible solutions. We make the mapping unique by requiring that S(r) > 0. 

To better understand this transformation, Figure 10.11.5 illustrates various lines of 
constant R(r/a) and S(r/a) as a function of z/a and y/a. This figure was constructed 
using the MATLAB code: 


clear; 
^4 compute T for various values of z 


for jj = 1:40 
y = 0.05 * jj; 

for ii = 1:42 
x = 0.05 * (ii-21.5); z = x + i*y; tau(ii,jj) = sqrt(z*z*a*a); 
if (imag(tau(ii,jj)) <= 0) tau(ii,jj) = -tau(ii,jj); end 
X(ii,jj) = x; YGi,ji = y; 
IM(ii,jj) = imag(tau(ii,jj)); REAL(ii,jj) = real(tau(ii,jj)); 
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x/a 


Figure 10.11.5: Lines of constant (t/a) (dashed line) and S(r/a) (solid lines) as a function of x and y 
for the conformal mapping T = Vz? + a?. 


end; end 
^ plot the conformal mapping Equation T = V2? + a? 


figure 

[C,h] = contour(X,Y,IM, [0.1,0.25,0.5,0.75,1,1.5,2] ,? k^); 
clabel(C,’FontSize’,10,’Color’,’k’,’Rotation’ ,0) 
xlabel(’x’,’FontSize’,20); ylabel(’y’,’FontSize’ ,20); 
hold on 

v = [-1,-0.5,-0.25,-0.01,0.01,0.25,0.5,1]; 

[C,h] = contour(X,Y,REAL,v,’--b’); 
clabel(C,’manual’,’FontSize’,10,’Color’,’b’,’Rotation’ ,0) 


As y — oo, lines of constant S(7/a) become parallel to the boundary y = 0. Only for 
smaller values of y and as we approach the peg at z — 0 do these lines deviate strongly 
from the horizontal as they pass over the obstacle. The smaller the value of S(r/a) the 
more they conform to the shape of the obstacle. 

The behavior of lines of constant R(T/a) are more difficult to understand. There are 
two general classes, depending upon whether the absolute value of 3t(7/a) is less or greater 
than 1. When [R(r/a)| > 1 they are clearly orthogonal to constant lines of S(7/a). Positive 
values of 3t(7/a) exist for x > 0 while negative values occur when x < 0. [R(r/a)| < 1 for 
y 2a. 

'This example has two interesting aspects to it. The first is the presence of the square 
root. The second involves how we will find the solution to Laplace's equation in the 7-plane. 

Let us assume that in the original z-plane the solution equals to zero along the entire 
boundary except along the “peg.” There, the solution equals 1. In the 7-plane the solution 
equals zero along the entire boundary except for the segment —a « p « a, where o = 0, 
along which the solution equals 1. Instead of finding an analytic function whose real or 
imaginary part satisfies this boundary condition, we employ the results from Section 11.7. 
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In the present case, we find that 


1 oO 


u(p,c) = - S. ETET d£ (10.11.4) 


+ fran? (222) ew (Ste), (10.11.5) 


Given Equation 10.11.5 we can compute the solution as follows: For a specific value of 
x and y, we find the corresponding value of p and ø. Equation 10.11.5 gives us the solution 
to Laplace's equation at that point and the corresponding x and y. The MATLAB code is: 


clear; a = 1; 


for jj = 1:100 
y = 0.02 * jj; 
for ii = 1:202 
x = 0.02 * (ii-101.5); z = x + i*y; tau = sqrt(z*z*a*a); 
if (imag(tau) <= 0) tau = -tau; end 
sigma = imag(tau); rho = real(tau); 
X(ii,jj) = x; YGi,jj) = y; 
% Equation 10.11.5 
argi = (a-rho)/sigma; arg2 = (atrho)/sigma; 
T(ii, jj) = (atan(argi)*atan(arg2)) / pi; 
end; end 


^4 plot the solution to Laplace’s equation 


figure 

[C,h] = contourf(X,Y,T,[0,0.05,0.2,0.4,0.6,0.8],’k’); 
colormap autumn 

clabel(C,’FontSize’ ,10,’Color’,’k’,’Rotation’ ,0) 
xlabel(’x’,’FontSize’,20); ylabel(’y’, ’FontSize’ ,20); 


Figure 10.11.6 illustrates this solution. 


So far we have not presented a strategy for finding our conformal mappings. One 
method would be to simply experiment with transforms that had been used in similar 
problems. Fortunately, during the 1860s two German mathematicians, E. B. Christoffel!® 
and H. A. Schwarz,!" developed a very popular method of mapping a polygon into a half 
plane. Example 10.11.1 illustrated one of their transforms. Indeed, if we imagine that 
the boundary of the polygon is constructed from a thin wire, the purpose of the Schwarz- 
Christoffel transformation is to unbend the corners so that the wire becomes straight. 

Our derivation begins by considering a mapping z = f(T) where 


dz ; 
dr C(r — pi)" (T — pa)" ---(r — pu)", (10.11.6) 


16 Christoffel, E. B., 1868: Sul problema delle temperature stazionarie e la rappresentazione di una data 
superficie. Ann. Mat. Pura Appl, Series 2, 1, 89-103; Christoffel, E. B., 1870: Sopra un problema 
proposto da Dirichlet. Ann. Mat. Pura Appl., Series 2, 4, 1-9. 


17 Schwarz, H. A., 1868: Über einige Abbildungsaufgaben. J. Reine Angew. Math., 70, 105-120. 
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Figure 10.11.6: The solution of Laplace’s equation when the solution (potential) along the boundary 
equals zero except along the peg located at x = 0. There the solution (potential) equals one. 


and p1, p2,..., p, are any n points arranged in order along the real axis in the 7-plane such 
that pı < po < ... < py. Here the k;'s are real constants and C is a real or complex 
constant. By taking the logarithm of both sides of Equation 10.11.6 we find that 


d 
L3 = log(C) + kı log(T — p1) + kalog(r — po) +--+ k,log(r — py). | (10.11.7) 


We have assumed that the principal value!? of each logarithm is taken. The local magnifi- 
cation factor of the mapping from the r-plane to the z-plane equals dz/d7 while the angle 
of dz/dr gives the angle through which a small portion of the mapped curve in the 7-plane 
is rotated by the mapping. This angle is given by 


dz 


(S) = ý (C) + k14 (T — p1) + k24 (T — pa) +++ + kn 4 (T — pa). (10.11.8) 


Equation 10.11.8 follows by first taking the imaginary part of Equation 10.11.7 and then 
noting that 4 (C) = S[log(C)]. 

Let the point (p,c) = (—oo,0) in the 7-plane be mapped into the point z* in the 
z-plane. See Figure 10.11.7. If we consider the image of a point p as it moves to the right 
along the negative real axis in the 7-plane, then all of the p — p; are real and negative as 
long as p < pı. Hence the angles for all of the p — p; are constant and equal to 7 in Equation 
10.11.8. Therefore, this equation simplifies to 


dz 
(Z) = 4 (C) + (kı + ka +- + Eu). (10.11.9) 
Thus the portion of the p axis to the left of the point pı is mapped into a straight line 
segment, making the angle defined by Equation 10.11.9 with the real axis in the z-plane, 
and extending from z* to zı the image of p — pı. 


18 For the complex number z = re?*, r Æ 0, the principle value of the logarithm is log(z) = In(r) + 6i, 
where 0 must lie between 0 and 27. 
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Figure 10.11.7: Diagram used in the derivation of the Schwarz-Christoffel method. 


Now as the point p crosses the point pı on the real axis, the real number p— pı becomes 
positive so that its angle abruptly changes from 7 to 0. Hence 4 (dz/dr) abruptly decreases 
by an amount kır and then remains constant as 7 travels from pı to po. It follows that 
the image of the segment (p12) in the z-plane makes an angle of —k1x with the segment 
(z*z1). 

Proceeding in this way, we see that each segment (p4,05,41) is mapped into a line 
segment (Zn, 2n41) in the z-plane, making the angle of —k,7 with the segment previously 
mapped. Thus, if the interior angle of the resultant polynomial contour at the point zn is 
to have the magnitude an, we must set 7 — Qn = —k,z, or kn = A/a — 1 in Equation 
10.11.6. After an integration, we then conclude that the mapping 


" 

z=0 J (n — X (n — p2) -- (n — pa dn + K, (10.11.10) 
where the arbitrary complex constants C and K map the real axis ø = 0 of the r-plane into 
a polynomial boundary in the z-plane in such a way that the vertices z1,22,...,z4 with 
interior angles 04,02,...,c, are the images of the points p1, po, . .. , pn. 


For the final segment 7 — p > pn the numbers 7 — p; are all real, positive, and equal to 
zero, so that this segment is rotated through the angle 


A(dz/dr) = 4(C), P> fn. (10.11.11) 
For a closed polynomial the sum of the interior angles is 
0 -- 03 +++» + On = (n — 2)r. (10.11.12) 


Therefore, 
—2 
ki +k +- + kn = (n—2)r n= —2. (10.11.13) 
T 


Thus, according to Equations 10.11.8 and 10.11.11, the two infinite segments of the line 
c = 0 are rotated through the angle 4 (C) — 2r and 4 (C), as is clearly necessary for a 
closed figure. 
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p,-0 zl 


c’ A B C 
Figure 10.11.8: The complex z- and 7-planes used in Example 10.11.4. 


What roles do C and K play? Because C is often complex, this constant introduces 
any necessary magnification and rotation of the transformation so that amy prescribed 
polynomial in the z-plane is made to correspond point by point to the real axis ø = 0 in 
the 7-plane. In fact, this correspondence can be set up in infinitely many ways, in that 
three of the numbers p1, p», ..., Pn can be determined arbitrarily. Finally, the mapping can 
be shown to establish a one-to-one correspondence between points in the interior of the 
polygon in the z-plane and points in the upper half of the r-plane. 


e Example 10.11.3 
Let us derive the conformal mapping used in Example 10.11.2. Referring back to Figure 
10.11.4, we see that ay = 7/2, ky = —1/2, and pı = —a at 21 = 07; ag = 2a, k2 = 1, and 


p2 = 0 at z2 = ai; and a3 = 7/2, ka = —1/2, and p3 = a at z3 = 0*. Therefore, from 
Equation 10.11.6, 


(10.11.14) 


Integrating this differential equation, 


z= Ov T? — a? 4- K. (10.11.15) 


Because the point pı = —a corresponds to z = 0^, K = 0. Similarly, at p2 = 0, we have 
that 
ai = C4/—a?, Or C — 1. (10.11.16) 


Therefore, the conformal mapping is given by z = VT? — a?, or T = V2? + a?. 


e Example 10.11.4 


Consider the triangle ABC located in the z-plane as shown on Figure 10.11.8. Here 
we desire to map the interior space of this triangle into the upper half of the 7-plane. At 
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y Z-plane 


p;= -1 p,= 0 p;= 1 


Figure 10.11.9: The complex z- and 7r-planes used in Example 10.11.5. 


point C, points along the boundary and to the left of C are to be mapped out to —oo in 
the r-plane while points along the boundary and to the right of C are mapped to +oo. 
From Equation 10.11.6 we have that 


d 
= SOG 18/871 = oft — hl, (10.11.17) 
- 


Integrating this differential equation, 


z= c f ne! ™1 1 — g)P/7— dn + K. (10.11.18) 


Because we want the points T = 0 and z = 0 to correspond to each other, K = 0. On the 
other hand, if we wish 7 = 1 and z = 1 to correspond, Equation 10.11.18 yields 


c [ elt - fia = CLE =1, (10.11.19) 
where T(-) is the gamma function defined by 
T(z) = jus dt. (10.11.20) 
Consequently, 
C= DAT (10.11.21) 
i z= a a qe Pada, (10.11.22) 


A noteworthy aspect of this example is that the conformal mapping is given by an integral 
and not some analytic expression. 
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Figure 10.11.10: The complex z- and 7-planes used in Example 10.11.6 with a < 1. 
e Example 10.11.5 


Consider the domain lying in the upper half of the z-plane except for a triangular section 
BCD shown in Figure 10.11.9. We wish to construct the Schwarz-Christoffel transformation 
that maps this domain into the upper half of the 7-plane. From Equation 10.11.6 we have 
that 


dz 


= C (r3) il nu a m Eee: qued mt (10.11.23) 
- 
rear T2a/m 


= C’ = à 10.11.24 
C (72 — 1)e/7 C - sias ( ) 


Integrating this differential equation, 


UP 
z= cf zi ü dn + K. (10.11.25) 


(1 = n?) 


If we want the point T = 0 to correspond to the point z = ki, then K = ki. On the other 
hand, if the point 7 = 1 corresponds to z = a, then 


jer 
a= cf qa d ges dnt ki. (10.11.26) 


Solving for C, 
"m vm(a — ki) 


à 10.11.27 
T (a/r + 3) T(1— a/n) ( ) 
Therefore, the final answer is 
— ki T 2o 
ges iC L | i dr + ki. (10.11.28) 
T (a/r + i) T(1— o/z) Jo (1— y2)o/7 


502 Advanced Engineering Mathematics with MATLAB 


y [o] 
z-plane T- plane 


ai 


nva EE [ouo nx 
77 i A "2 D ` 


Wa S 


Figure 10.11.11: The complex z- and 7-planes used in Example 10.11.7. 
e Example 10.11.6 


Consider the domain within the L-shaped boundary shown in Figure 10.11.10. We wish 
to construct the Schwarz-Christoffel transform that maps the interior into the upper half 
of the 7-plane. Note that we broke the boundary in such a manner that points slightly to 
the left of point A are mapped to —oo while points slightly below the point A are mapped 
to +00. 

Because a « 1, Equation 10.11.6 gives 


dz 


qe =r + 13-9 p +a) ee a — a) (r — 1). (10.11.29) 


Integrating this differential equation, 


n — 1) dn ES i (n — 1) dy 
e eons i) n - m TET TRES (10.11.30) 


where p? = 1/a?. To compute C and K, we would need further information. 


e Example 10.11.7 


Let us derive the conformal mapping, Equation 10.11.2, used in Example 10.11.1. The 
z— and r—planes are shown in Figure 10.11.11. From this figure we see that a; = 37/2, 
a2 = 1/2, a3 = 1/2, pı = 1, p2 = 0^, and p3 = 0*. This yields 


c = K(r — 189/0916. — g-(/e9-1(* — g*ye/G0-1 2 KYT= (10.11.31) 
T T 


Integrating Equation 10.11.31, we find that 


E (pain ok [v1 tog ( | +O. 
(10.11.32) 
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Because at T = 1, z = a/2, we have C = a/2. 
The computation of K is more complicated. Referring to Figure 10.11.11, we note that 


Y dr. (10.11.33) 
7 


C c 
vri 
f è= gy 
B B' 


Setting T = re” with r — 0, Equation 10.11.34 becomes 


9 refi —1. 


5 =K lim ir e” d9 = Kr. (10.11.34) 


r>0 J. rei 
Thus K = a/(2-) and we recover Equation 10.11.2. 
Problems 

1. Verify that the function 7 = e? maps the strip 0 < S(z) < 7 into the half-plane S(7) > 0. 


2. Verify that the function 7? = 1 — e? maps the strip —7 < S(z) < m, except for the 
negative real axis, into the upper half of the 7-plane. 


3. Use the Schwarz-Christoffel method to find the conformal mapping that maps the quarter 
plane z > —7, y < v into the upper half of the 7-plane. We require that the point (—7, 7) 
in the z-plane maps to the point (0,0) in the 7-plane. 


4. Use the Schwarz-Christoffel method to find the conformal mapping that maps the sector 
lying between the x-axis and the line 0 = 7/3 into the upper half of the 7-plane. We require 
that the point (0,0) in the z-plane maps to the point (0,0) in the 7-plane. 
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Problem 7 


5. Use the Schwarz-Christoffel method to find the conformal mapping that maps the portion 
of the z-plane defined by 0 < r < œ, 0 < 0 < 77/4 into the upper half of the -plane. We 
require that the points (0,0) and (1,0) in the z-plane map to the points (0,0) and (1,0) in 
the r-plane, respectively. 


6. Use the Schwarz-Christoffel method to find the conformal mapping that maps the domain 
|x| < a, 0 < y into the upper half of the 7-plane. Let the point (—a,0) become the point 
(—1,0) while the point (a, 0) becomes the point (1,0). 


7. Use the Schwarz-Christoffel method to find the conformal mapping that maps the region 
xz >0,0< y< a into the upper half of the 7-plane. We require that the point (0, a) maps 
to (—1,0) in the 7-plane while the point (0,0) maps to (1,0) in the 7-plane. 


8. Use the Schwarz-Christoffel method to find the conformal mapping that maps the region 
shown in the figure into the upper half of the 7-plane. We require that the points (0,0) and 
(0, a) in the z-plane map to the points (0,0) and (1,0) in the 7-plane, respectively. 
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Problem 9a 


9. Referring to the figure entitled Problem 9b, construct a transform between a z-plane 
which has a barrier that runs parallel to the z-axis from z = L 4- * Li to coo 4- 1 Là anda 
T-plane that has no barrier. 


Step 1: Referring to the figure entitled Problem 9a, use the Schwarz-Christoffel method to 
show that the transform is given by 
dz 


rF Cr (7 - 1)", 


where kı = —a/(27) and k2 = a/r — 1. 


Step 2: Consider the situation as the points B and D in the z-plane move out to infinity 
(so that a — 27). Show that the transform approaches 


dz cid 


dr T! 


or 


z = C[r — log(7)] + K. 


Here we have taken the principal branch of the logarithm so that log(z) = In(|z|) +70 where 
0 € 0 € 7. (We do not require that 0 < 0 < 2m because we are always in the upper 
half-plane.) See the figure entitled Problem 9b. 
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Step 3: Following Example 10.11.7, consider the area around 7 — 0. Show that 


dc Sede, 
T 


where 7 = re^. Integrating from point Bi to point Bs, show that C = L. 


Step 4: To compute K, note that if the point C, located at z = L + 1 Li, corresponds to 
the point C”, located at 7 = 1, then K = c Li. 


10. Use conformal mapping to solve Laplace's equation for the infinite strip —oo « x « oo, 
0 <y € m. The solution equals zero everywhere along the boundary except for x > 0, 
y = 0, where u(z,0) = 1. 


Step 1: Consider the mapping T = e^. Show that p = e*cos(y) and e = e*sin(y). In 
particular, (00,7) — (—09,0), (0,7) — (—1,0), (-oo,y) — (0,0), (0,0) — (1,0), and 
(00,0) — (oco, 0). 


Step 2: Using the Fourier method from Section 11.7, show that 


lia l-z 1 y 
= |>- tan! -1--tan | — }. 
ucro a 


Step 3: Show that 


1 z gi 
u(z,y)-1— —tan ! LORI, ; 
T e7 cos(y) — 1 
11. Use conformal mapping to solve Laplace's equation for a pie-shaped sector in the first 
quadrant. See Figure 10.11.12. The solution equals zero along the entire boundary except 
for 0 < z < 1 where it equals one. 


a 


Step 1: Show that the mapping z = r°/™ or T = z7/* maps the pie-shaped sector into the 


half-plane S(7) > 0. See Figure 10.11.12. 
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Figure 10.11.12: The conformal mapping between the z-plane and 7-plane achieved by the conformal 
mapping T = gna, 


Step 2: Using the Fourier method from Section 11.7, show that 


ES 


1 
u(p,o) = Leo! ( = 


Step 3: Show that 


T/a 
Wee ee |: Mul 
T 


sin(70/a) 


where x = r cos(0) and y = r sin(0). 


12. Use conformal mapping to solve Laplace’s equation for the semi-infinite strip 0 € x <a, 
0 € y< oo, where u(z,0) — 1, 0 € z € a, and u(0,y) = u(a,y) 20,0 y< oo. 


Step 1: Consider the mapping 7 = — cos(mz/a). Show that 
p = —cos(zx/a)cosh(zy/a), and o —sin(zr/a)sinh(zy/a). 


In particular, (0,00) — (—09,0), (0,0) > (—1,0), (a/2,0) — (0,0), (a,0) > (1,0), and 
(a, oo) — (oo, 0). 


Step 2: Using the Fourier method from Section 11.7, show that 


a) 


pe 
u(p,c) = = cot 1 Dc 


Step 3: Show that 


seat = fnr [p (E) =o (EE)] fos (5) sma (Z) 
E = tan [sin(™) /sin(™)| 
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Step 4: In the case that boundary conditions read u(0,y) = u(a,y) = 1 for 0 € y « oo 
and u(z,0) = 0 for 0 € x € a, how could you use the solution in Step 3 to solve this new 
problem? 


Further Readings 


Ablowitz, M. J., and A. S. Fokas, 2003: Complex Variables: Introduction and Applications. 
Cambridge University Press, 660 pp. Covers a wide variety of topics, including complex 
numbers, analytic functions, singularities, conformal mapping and the Riemann-Hilbert 
problem. 


Carrier, G. F., M. Krook, and C. E. Pearson, 1966: Functions of a Complex Variable: 
Theory and Technique. McGraw-Hill Book Co., 438 pp. Graduate-level textbook. 


Churchill, R. V., 1960: Complex Variables and Applications. McGraw-Hill Book Co., 297 
pp. Classic textbook. 


Flanigan, F. J., 1983: Complex Variables. Dover, 364 pp. A crystal clear exposition and 
emphasis on an intuitive understanding of complex analysis. 
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Chapter 11 
The Fourier Transform 


In the previous chapter we showed how we could expand a periodic function in terms 
of an infinite sum of sines and cosines. However, most functions encountered in engineering 
are aperiodic. As we shall see, the extension of Fourier series to these functions leads to the 
Fourier transform. 


11.1 FOURIER TRANSFORMS 


The Fourier transform is the natural extension of Fourier series to a function f(t) of 
infinite period. To show this, consider a periodic function f(t) of period 2T that satisfies 


the so-called Dirichlet’s conditions. If the integral de | f(t)| dt exists, this function has the 
complex Fourier series 


O= X eT, (11.1.1) 
where 
1 E i t/T g 
= — tje aT t. 11.1.2 
m= zr | fe (11.1.2) 


Equation 11.1.1 applies only if f(t) is continuous at t; if f(t) suffers from a jump discon- 
tinuity at t, then the left side of Equation 11.1.1 equals i[f(t*) + f(t-)], where f(t^) = 


2 
lim, ,d f(x) and f(t^) = lim, ,,- f(x). Substituting Equation 11.1.2 into Equation 11.1.1, 
lox m[ /T 
Ese innt —inmaz dz. 11.1.3 
fog Y, emm | seyret (11.1.3) 


1 A function f(t) satisfies Dirichlet’s conditions in the interval (a,b) if (1) it is bounded in (a,b), and 


(2) it has at most a finite number of discontinuities and a finite number of maxima and minima in that 
interval. 
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Let us now introduce the notation wn = nz/T so that Awn = wn+1 — Wn = T/T. Then, 


I - > XO Funjet Awn, (11.1.4) 
where 
T 
F(wn) s f(x)e "da. (11.1.5) 
=F 


As T — œ, wn approaches a continuous variable w, and Aw, may be interpreted as the 
infinitesimal dw. Therefore, ignoring any possible difficulties,? 


f= sf Foedus, (11.1.6) 
and 
F(w) -f. fe tdt. (11.1.7) 


Equation 11.1.7 is the Fourier transform of f (t) while Equation 11.1.6 is the inverse Fourier 
transform that converts a Fourier transform back to f(t). Alternatively, we may combine 
Equation 11.1.6 and Equation 11.1.7 to yield the equivalent real form 


f(t) = Ju u. Pose a ax} d. (11.1.8) 


Hamming? suggested the following analog in understanding the Fourier transform. Let 
us imagine that f(t) is a light beam. Then the Fourier transform, like a glass prism, breaks 
up the function into its component frequencies w, each of intensity F(w). In optics, the 
various frequencies are called colors; by analogy the Fourier transform gives us the color 
spectrum of a function. On the other hand, the inverse Fourier transform blends a function's 
spectrum to give back the original function. 

Most signals encountered in practice have Fourier transforms because they are abso- 
lutely integrable, since they are bounded and of finite duration. However, there are some 
notable exceptions. Examples include the trigonometric functions sine and cosine. 


? For a rigorous derivation, see Titchmarsh, E. C., 1948: Introduction to the Theory of Fourier Integrals. 
Oxford University Press, Chapter 1. 


3 Hamming, R. W., 1977: Digital Filters. Prentice-Hall, p. 136. 
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e Example 11.1.1 


Let us find the Fourier transform for 


jb [il<a, 
f(t- b Issa: (11.1.9) 
From the definition of the Fourier transform, 
F(w) =| ew a+ f ie a+ f Oe! dt (11.1.10) 
wai ,—wat 2 si 
Es ED oo tne, (11.1.11) 
We W 


where sinc(z) = sin(z)/x is the sinc function. 
Although this particular example does not show it, the Fourier transform is, in general, 
a complex function. The most common method of displaying it is to plot its amplitude and 
phase on two separate graphs for all values of w. Another problem here is ratio of 0/0 when 
w = 0. Applying L'Hópital's rule, we find that F(0) = 2. Thus, we can plot the amplitude 
and phase of F(w) using the MATLAB script: 
clear; % clear all previous computations 
omegan = [-20:0.01:-0.01]; % set up negative frequencies 
omegap [0.01:0.01:20]; % set up positive frequencies 
^ compute Fourier transform for negative frequencies 
f omegan - 2.*sin(omegan)./omegan; 
^4 compute Fourier transform for positive frequencies 
f_omegap = 2.*sin(omegap)./omegap; 
^4 concatenate all of the frequencies 
omega = [omegan,0,omegap] ; 
^ bring together the Fourier transforms found 
^ at positive and negative frequencies 
f omega = [f omegan,2,f omegap]; 
amplitude = abs(f_omega); ^ compute the amplitude 
phase = atan2(0,f omega); % compute the phase 
clf; 4 clear all previous figures 
^4 plot frequency spectrum 
subplot(2,1,1), plot(omega,amplitude) 
^4 label amplitude plot 
ylabel(’ |F(\omega) | /a? ,? FontSize’ ,15) 
Subplot(2,1,2), plot(omega,phase) ^ plot phase of transform 
ylabel(’phase’,’FontSize’,15) % label amplitude plot 
xlabel(’\omega’,’FontSize’,15) % label x-axis. 


Figure 11.1.1 shows the output from the MATLAB script. Of these two quantities, the 
amplitude is by far the more popular one and is given the special name of frequency spectrum. 


From the definition of the inverse Fourier transform, 


m sin(wa) iwt 1, |t| < a, 
i Em dw = 0, lt] > a. (11.1.12) 
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Figure 11.1.1: Graph of the Fourier transform for Equation 11.1.9. 


An important question is what value does f(t) converge to in the limit as t — a and 
t — —a? Because Fourier transforms are an extension of Fourier series, the behavior at a 
jump is the same as that for a Fourier series. For that reason, f(a) = $[f(a*)+ f(a )] = 4 


and f(—a) = 3[f(-a*) + f(-a7)] = 5- 
e Example 11.1.2: Dirac delta function 


Of the many functions that have a Fourier transform, a particularly important one is 
the (Dirac) delta function. For example, in Section 11.6 we will use it to solve differential 
equations. We define it as the inverse of the Fourier transform F(w) = 1. Therefore, 

~ Or 


ó(t) -f et qu, (11.1.13) 


— oo 


To give some insight into the nature of the delta function, consider another band-limited 
transform 


fL [wf <Q, 
Fo(w) = b lw] > 9, (11.1.14) 
where Q is real and positive. Then, 
po. Q sin(Qt 
fo(t) = x] pig tS (11.1.15) 
2T Jg Tm Qt 


Figure 11.1.2 illustrates fo(t) for a large value of Q. We observe that as Q — oo, fo(t) 
becomes very large near t = 0 as well as very narrow. On the other hand, fo(t) rapidly 
approaches zero as |t| increases. Therefore, the delta function is given by the limit 


sin(Qt) f oo, t — 6, 
v 0, t xz 0. 


ó(t) = lim 


(11.1.16) 
0—oo Ti 


^ Dirac, P. A. M., 1947: The Principles of Quantum Mechanics. Oxford University Press, Section 15. 
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fo(t) 


Figure 11.1.2: Graph of the function given in Equation 11.1.15 for Q = 300. 


Because the Fourier transform of the delta function equals one, 
/ ó(t)e tdt = 1. (11.1.17) 
Since Equation 11.1.17 must hold for any w, we take w — 0 and find that 


T ó(t) dt = 1. (11.1.18) 
Thus, the area under the delta function equals unity. Taking Equation 11.1.16 into account, 
we can also write Equation 11.1.18 as 
b 
ó(t) dt = 1, a,b » 0. (11.1.19) 


—a 


Finally, from the law of the mean of integrals, we have the sifting property that 


f rose toa = f(to), (11.1.20) 


if a < to < b. This property is given its name because 0(t — to) acts as a sieve, selecting 
from all possible values of f(t) its value at t = to. 

We can also use several other functions with equal validity to represent the delta func- 
tion. These include the limiting case of the following rectangular or triangular distributions: 


1 € 

PE t < 25 i e: t) 

E JE. or  ó(t)—lim4 * (1 eye USS (11.1.21) 
0, >s iid 0, lt] > e, 


6(t) = lim ——————. (11.1.22) 


Note that the delta function is an even function. 
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The Fourier Transforms of Some Commonly Encountered Functions 


f(t), |t| < oo F(w) 
e 1 
1. e “H(t), a»0 f 
a +w 
1 
2. e*H(—-t), a>0 : 
à — wi 
3. te“ H(t), a»0 : 
(a + wi)? 
—1 
4. te“ H (—t), 2-0 
CURSUS. ig (a — wi)? 
n! 
5 ig 9H OL R(a) Sem 2. — 
e (t), R(a) iG (a 4- wi)rtt 
E 2a 
6 € it a0 Uia? 
7 te-“U4l a>0 _ tawi 
, (w2 + a2)? 
1 T 
8. S — e7 le/al 
1+ a?t? al^ 
cos(at) T —|w—a —|wta 
9. 142 Z (e | Ie | l) 
sin(at) T —|w—a —|w+a 
10. ILE £ (e | Lawn l) 
T i t| «a 2 sin(wa) 
| o l>a = 
sin(at) n/a, lw] <a 
12. 
at 0, l| >a 


2 
13. ec, a >0 "i is exp (-=) 
a 4a 


Note: The Heaviside step function H(t) is defined by Equation 11.1.31. 
e Example 11.1.3: Multiple Fourier transforms 


The concept of Fourier transforms can be extended to multivariable functions. Consider 
a two-dimensional function f(x,y). Then, holding y constant, 


G(&. y) =f f(x, y) e da. (11.1.23) 


Then, holding £ constant, 


F(E, n) ef G(£,y) e?" dy. (11.1.24) 


Therefore, the double Fourier transform of f(x,y) is 


F(E, n) 2: d f(x,y) e E+) de dy, (11.1.25) 
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assuming that the integral exists. 
In a similar manner, we can compute f(x,y) given F(£,r) by reversing the process. 
Starting with 


G(£,y) = xf. F(€,n) e" dn, (11.1.26) 
followed by 2 
f(a,y) = 5 G(E,y) e'** dé, (11.1.27) 
we find that 
f(x,y) = aj. A F(£,q) e*t" dé dn. (11.1.28) 


e Example 11.1.4: Computation of Fourier transforms using MATLAB 
The Heaviside (unit) step function is a piecewise continuous function defined by 


H(t—a)= { " : ji R (11.1.29) 


where a > 0. We will have much to say about this very useful function in the chapter on 

Laplace transforms. Presently we will use it to express functions whose definition changes 

over different ranges of t. For example, the “top hat” function Equation 11.1.9 can be 

rewritten f(t) = H(t-- a) — H(t — a). We can see that this is correct by considering various 

ranges of t. For example, if t < —a, both step functions equal zero and f(t) = 0. On 

the other hand, if t > a, both step functions equal one and again f(t) = 0. Finally, for 

—a < t < a, the first step function equals one while the second one equals zero. In this 

case, f(t) — 1. Therefore, f(t) = H(t + a) — H(t — a) is equivalent to Equation 11.1.9. 
'This ability to rewrite functions in terms of the step function is crucial if you want to 

use MATLAB to compute the Fourier transform via the MATLAB routine fourier. For ex- 

ample, how would we compute the Fourier transform of the signum function? The MATLAB 

commands 

>> syms omega t; syms a positive 

>> fourier(?Heaviside(t*a)-Heaviside(t-a)',t,omega) 

>> simplify (ans) 

yields 


ans = 
2*sin(a*omega)/omega 


the correct answer. 
Problems 


1. (a) Show that the Fourier transform of 


2a 
i= ~alt| 0 is F = . 
JOs, a0 is FO) cu 


Using MATLAB, plot the amplitude and phase spectra for this transform. 
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(b) Use MATLAB’s fourier to find Fw). 


2. (a) Show that the Fourier transform of 


4awi 

t)=t =at 0 i F ECT E 
A ) € > ard, E (w) (w? aa a2)? 
Using MATLAB, plot the amplitude and phase spectra for this transform. 
(b) Use MATLAB’s fourier to find Fw). 


3. (a) Show that the Fourier transform of 


f(t) 2 gt a> 0, is F(w) = Ze0(-S) ; 


Using MATLAB, plot the amplitude and phase spectra for this transform. 
(b) Use MATLAB’s fourier to find F (w). 


4. (a) Show that the Fourier transform of 


_ fe”, t « 0, : 7 3 
deos t» 0, ie. HRS CE 


Using MATLAB, plot the amplitude and phase spectra for this transform. 
(b) Rewrite f(t) in terms of step functions. Then use MATLAB’s fourier to find F(w). 


5. (a) Show that the Fourier transform of 


1+i)t 
? 


-( t>0 
f(t) = oa t< 0. 1S F(w) E 


—2i(w 4-1) 
WIFI 


Using MATLAB, plot the amplitude and phase spectra for this transform. 
(b) Rewrite f(t) in terms of step functions. Then use MATLAB’s fourier to find F (w). 


6. (a) Show that the Fourier transform of 


. f cos(at), [t| « 1, A . Sin(v— a) , sin(w +a) 
st) ={ 0, it| > 1, is F(w) = ea ae aoe uu 


Using MATLAB, plot the amplitude and phase spectra for this transform. 
(b) Rewrite f(t) in terms of step functions. Then use MATLAB’s fourier to find F(w). 


7. (a) Show that the Fourier transform of 


f(t) = ire 0<t<1, 


0, otherwise, 


The Fourier Transform 517 


l[1-—cos(w —1) cos(w - 1) — J D E —1)  sin(w 4 1) 


Flw) = 
(ERR: mem oed Bl ure TEX 


Using MATLAB, plot the amplitude and phase spectra for this transform. 
(b) Rewrite f(t) in terms of step functions. Then use MATLAB’s fourier to find F(w). 


8. (a) Show that the Fourier transform of 


fü Lo [t| <a, ; F(w) = 2icos(wa) 2i sin(wa) 


0, [t| > a, W wa 
Using MATLAB, plot the amplitude and phase spectra for this transform. 
(b) Rewrite f(t) in terms of step functions. Then use MATLAB’s fourier to find F(w). 


9. (a) Show that the Fourier transform of 


Gay, em 4 Acos(wa) 4sin(wa) , 2sin(wa) 

t) = , , F(w)- | 
f { 0, [t| >a, dic, wa wa? w 

Using MATLAB, plot the amplitude and phase spectra for this transform. 

(b) Rewrite f(t) in terms of step functions. Then use MATLAB’s fourier to find F (w). 


10. (a) Show that the Fourier transform of 


is F(w) = 


bs D Spr ; 2e 1" [sin(wr) 
DES ? — , u 
fe) { 0, otherwise, re 2 cos(wT) 


Using MATLAB, plot the amplitude and phase spectra for this transform. 
(b) Rewrite f(t) in terms of step functions. Then use MATLAB’s fourier to find F(w). 


11. (a) Show that the Fourier transform of 


_ f[1- (t/ay?, [t| <a, . 4sin(wa) — daw cos(wa) 
f(t) = { 0. Ml a, and — F(w) = EX. 


Using MATLAB, plot the amplitude and phase spectra for this transform. 
(b) Rewrite f(t) in terms of step functions. Then use MATLAB’s fourier to find F(w). 


12. The integral representation? of the modified Bessel function K,(-) is 


| T (v i) (2a)” [°° | cos(wt) 
Ko TE h Erara” 


5 Watson, G. N., 1966: A Treatise on the Theory of Bessel Functions. Cambridge University Press, p. 
185. 
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where I'(-) is the gamma function, v > 0 and a > 0. Use this relationship to show that 


| 1 | — JwT (3) Kr (ala ) 
EPAR] TP+) | 


13. Show that the Fourier transform of a constant K is 2rô(w)K 


14. Show that 7 
/ TÓ(t— T) dr —- t|H(t — a) - H(t—)]. 


Hint: Use integration by parts. 


15. For the real function f(t) with Fourier transform F(w), prove that |F(w)| = |F(—w)| 
and the phase of Fw) is an odd function of w. 


11.2 FOURIER TRANSFORMS CONTAINING THE DELTA FUNCTION 


In the previous section we stressed the fact that such simple functions as cosine and 
sine are not absolutely integrable. Does this mean that these functions do not possess a 
Fourier transform? In this section we shall show that certain functions can still have a 
Fourier transform even though we cannot compute them directly. 

The reason why we can find the Fourier transform of certain functions that are not 
absolutely integrable lies with the introduction of the delta function because 


i, (uw — woje dw = ett (11.2.1) 


for all t. Thus, the inverse of the Fourier transform ó(w — wo) is the complex exponential 
etot [2m or l 
F (et) = 2nó(w — wo). (11.2.2) 


This immediately yields the result that 
F (1) = 2rô(w), (11.2.3) 


if we set wọ = 0. Thus, the Fourier transform of 1 is an impulse at w = 0 with weight 
2r. Because the Fourier transform equals zero for all w Æ 0, f(t) = 1 does not contain a 
nonzero frequency and is consequently a DC signal. 

Another set of transforms arises from Euler’s formula because we have that 


J[sin(wot)] = [F (e*t) — F (e7™™+t)] /(2i) (11.2.4) 
= T [ó(to — wo) — ô(w + wo)] /i (11.2.5) 
= —qió(w — wo) + tiô(w + wo) (11.2.6) 
and 
F [cos(wot)] = 5 [F (et) + F (e~ #0") | = m [6(w — wo) + 6(w + wo)] . (11.2.7) 


Note that although the amplitude spectra of sin(wot) and cos(wot) are the same, their phase 
spectra are different. 
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Let us consider the Fourier transform of any arbitrary periodic function. Recall that 
any such function f(t) with period 2L can be rewritten as the complex Fourier series 


FOS es (11.2.8) 
where wo = 7/L. The Fourier transform of f(t) is 
F(w) = FIFE a 2TCnôlw — nwo). (11.2.9) 


Therefore, the Fourier transform of any arbitrary periodic function is a sequence of impulses 


with weight 27c,, located at w = nwo with n = 0, +1, 2, .... Thus, the Fourier series and 
transform of a periodic function are closely related. 
e Example 11.2.1: Fourier transform of the sign function 
Consider the sign function 
1, t0, 
sgn(t) = 0, t — 0, (11.2.10) 
—1, t « 0. 


The function is not absolutely integrable. However, let us approximate it by e~‘!lsgn(t), 
where e is a small positive number. This new function is absolutely integrable and we have 
that 


0 oo 
: ; —1 1 
F[|sgn(t)] = lim -/ gig hp «f pog um a| = lim ( — + - ) : 
c0 t 0 «30 L\E— w E€ +w 
(11.2.11) 
If w £0, Equation 11.2.11 equals 2/iw. If w = 0, Equation 11.2.11 equals 0 because 


] expe . il 
lim (= + ~) =0. (11.2.12) 
e>0 € € 
Thus, we conclude that 
F[sgn(t)] = { 2 f o; (11.2.13) 


e Example 11.2.2: Fourier transform of the step function 
An important function in transform methods is the (Heaviside) step function 


1, t0, 
A) = Is A (11.2.14) 


In terms of the sign function it can be written 


H(t) = 5 isgn(t). (11.2.15) 
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Because the Fourier transforms of 1 and sgn(t) are 27d(w) and 2/iw, respectively, we have 
that 


F|H(t)] = rô(w) + ES (11.2.16) 


These transforms are used in engineering but the presence of the delta function requires 
extra care to ensure their proper use. 


Problems 


1. Verify that 


F (sin(wot) AH (t)) = —5——;5 + Tg + wo) — ó(w — wo)]. 


2. Verify that 
iw T 
F[cos(wot)H(t)] = -=z + zlé(w + wo) + 6(w — wo)]. 
w — W 2 


3. Using the definition of Fourier transforms and Equation 11.2.17, show that 


I e™™t dt = 15(w) — a or | e™t dt = mó(w) + E, 
0 w 0 w 
4. Following Example 11.2.1, show that 


2 2wi 
F[sgn(t) sin(wot)] = mn and  Ff[sgn(t) cos(wot)] = " 
0 


11.3 PROPERTIES OF FOURIER TRANSFORMS 


In principle we can compute any Fourier transform from its definition. However, it is 
far more efficient to derive some simple relationships that relate transforms to each other. 
This is the purpose of this section. 


| Linearity 


If f(t) and g(t) are functions with Fourier transforms F(w) and G(w), respectively, 
then 


Fle f (t) + cog(t)] = a F(w) + e3G(o), (11.3.1) 


where cı and cg are (real or complex) constants. 
This result follows from the integral definition 


Fi ft) teg] = f lef) aote at (11.3.2) 
=¢ is fedt + es i. g(t)e dt (11.3.3) 


= a F(w) + @G(w). (11.3.4) 
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Figure 11.3.1: The amplitude and phase spectra of the Fourier transform for cos(2t) H(t) (solid line) and 
cos[2(t — 1) H(t — 1) (dashed line). The amplitude becomes infinite at w = +2. 


| Time shifting 


If f(t) is a function with a Fourier transform F(w), then JF(f(t — 7)] = e *** F(w). 
'This follows from the definition of the Fourier transform 


FUf(t-T7)-2 A f(t - r)e “dt = ie f(e 9 n da (11.3.5) 


— eT it T faje dr = eT F(u). (11.3.6) 


e Example 11.3.1 


The Fourier transform of f(t) = cos(at)H(t) is F(w) = iw/(a? — w?) + n[6(w +a) + 
(w — a)]/2. Therefore, 


F(cos[a(t — k)| H(t — k)} = e~” F[cos(at) H (t)], (11.3.7) 


Or 
awe '^5 T 


7 (cos[a(t — K)]H(t — k)} = 535 + 5€ BO + a) + 5(w — a). (11.3.8) 


In Figure 11.3.1 we present the amplitude and phase spectra for cos(2t) H(t) (the solid 
line) while the dashed line gives these spectra for cos[2(t — 1)| H(t — 1). This figure shows 
that the amplitude spectra are identical (why?) while the phase spectra are considerably 
different. 
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| Scaling factor 


Let f(t) be a function with a Fourier transform Fw) and k be a real, nonzero constant. 
Then F[f(kt)] = F(w/k)/|kl. 
From the definition of the Fourier transform: 


F[f(kt)| = T f(kt)e~*dt = i 5 f(z)e dge = ar (=) (11.3.9) 


e Example 11.3.2 


The Fourier transform of f(t) = e * H(t) is F(w) = 1/(1 + wi). Therefore, the Fourier 
transform for f(at) = e "* H(t), a > 0, is 


F|f(at)) = (i) (7) = — (11.3.10) 


To illustrate this scaling property we use the MATLAB script 


clear; % clear all previous computations 

omega = [-10:0.01:10]; % set up frequencies 

^4 real part of transform with a - 1 

fir omega = 1./(1*omega.*omega); 

^ imaginary part of transform with a = 1 

fili omega = - omega./(1+omega.*omega) ; 

% real part of transform with a = 2 

f2r omega = 2./(4tomega. *omega) ; 

% imaginary part of transform with a = 2 

f2i_omega = - omega./(4+omega.*omega) ; 

^4 compute the amplitude of the first transform 

ampli = sqrt(fir omega.*fir omega + fli_omega.*f1i_omega) ; 
% compute the amplitude of the second transform 

ampl2 = sqrt(f2r omega.*f2r omega + f2i omega.*f2i omega); 
^4 compute phase of first transform 

phase1 = atan2(fii omega,fir omega); 

^4 compute phase of second transform 

phase2 - atan2(f2i omega,f2r omega); 

clf; 4 clear all previous figures 

^4 plot amplitudes of Fourier transforms 

subplot(2,1,1), plot(omega,ampli,omega,ampl2,?--") 
ylabel(’|F(\omega) |’, ’FontSize’,15) % label amplitude plot 
^ plot phases of Fourier transforms 

subplot (2,1,2), plot (omega, phase1,omega,phase2,’--’) 
ylabel(?phase?,?FontSize?,15) ^ label amplitude plot 
xlabel(’\omega’,’FontSize’,15) % label x-axis 
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Figure 11.3.2: The amplitude and phase spectra of the Fourier transform for e^ H(t) (solid line) and 


e-?' H(t) (dashed line). 


to plot the amplitude and phase when a — 1 and a — 2. Figure 11.3.2 shows the results 
from the MATLAB script: The amplitude spectra decreased by a factor of two for e7% H(t) 


compared to e H(t) while the differences in the phase are smaller. 


| Symmetry 


If the function f(t) has the Fourier transform F(w), then .F[F(t)] = 2x f (—w). 


From the definition of the inverse Fourier transform, 


1 


~ 9m 


f(t) 2 — F(w)e*tdw = > T F(z)e'^ da. 


— oo 


Then 


2r f(—w) = T F(x)e "dr = T F(t)e **!dt = [F(t)]. 


e Example 11.3.3 


The Fourier transform of 1/(1 + t?) is te~|*!. Therefore, 


F (ne-!!) = "o Or F (e-*?) = s à 
1+w? 1+w? 


| Derivatives of functions 


(11.3.11) 


(11.3.12) 


(11.3.13) 


Let f? (t), k = 0,1,2,...,n—1, be continuous and f (t) be piecewise continuous. Let 
ru «Xe b» 0,0 < t< œ; |f| < Me™ a 0,00 € t X 0,k = 0,1,...,n. 


Then, [f ^? (t)] = (iw) F (w). 
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ftt) 


spectrum 


'-20.0 -10.0 0.0 10.0 20.0 


Figure 11.3.3: The (amplitude) spectrum of a rectangular pulse Equation 11.1.9 with a half width a — 10 
that has been modulated with cos(5t). 


We begin by noting that if the transform F[f’(t)] exists, then 


JF" (t) zz sige tat =f f'(t)e*:! [cos(w,t) — isin(w,t)] dt (11.3.14) 


= (—w; + iwr) [. f (te”**[cos(w,t) — isin(w,t)] dt (11.3.15) 
= iw D Fedt = iwF (w). (11.3.16) 

Finally, 
FISO E] = is Ff? O] = Goy FIOI H] = -= (iw) F (w). (11.3.17) 


e Example 11.3.4 
The Fourier transform of f(t) = 1/(1 + t?) is F(w) = re™”l. Therefore, 


dr 


| Modulation 


In communications a popular method of transmitting information is by amplitude mod- 
ulation (AM). In this process the signal is carried according to the expression f(t)e/*^*, 
where wo is the carrier frequency and f(t) is an arbitrary function of time whose amplitude 
spectrum peaks at some frequency that is usually small compared to wo. We now show that 


| =iwre "ll, o F f eTel, (11.3.18) 


t = TWT 
1+2] 2 
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Figure 11.3.4: The (amplitude) spectrum of a rectangular pulse Equation 11.1.9 with a half width a = 10 
that has been modulated with cos(t/2). 


the Fourier transform of f(t)e'*» is F(w— wo), where F (w) is the Fourier transform of f (t). 


We begin by using the definition of the Fourier transform, or 
F[f (tet = J ftit edt = / f(t)e-*«-"9»'qt = F(w—wo). — (11.3.19) 


Therefore, if we have the spectrum of a particular function f(t), then the Fourier transform 
of the modulated function f (t)e'*»* is the same as that for f (t) except that it is now centered 
on the frequency wọ rather than on the zero frequency. 


e Example 11.3.5 


Let us determine the Fourier transform of a square pulse modulated by a cosine wave 
as shown in Figures 11.3.3 and 11.3.4. Because cos(wot) = 1 [e'^"* + e~ 0") and the Fourier 
transform of a square pulse is F(w) = 2sin(wa)/w, 


F[f (t) cos(wot)] = nS SE) y GREEN (11.3.20) 
wW — Wo w + wo 


Therefore, the Fourier transform of the modulated pulse equals one half of the sum of the 
Fourier transform of the pulse centered on wo and —wo. See Figures 11.3.3 and 11.3.4. 

In many practical situations, wo >> a/a. In this case we may treat each term as 
completely independent from the other; the contribution from the peak at w — wo has a 
negligible effect on the peak at w = —wo. 


e Example 11.3.6 
The Fourier transform of f(t) = e" H(t) is F(w) = 1/(b + iw). Therefore, 


le" cos(at) H(t)] = lr (eiate- t i e~ iat ebt) (11.3.21) 
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Figure 11.3.5: The amplitude and phase spectra of the Fourier transform for e~?* H(t) (solid line) and 
e?! cos(4t) H(t) (dashed line). 


1 1 1 
—bt EE 
Fle ^ cos(at) H(t)] = 7 ic "" + Fae Er) (11.3.22) 
a A : (11.3.23) 
~ 2) (b+iw)—ai —(b--iw)-ai m 
_ bc dw (11.3.24) 


(b+ iw)? + a2” 


We illustrate this result using e^? H (t) and e~*! cos(4t) H(t) in Figure 11.3.5. 


e Example 11.3.7: Frequency modulation 


In contrast to amplitude modulation, frequency modulation (FM) transmits information by 
instantaneous variations of the carrier frequency. It can be expressed mathematically as 


exp li E f(r) dr+iC] eof, where C is a constant. To illustrate this concept, let us find 
the Fourier transform of a simple frequency modulation 


| Jwi, |t| < T/2, 
f) = { 0, E> T/2, (11.3.25) 
and C = —w T/2. In this case, the signal in the time domain is 
t | e IA T/2 iuot. t « -Tj2, 
g(t) — exp Gi f(r)dr + ic| pH ecu eiwiteiwot, -PeteT (11.3.26) 
—oo gi T/A giwot T/2 <t. 


We illustrate this signal in Figures 11.3.6 and 11.3.7. 
The Fourier transform of the signal G(w) equals 


-T 


Glu) = eT | 


— oo 


/2 T/2 oo 
eilwo—w)t dt «f eilwotw1—w)t dt + gon eilwo—w)t dt 
—TJ/2 T/2 
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Figure 11.3.6: The (amplitude) spectrum |G(w)|/T of a frequency-modulated signal (shown top) when 
w1T = 2» and woT = 107. The transform becomes undefined at w = wo. 


(11.3.27) 
0 oo 0 
= pum eilwo—w)t dt + pun ei(wo—w)t dt — pU eilwo—w)t dt 
—oo 0 —T/2 
T/2 T/2 
«f a alia qt — goats eoo dt, (11.3.28) 
—T/2 0 
Applying the fact that 
[i etie! dt = qó(o) + +, (11.3.29) 
0 Q 


[goo ea ym _ e (voten za 


_ _ iw, T/2 PERPE 
G(w) = rô(w — wo) [eT + e + aes 
[eon uL _ a ca 


on — uj (11.3.30) 


2w: sin[(w — wo — w1)T/2] l 


= 2rô(w — wo) cos(w1T/2) + (w — wo) (w — wo — w1) 


(11.3.31) 


Figures 11.3.6 and 11.3.7 illustrate the amplitude spectrum for various parameters. In 
general, the transform is not symmetric, with an increasing number of humped curves as 
w1T' increases. 


| Parseval's equality 


In applying Fourier methods to practical problems we may encounter a situation where 
we are interested in computing the energy of a system. Energy is usually expressed by the 
integral f^. |f (t)? dt. Can we compute this integral if we only have the Fourier transform 
of F(w)? 
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Figure 11.3.7: The (amplitude) spectrum |G(w)|/T of a frequency-modulated signal (shown top) when 
w1T = 8r and woT = 10r. The transform becomes undefined at w = wo. 


From the definition of the inverse Fourier transform 
1 ii iwt 
f(t)— = F(w)je dw, (11.3.32) 
T J—oo 
we have that 
/ eer dt = x | f(t) H F(u)etdu dt. (11.3.33) 
—oo T J—oo —oo 
Interchanging the order of integration on the right side of Equation 11.3.33, 


[erase [rol roca] a. s 


However, 


F*(w) = i f(t)e**tqt. (11.3.35) 


Therefore, 


f rota f FoP (11.3.36) 


This is Parseval's equality? as it applies to Fourier transforms. The quantity |F(w)|? is 
called the power spectrum. 


$ Apparently first derived by Rayleigh, J. W., 1889: On the character of the complete radiation at a 
given temperature. Philos. Mag., Ser. 5, 27, 460—469. 


The Fourier Transform 529 


Some General Properties of Fourier Transforms 


function, f(t) Fourier transform, F(w) 

1. Linearity cı f(t) + cag(t) ci F(w) + e2G(w) 

2. Complex FA F*(—w) 
conjugate 

3. Scaling flat) F(w/o)/|a| 

4. Delay f(t— T) ee P 

5. Frequency ett F(t) F(w — wo) 
translation 

6. Duality-time F(t) 2m f (—w) 
frequency 

7. Time f'(t) iwF (w) 
differentiation 


e Example 11.3.8 


In Example 11.1.1, we showed that the Fourier transform for a unit rectangular pulse 
between —a < t < a is 2sin(wa)/w. Therefore, by Parseval’s equality, 


2 CO gt 2 a © sin? 
zJ THI w= f 1?dt=2a, or / SE (ue) scm. (RSI) 


2 2 
—oo wW —a —oo Ww 


| Poisson’s summation formula 


If f(x) is integrable over (—oo, oo), there exists a relationship between the function and 
its Fourier transform, commonly called Poisson’s summation formula." 
We begin by inventing a periodic function g(x) defined by 


g(a) = 5 f(x +2rk). (11.3.38) 


k-——oo 


Because g(x) is a periodic function of 27, it can be represented by the complex Fourier 
series: 


g(z)9 M ee", o g= M, fuk) V, e. (11.3.39) 


n--—oo k-——oo n--—oo 


7 Poisson, S. D., 1823: Suite du mémoire sur les intégrales définies et sur la sommation des séries. J. 


École Polytech., 19, 404—509. See page 451. 
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Computing cn, we find that 
1 of = 
Cn = 5 EIC e "* da = — is f(x + 2kn)e ""* dz (11.3.40) 
oT 


—ina e ine au) F(n) 
DN m (x 4- 2kn)e-?"* dx = JE f(a dr— —5—, (11.3.41) 


a s 


where F'(w) is the Fourier transform of f(x). Substituting Equation 11.3.41 into the right 
side of Equation 11.3.39, we obtain 


oo 1 oo 
22 f(2rk) = = 2m F(n) (11.3.42) 
Or 
Y flak) = 5 r(=*) ; (11.3.43) 
k——oo n-—-—oo 


e Example 11.3.9 


One of the popular uses of Poisson's summation formula is the evaluation of infinite 
series. For example, let f(x) = 1/(a? + x?) with a real and nonzero. Then, F(w) = 
me—!2+l /|a| and 


E -lanj_ ^ [1249 —|aln 11.3.44 
È mma b ga A) ona 
1 2 |a| 
T 14 = th 11.3.4 

zs aa) = 7, qq ^? (8) SN 


Problems 


1. Find the Fourier transform of 1/(1 + a?t?), where a is real, given that F[1/(1 + t?)] = 
-|el 
me I", 


2. Find the Fourier transform of cos(at)/(1 4- t?), where a is real, given that F[1/(1+¢?)] = 
-|el 
ne i", 


3. Use the fact that F[e~“ H(t)] = 1/(a + iw) with a > 0 and Parseval’s equality to show 


that 
D dr — m 
œT? +a a 
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4. Use the fact that F[1/(1 + t?)] = e~'! and Parseval's equality to show that 
ri dr = 
Ds @ +12 2 


5. Use the function f(t) = e~“ sin(bt) H (t) with a > 0 and Parseval's equality to show that 


T dx z: i dx u T 
o (x2 +a? —b2)2+4a2b2 J g(z?--a? —02)?--4a?b? — 2a(a? +b?) 


6. Using the modulation property and [e^ H(t)| = 1/(b + iw), show that 


TEES INS 
(b + iw)? +a? 


F [e~™ sin(at) H(t)] = 
Use MATLAB to plot and compare the amplitude and phase spectra for e * H(t) and 
e 'sin(2t) H(t). 


7. Use Poisson’s summation formula with f(t) = e-!*! to show that 


oo 


DES ma de 
=T . 
Pris n? --1 1—e-?" 


8. Use Poisson's summation formula to prove? that 


~ —a(n+c)?+2b(n+e) = T b? /a = n?n? /a—2nri(b/a—c) 
i fen Ye | 


n=—Cco n=— Co 


9. Use Poisson’s summation formula to prove that 


Y —ianT __ 2n J> ó e _ a), 


where ó(-) is the Dirac delta function. 


10. Prove the two-dimensional form? of Poisson’s summation formula: 


S j» Tt aiki, aka) = ae S Y r( =, mne) ! 


kı=— 20 k2 =— o0 nı =— 00 N2 =— 00 


8 First proved by Ewald, P. P., 1921: Die Berechnung optischer und elektrostatischer Gitterpotentiale. 
Ann. Phys., 4te Folge, 64, 253-287. 


9 Lucas, S. K., R. Sipcic, and H. A. Stone, 1997: An integral equation solution for the steady-state 
current at a periodic array of surface microelectrodes. SIAM J. Appl. Math., 57, 1615-1638. 
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where 


Blass / J f(x, y)e 2 de dy. 


11.4 INVERSION OF FOURIER TRANSFORMS 


Having focused on the Fourier transform in the previous sections, we now consider the 
inverse Fourier transform. Recall that the improper integral, Equation 11.1.6, defines the 
inverse. Consequently, one method of inversion is direct integration. 


e Example 11.4.1 


Let us find the inverse of F(w) = ve- ^l. 
From the definition of the inverse Fourier transform, 


1 oo , 1 0 . 1 po , 
f(t) => | re let dyy = J elite diy + J etto dw (11.4.1) 
T J—oo —oo 0 
ieee” eee) if 1 1 ]_ 1 nee. 
2| 1+it |_. -litja | 2[1+i -1+it}] 1-0 "s 


An alternative to direct integration is the MATLAB function ifourier. For example, 
to invert F(w) = e-!*l, we type in the commands: 
>> syms pi omega t 
>> ifourier(’pi*exp(-abs (omega) )’ ,omega,t) 
This yields 
ans = 
1/(1+t72) 


Another method for inverting Fourier transforms is rewriting the Fourier transform 
using partial fractions so that we can use transform tables. The following example illustrates 
this technique. 

e Example 11.4.2 


Let us invert the transform 


F(w) : (11.4.3) 
“= (d oy — Di)?" ih 
We begin by rewriting (11.4.3) as 
1f 1 2 6 1 1 1 
F(w) = | s: | . (11.4.4 
(o) 9|1+iw ioe Gooey 9(1 + iw) 92 — iw) 6(3 — tw)? 


Using a table of Fourier transforms (see Section 11.1), we invert Equation 11.4.4 term by 
term and find that 


f= Let H(t) + se? H(-t) = cte!/? H (—t). (11.4.5) 
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To check our answer, we type the following commands into MATLAB: 


>> syms omega t 

>> ifourier(1/((1*i*omega)*(1-2*i*omega)^2) ,omega,t) 

which yields 

ans = 

1/9*exp(-t)*Heaviside(t)-1/6*exp(1/2*t)*t*Heaviside(-t) 
*1/9*exp(1/2*t)*Heaviside(-t) 


Although we may find the inverse by direct integration or partial fractions, in many 
instances the Fourier transform does not lend itself to these techniques. On the other hand, 
if we view the inverse Fourier transform as a line integral along the real axis in the complex 
w-plane, then some of the techniques that we developed in Chapter 10 can be applied to 
this problem. To this end, we rewrite the inversion integral, Equation 11.1.6, as 


1 itw 1 itz 1 itz 
—— — — Aa. 


— oo 


where C denotes a closed contour consisting of the entire real axis plus a new contour 
Cg that joins the point (00,0) to (—oo,0). There are countless possibilities for Cg. For 
example, it could be the loop (oo, 0) to (co, R) to (—oo, R) to (—00,0) with R > 0. However, 
any choice of Cg must be such that we can compute fo, F(z)e"^ dz. When we take that 
constraint into account, the number of acceptable contours decreases to just a few. The 
best is given by Jordan's lemma.!? 


Jordan's lemma: Suppose that, on a circular arc Cr with radius R and center at the 
origin, f(z) —^ 0 uniformly as R — co. Then 


(1) lim (a)e*^* dz = 0, (m > 0) (11.4.7) 


R-o0o Cr 


if Cr lies in the first and/or second quadrant; 


R-0o 


(2) lim I f(z)e-** dz — 0, (m > 0) (11.4.8) 
Cn 
if Cr lies in the third and/or fourth quadrant; 


(3) lim A f(z)e"* dz =0, (m » 0) (11.4.9) 


Roo 


if Cr lies in the second and/or third quadrant; and 


(4) lim (z)e ""? dz = 0, (m » 0) (11.4.10) 


Roo Cr 


if Cr lies in the first and/or fourth quadrant. 


10 Jordan, C., 1894: Cours D’Analyse de l'École Polytechnique. Vol. 2. Gauthier-Villars, pp. 285—286. 
See also Whittaker, E. T., and G. N. Watson, 1963: A Course of Modern Analysis. Cambridge University 
Press, p. 115. 
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Technically, only (1) is actually Jordan's lemma while the remaining points are varia- 
tions. 


Proof: We shall prove the first part; the remaining portions follow by analog. We begin by 
noting that 


|Ir| = f fizje”* dz| < | | f(z)| e*"*| Ida]. 11.4.11) 
Cr Cn 
Now 
|dz| = Rd, |f(z)| < Mn, 11.4.12) 
jen = |exp(imRe”’)| = Jexp(imR[cos(0) + isin(0)])| = e^" 252, 11.4.13) 
Therefore, 
01 
\IR| € RMa f exp[—mR sin(0)] d9, 11.4.14) 
0o 


where 0 < ĝo < 01 € v. Because the integrand is positive, the right side of Equation 11.4.14 
is largest if we take 09 = 0 and 0, = 7. Then 


T . 7/2 x 
[In| < RMR | e~™Rsin(®) 49 — 28 Mg | g ARE) qu (11.4.15) 
0 0 


We cannot evaluate the integrals in Equation 11.4.15 as they stand. However, because 
sin(0) > 20/7 if 0 < 0 < 1/2, we can bound the value of the integral by 


7/2 
UIg| € 2RMp | e 2mB6/s qg = T. Mp (Lew P). (11.4.16) 
0 m 


If m > 0, |In| tends to zero with Mg as R > oo. 


Consider now the following inversions of Fourier transforms: 
e Example 11.4.3 


For our first example we find the inverse for 


F(w) = ST — aap 6b20 (11.4.17) 
From the inversion integral, 
fi) = > l. cs dw, (11.4.18) 
2T J 4, W? — 2ibw — a? — b? 
or 
itz itz 
f= = f, ge 2: — a? — b? x x A z? — ae — a? — b? 2 ae) 


where C denotes a closed contour consisting of the entire real axis plus Cr. Because 
f(z) = 1/(z? — 2ibz — a? — b?) tends to zero uniformly as |z| — oo and m = t, the second 
integral in Equation 11.4.19 vanishes by Jordan's lemma if Cg is a semicircle of infinite 
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original | contour 


Cr fort «0 


Figure 11.4.1: Contour used to find the inverse of the Fourier transform, Equation 11.4.17. The contour 
C consists of the line integral along the real axis plus Cg. 


radius in the upper half of the z-plane when t > 0 and a semicircle in the lower half of the 
z-plane when t « 0. 
Next we must find the location and nature of the singularities. They are located at 


z? — 2ibz — a? — V? =0, or z=+a+ bi. (11.4.20) 


Therefore we can rewrite Equation 11.4.19 as 


fl) = = f, CETESUCENEST da (a 


Thus, all of the singularities are simple poles. 
Consider now t > 0. As stated earlier, we close the line integral with an infinite 
semicircle in the upper half-plane. See Figure 11.4.1. Inside this closed contour there are 


two singularities: z = +a + bi. For these poles, 
eit? (z pom bijet? 
Res : bi} = li 11.4.22 
(5t i) Beier (z —a — bi)(z +a— bi) ( ) 


eiate-bt ot 
min m -zg eos(at) -Fisim(at), (11.4.23) 
a a 


where we used Euler's formula to eliminate e^^*. Similarly, 


itz —bt 
Res( 5 Z D UNT S + i) = mE [cos(at) — i sin(at)]. (11.4.24) 


Consequently, the inverse Fourier transform follows from Equation 11.4.21 after applying 
the residue theorem, and equals 
e*t 
f(t) ac sin(at) (11.4.25) 
a 


for t > 0. 
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For t < 0, the semicircle is in the lower half-plane because the contribution from the 
semicircle vanishes as R — oo. Because there are no singularities within the closed contour, 
f(t) =0. Therefore, we can write in general that 


—bt 


f(t) = S sin(at) H(t). (11.4.26) 


e Example 11.4.4 


Let us find the inverse of the Fourier transform 


—wi 


€ 


S erar 


(11.4.27) 


where a is real and positive. 
From the inversion integral, 


1 oo ei (t- o 1 ett- 12 1 ei (t-1)2 
t)= dw = d d 11.4.28 
f x] ote = x jura 7 E " ( ) 


where C denotes a closed contour consisting of the entire real axis plus Cr. The contour 
Cr is determined by Jordan's lemma because 1/(z? + a?) — 0 uniformly as |z| — oo. Since 
m = t — 1, the semicircle Cr of infinite radius lies in the upper half-plane if t > 1 and in 
the lower half-plane if t < 1. Thus, if t > 1, 


i ei(t-1z ; e -a-1) 
f(t) = ORo ial = 5 —. (11.429) 
whereas for t < 1, 
1 g s ei(t-1)z ea(t—1) 
t) = =-(-2n1)Res ;—ai| = 11.4.3 
1O = Erie Sai] = 77 (11.4.30) 


The minus sign in front of the 277 arises from the clockwise direction or negative sense of 
the contour. We can write the inverse as the single expression 


e-alt-1| 
1) = 11.4.31 
r=- (11.4.31) 
e Example 11.4.5 
Let us evaluate the integral 
^* cos(kx) 
e ag lt, (11.4.32) 


where a, k > 0. 
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original contour 


CR fort<0 


Figure 11.4.2: Contour used in Example 11.4.6. 


We begin by noting that 


oo k oo tka ikz 
/ COSI) den J dec f GE dz); (11.4.33) 
Lœ T? +a? Los t? +a? Gi 2? +a? 


where C denotes a line integral along the real axis from —oo to oo. A quick check shows 
that the integrand of the right side of Equation 11.4.33 satisfies Jordan's lemma. Therefore, 


oo ci^ ei^ ei^ 
—— dr = $ —— —; dz = 2ni Res! ———5;ai (11.4.34) 
Los €? + a? c2 +a? 22 + a? 
. na Yoikz 
sgm dm 42996 7 T oda (11.4.35) 
z—ai z2 + a2 a 


where C denotes the closed infinite semicircle in the upper half-plane. Taking the real and 
imaginary parts of Equation 11.4.35, 


eee © sin(k 
| cos(kz) | Te-ka ang I sin(kz) o. (11.4.36) 
oo x3 a à coy T? + a? 


e Example 11.4.6 


Let us now invert the Fourier transform F(w) = 2a/(a? — w?), where a is real. The 
interesting aspect of this problem is the presence of singularities at w = +a that lie along 
the contour of integration. How do we use contour integration to compute 


rozi f erem (11.4.37) 


2 2 
=66 =w 


The answer to this question involves the concept of Cauchy principal value integrals, 
which allows us to extend the conventional definition of integrals to include integrands 
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that become infinite at a finite number of points. See Section 10.10. Thus, by treating 
Equation 11.4.37 as a Cauchy principal value integral, we again convert it into a closed 
contour integration by closing the line integration along the real axis as shown in Figure 
11.4.2. The semicircles at infinity vanish by Jordan's lemma and 


a eit? 
f= T "CEN dz. (11.4.38) 
T Joa?^—z 
For t > 0, 
2ria 1 eit? 27ia 1 eit? 
t)h= ; ;a|. 11.4.39 
f(t) ^ sRes| e - ges ia ( 39) 


We have the factor 4 because we are only passing over the “top” of the singularity at z — a 
and z — —a. Computing the residues and simplifying the results, we obtain f(t) — sin(at). 


Similarly, when t « 0, 


2ria 1 eit? 271ia 1 
T e 2 2? + 
z 


itz 
TP ai = —sin(at). — (11.4.40) 


These results can be collapsed down to the single expression f(t) = sgn(t) sin(at). 
e Example 11.4.7 


An additional benefit of understanding inversion by the residue method is the ability 
to qualitatively anticipate the inverse by knowing the location of the poles of F(w). This 
intuition is important because many engineering analyses discuss stability and performance 
entirely in terms of the properties of the system's Fourier transform. In Figure 11.4.3 we 
graphed the location of the poles of F(w) and the corresponding f(t). The student should 
go through the mental exercise of connecting the two pictures. 


e Example 11.4.8 


So far, we used only the first two points of Jordan's lemma. In this example!! we 
illustrate how the remaining two points may be applied. 
Consider the contour integral 


j cot(7z) a + = dz, 
C (r4 222)?)--8?  (r—2mz) + 8? 


where c > 0 and 8,7 are real. Let us evaluate this contour integral where the contour is 
shown in Figure 11.4.3. 


11 See Hsieh, T. C., and R. Greif, 1972: Theoretical determination of the absorption coefficient and the 
total band absorptance including a specific application to carbon monoxide. Int. J. Heat Mass Transfer, 
15, 1477-1487. 
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Figure 11.4.3: The correspondence between the location of the simple poles of the Fourier transform F (w) 
and the behavior of f(t). 


540 Advanced Engineering Mathematics with MATLAB 


Figure 11.4.4: Contour used in Example 11.4.7. 


From the residue theorem, 


j cot(ne) | = + pu | dz 
c (T +2r2z)? +8?  (r—2mz)?-- f? 


= 2ri 5 Res{ cot(r2)| — F 2622 m + oz 2073 st z| ; n} 


n=1 
f d e c .Jr| + Bi 
+ 2i Res{ cot(z) | e oar z| PA j 
. e e |r|- Bi 
+2ri Res cot(x2) | Cri eraa = E \ (11.4.41) 


Now 


—cz 


" ; e € : 
epo We lc Fonz? +B (r-2niya zJ"j 


. (z—n)cos(mz) .. Cae e c 
zx l H 11.4.42 
iom sin(zz) im (7 +2nz)? +8?  (r—2mz)?- f? ( ) 
1 e- ne e qe 
x: 11.4.4 
T lc +2nr)? +8?  (r—2nz)4 z| l ( d 


—cZz 


€ e € T| + Bi 
Res cot(mz) f=: Spee GE One =| | - \ 
' cot(rz)| (z—]|r| — bije” (z — |r| — Bi)e ** 
H 2 (r1 80/2 4r? E + 7/27)? + 82/47? Y (z — T/2n)? + 8? /An? 
. cot(|r]/2 + 81/2) exp(—e|r|/27) [cos(cB/27) — i sin(c8/2)] 
Ar Di : 


| (11.4.44) 


(11.4.45) 


and 


—cZz 


€ € 


eee ee 
Res coka) lc + 272z)? + 8? i (7 — 2nz)? + z| 2n \ 
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(11.4.46) 


i cot(1z) (z — |r| + Bije”? (z — |r| + Bi)e" 
sz onde 4n? | (z2+7/27)? + B?/An? | (z— 7/277)? + 82 /4m? 
. cot(|r|/2 — 81/2) exp(—c|r|/2m)|cos(cB8 /2m) + i sin(cB/2m)] 
E —Ar fi 


(11.4.47) 


Therefore, 


j cot(zz) = E | dz 
C (T +202)? +8?  (r—2mz)?-- f? 


oo enc enc 

2 lc + 2n)? + 8? u (T — 2nz)? + = 

i i eltl + ef 
2B eilr| — ef 
i ell 4 e-8 


2 28 eilr] — e-8 


ecl?" [cos(c /2r) — i sin(cB/2x)] 


e- 4171/7" (cose /25) + isin(cB/27)] (11.4.48) 


. oo e ^c enc 
E 93 [c + 2nr)? + 8? * (T — 2nz)? — 
i sinh(8) cos(cB/27) + sin(|7|) sin(c8/27) e—cl7|/27 


B cosh(8) — cos(7) 


where cot(o) = i(e?"* + 1)/(e?** — 1), and we made extensive use of Euler’s formula. 

Let us now evaluate the contour integral by direct integration. The contribution from 
the integration along the semicircle at infinity vanishes according to Jordan's lemma. In- 
deed, that is why this particular contour was chosen. Therefore, 


(11.4.49) 


j cota) ma | = | dz 
C (r4 222))-- 0?  (r—222)? + f? 
= ju cot(ne) | = + = | dz 
b Gone +B (rins e e 
«f cot(r2) | aes + = | dz 
C. (7 +272)? + 0?  (r—2mz)?- f? 


—ioo ec ee 
i iB TER) | (T + 222) + 62 (r -—2rz)? + zi des (11.4.50) 


Now, because z = iy, 


ie enc en 
j^ ela) | (7 4-292)? + 8? | (T —2nz)? + E ge 
€ "ET e icu 

a soth(ry)|.— lxi) Bi + (r— my + z| dy (11.4.51) 

jo coth(zy)(7? + 8? — 4n?y?)e **v 
< G2 B2 — 4242)? + 16727242 


I 


I 


dy, (11.4.52) 


—ioo : e c? e c F 
J eorom oa rE 
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—oo e eV e ic 
= th A, 
if coth(my) | ponte CR YT FE dy (11.4.53) 


oo th 2 2 2,,2Y51cy 
Lo OR UE camo a: (11.4.54) 
e (T2 + 62 — 4r?y?)? 162?7?y? 7" 


and 


J cot(ne) | = H = | dz 
C. (T +272)? + 0?  (r—2m2)?-- f? 


—n/2 1 0i . 
=| |e E | eta 
7/2 mee”? 3 


— cce? _pepbi 
Oe a a o MN (11.4.55) 
(T + 21ee?:)? + 82 (T — 2mee?')? + 8? 


In the limit of e > 0, 


f cot(7z) e + d d 
z 
c (T +2r2)? +8?  (r—2z2) + 8? 
æ coth 2 | 82 qum d t . 
E ai f co erg) +8 n’y?) sin(cy) dj 2i (11.4.56) 
o (7? + B? — Arr y?)? + 160272 y? 71? + f? 


: eo e" S 
3 | €— 1 


i sinh(B) cos(cB/2n) + sin(||) sin(cB/27) 4-2 
B cosh(8) — cos(7) £ "s (11.4.57) 


or 


M) coth(my) (7? + 8? — 41?y?) sin(cy) d 
o (7r? 8? — 4n?y?)? + 162?7?y? 
s (11.4.58) 


: e 
B oz +82  (r—2nz + B? 

1 sinh(8) cos(cB/27) + sin(|7]) sin(c8/27) c drl/2n 2 
B cosh(8) — cos(r) Sede: Bes 


If we let y = x/2m, 


Zu coth(z/2)(r? + 8? — x? )sin(ez/2m) 3 
z 
(T2 + 82 — z?)? + Ar?z? 


—nc —nc 
€ 


e 
B 26> |g + 2n7)? + 8? t (7 — 2nr)? + 8? 
sinh(8) cos(cB8/27) + sin(|r|) sin(cB/27) e drl/2n 28 


cosh(8) — cos(7) T T? + 8?^ 


(11.4.59) 


Problems 


1. Use direct integration to find the inverse of the Fourier transform F(w) = iwme!#! /2. 
Check your answer using MATLAB. 
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Use partial fractions to invert the following Fourier transforms: 


1 1 

x (1 + iw)(1 + 2iw) 2 (1 + dw)(1-— iw) 
iw 1 

* (1 + iw)(1 + 2iw) (1 + iw)(1 + 2iw)? 


Then check your answer using MATLAB. 


By taking the appropriate closed contour, find the inverse of the following Fourier transforms 
by contour integration. The parameter a is real and positive. 


1 w 
w2 + a? "uw? + a? 
š w 9 w? 
^ (w? + a?)? ^ (w? + a?)? 
1 1 
10. ——— 11. —— 
w? — 3iw — 3 (w — ia)?n*? 
w? 3 
12. 13. —————— 
(w? — 1)? + 4a?w? (2 — wi)(1 + wi) 


Then check your answer using MATLAB. 


14. Find the inverse of F(w) = cos(w)/(w* + a”), a > 0, by first rewriting the transform as 


—iw 


e 
2(w? + a?) + 2(w? + a?) 


iw 


F(w) = 


and then using the residue theorem on each term. 


15. Find!? the inverse Fourier transform for 


etiw etio iw 
Fs = - eS 
(o) = C ai) (Rete) ^ (u —ai) (RE — e 3r)" 


where a > 0 and R > 1. Hint: You must find separate inverses for different time intervals. 
For example, in the case of F} (w), you must examine the special cases of t < 0 and t > 0. 


16. As we shall show shortly, Fourier transforms can be used to solve differential equations. 
During the solution of the heat equation, Taitel et al.'? inverted the Fourier transform 


cosh(yvw? + 1) 
Vu? +1 sinh(pVw? + 1/2)’ 


F(w) = 


12 See Scharstein, R. W., 1992: Transient electromagnetic plane wave reflection from a dielectric slab. 
IEEE Trans. Educ., 35, 170-175. 


13 Taitel, Y., M. Bentwich, and A. Tamir, 1973: Effects of upstream and downstream boundary conditions 
on heat (mass) transfer with axial diffusion. Int. J. Heat Mass Transfer, 16, 359-369. 
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where y and p are real. Show that they should have found 


€ -|tl R y 2nmy DS i 
ft = ' 3» cos rx o vl 
aca v Tus 2e p 
In this case, our time variable t was their spatial variable x — £. 


17. Find the inverse of the Fourier transform 


wL Bs 
F(w) = |cos 4 —— — — ——4 , 
e= [arre ia) 
where L, 6, and y are real and positive and sgn(z) = 1 if R(z) > 0 and —1 if R(z) < 0. 


Use the residue theorem to verify the following integrals: 


T n sin(z) = T an(2) m Pes mult 
z e 0 


œ T? +4r 5 xr? +1)? 2e 

°° xsin(ax) E" ?* x? cos(ax) T B 
20. — d= j 21. = 1 be ^ 
0 n Q4 4 -Te | (a? + bP dx 15. ab)e 


where a,b > 0. 


22. The concept of forced convection is normally associated with heat streaming through 
a duct or past an obstacle. Bentwich!^ showed that a similar transport can exist when 
convection results from a wave traveling through an essentially stagnant fluid. In the process 
of computing the amount of heating, he proved the following identity: 


J. — cos(az) de = Infcoth(|a}r/h)], h > 0. 


— oo 


Confirm his result. 
11.5 CONVOLUTION 


The most important property of Fourier transforms is convolution. We shall use it 
extensively in the solution of differential equations and the design of filters because it yields 
in time or space the effect of multiplying two transforms together. 

'The convolution operation is 


= [i f(z)g(t — x) dz = T f(t — z)g(z) da. (11.5.1) 
Then, 


= /. f(z)e ni g(t — ajena dx (11.5.2) 
= / li f(z)G(» 


Je "da = F(w)G(w). (11.5.3) 


14 Bentwich, M., 1966: Convection enforced by surface and tidal waves. Int. J. Heat Mass Transfer, 9, 
663-670. 
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Thus, the Fourier transform of the convolution of two functions equals the product of the 
Fourier transforms of each of the functions. 


e Example 11.5.1 


Let us verify the convolution theorem using the functions f(t) = H(t + a) — H(t — a) 
and g(t) = e™*H (t), where a > 0. 
The convolution of f(t) with g(t) is 


f(t) * g(t) = / e C-? H(t — x) [H(x +a) — H(z — a)] dz = e= f e H(t — x) dx. 
7 7 (11.5.4) 
If t < —a, then the integrand of Equation 11.5.4 is always zero and f(t) « g(t) = 0. If t >a, 
f(t) * g(t) = ae | eda = e (4) — e— (ta), (11.5.5) 


—a 


Finally, for —a < t <a, 


t 
f(t) *g(t) = ef e*dx = 1 — e (t9, (11.5.6) 
In summary, 
0, t < -—a, 
f(t)«g(t) 24 1- etto, —a <t<a, (11.5.7) 
g-6-«) — e7 (tta) a<t 


As an alternative to examining various cases involving the value of t, we could have 

used MATLAB to evaluate Equation 11.5.4. The MATLAB instructions are as follows: 

>> syms f t x 

>> syms a positive 

>> f = 'exp(x-t)*Heaviside(t-x)*(Heaviside(x*a)-Heaviside(x-a))? 

>> int(f,x,-inf,inf) 

This yields 

ans = 

Heaviside(t*a)-Heaviside(t*ta)*exp(-a-t) 
-Heaviside(t-a)*Heaviside(t-a)*exp(a-t) 


The Fourier transform of f(t) * g(t) is 


FIFE * g(t) af [i-e eae [^ le (t-a) _ ¢ 5) ett — (11.5.8) 


a 


= F(w)G(w) (11.5.9) 


I 


2sin(wa) — 2isin(wa) a 1 ) 


w l+wi w l+wi 


and the convolution theorem is true for this special case. The Fourier transform Equation 
11.5.9 could also be obtained by substituting our earlier MATLAB result into fourier and 
then using simplify (ans). 
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e Example 11.5.2 
Let us consider the convolution of f(t) = f, (t)H(t) with g(t) = g,H(t). Note that 


both of the functions are nonzero only for t > 0. 
From the definition of convolution, 


F0 — [^ f e mi a)gy()H (ede = [^ falt- tri na ds 


(11.5.10) 
For t < 0, the integrand is always zero and f(t) * g(t) = 0. For t > 0, 
t 
f) «a = | fett moe) de. (11.5.11) 
0 
Therefore, in general, 
t 
SOI — | | HE- naso) de] n. (11.5.12) 
0 


This is the definition of convolution that we will use for Laplace transforms where all of the 
functions equal zero for t « 0. 


'The convolution operation also applies to Fourier transforms, in what is commonly 
known as frequency convolution. We now prove that 


Froo- Lr en. (11.5.13) 
where " 
F(w)*G(w) = ni F(r)G(w — 7) dr, (11.5.14) 


where F(w) and G(w) are the Fourier transforms of f(t) and g(t), respectively. 


Proof: Starting with 
Lf 
r= x] F(r)e"'* dr, (11.5.15) 
T J—oo 


we can multiply the inverse of F(T) by g(t) so that we obtain 


f(t)g(t) = > T F(T)g(t)e dr. (11.5.16) 


Then, taking the Fourier transform of Equation 11.5.16, we find that 


ries - f^ |g [^ FPOe ar me (11.5.17) 
= F F(r) VA g(t)e tert a| dr (11.5.18) 
= > RAC G=tdre a (11.5.19) 
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Thus, the multiplication of two functions in the time domain is equivalent to the convolution 
of their spectral densities in the frequency domain. 


Problems 
1. Show that e™tH (t) «e * H(t) = te * H(t). Then verify your result using MATLAB. 
2. Show that e-* H(t) * e! H(—t) = 1e-!'l. Then verify your result using MATLAB. 


3. Show that e *H(t) * e ?* H(t) = (e * —e?*) H(t). Then verify your result using 
MATLAB. 


4. Show that 
et — et-?, t <0, 
e'H(—t) « [H(t) - H(t- 2) 24 1— et? 0<t<2, 
0, 2<t. 
Then verify your result using MATLAB. 
5. Show that 
0, t <0, 
t, 0<t<2, 
H(t) -BG-2)«(H(0 -B-2)- 4 geyea 
0, 4 & t. 


'Then try and verify your result using MATLAB. What do you have to do to make it work? 
6. Show that e7!#l x e-ltl = (1 + [t|)e- tl. 
7. Prove that the convolution of two Dirac delta functions is a Dirac delta function. 


11.6 THE SOLUTION OF ORDINARY DIFFERENTIAL EQUATIONS 
BY FOURIER TRANSFORMS 


As with Laplace transforms, we may use Fourier transforms to solve ordinary differential 
equations. However, this method gives only the particular solution and we must find the 
complementary solution separately. 

Consider the differential equation 


y ty=se ll, —oo < t < oo. (11.6.1) 
Taking the Fourier transform of both sides of Equation 11.6.1, 


Y EY A (11.6.2) 


where we used the derivative rule, Equation 11.3.17, to obtain the transform of y' and 
Y(w) = F[y(t)]. Therefore, 
1 


PW SED 


(11.6.3) 
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Applying the inversion integral to Equation 11.6.3, 


y(t) = x ds m "T rop ds (11.6.4) 


We evaluate Equation 11.6.4 by contour integration. For t > 0 we close the line integral 
with an infinite semicircle in the upper half of the w-plane. The integration along this arc 
equals zero by Jordan's lemma. Within this closed contour we have a second-order pole at 
z = i. Therefore, 


eit? l , d " eit? te-t et 

m ERU ER EU le i) — =z ta 0165 
and : ; , i 
š ie p~. e. 

y(t) — o; Qni) | 5 =| TA (2t 4- 1). (11.6.6) 


For t « 0, we again close the line integral with an infinite semicircle but this time it is in 
the lower half of the w-plane. The contribution from the line integral along the arc vanishes 


by Jordan's lemma. Within the contour, we have a simple pole at z — —i. Therefore, 
eit? eit? et 
R i| = li ri =——, 11.6.7 
5| (241) + zi)! ] Ani mE 


and 


uires 2ni) ( =) = = (11.6.8) 


The minus sign in front of the 277 results from the contour being taken in the clockwise 
direction or negative sense. Using the step function, we can combine Equation 11.6.6 and 
Equation 11.6.8 into the single expression 


y(t) = telil + Ste H (t). (11.6.9) 

Note that we only found the particular or forced solution to Equation 11.6.1. The most 
general solution therefore requires that we add the complementary solution Ae *, yielding 
y(t) = Ae’ + fel" + Ste H(t). (11.6.10) 


The arbitrary constant A would be determined by the initial condition, which we have not 
specified. 
We could also have solved this problem using MATLAB. The MATLAB script 


clear 

% define symbolic variables 

syms omega t Y 

% take Fourier transform of left side of differential equation 
LHS = fourier (diff(sym(’y(t)’))+sym(’y(t)’) , t, omega) ; 

^ take Fourier transform of right side of differential equation 
RHS = fourier (1/2*exp(-abs(t)) ,t,omega) ; 

% set Y for Fourier transform of y 

% and introduce initial conditions 

newLHS = subs(LHS,'fourier(y(t),t,omega)?,Y); 
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^4 solve for Y 

Y = solve(newLHS-RHS,Y); 

^ invert Fourier transform and find y(t) 

y = ifourier(Y,omega,t) 

yields 

y = 

1/4*exp(t) *Heaviside(-t)+1/2*exp(-t) *t*Heaviside(t) 
*1/4*exp(-t)*Heaviside(t) 

which is equivalent to Equation 11.6.9. 
Consider now a more general problem of 


yty=f(), =œ < t< œ, (11.6.11) 


where we assume that f(t) has the Fourier transform F'(w). Then the Fourier-transformed 
solution to Equation 11.6.11 is 


>q dk 
^ l-cuwi 


Y(w) 


F(w) = G(w)F(w) Or y(t) = g(t) * f(t), (11.6.12) 


where g(t) = F~+[1/(1 + wi)] = et H(t). Thus, we can obtain our solution in one of two 
ways. First, we can take the Fourier transform of f(t), multiply this transform by G(w), 
and finally compute the inverse. The second method requires a convolution of f(t) with 
g(t). Which method is easiest depends upon f(t) and g(t). 

In summary, we can use Fourier transforms to find particular solutions to differential 
equations. The complete solution consists of this particular solution plus any homogeneous 
solution that we need to satisfy the initial conditions. Convolution of the Green’s function 
with the forcing function also gives the particular solution. 


Problems 


Find the particular solutions for the following differential equations. For Problems 1-3, 
verify your solution using MATLAB. 


Oy aad — 1,- 
1. y" +3y' + 2y = eH (t) 2. y" + Ay + Ay = 1e Hl 


3. y" — 4y' + Ay = e H(t) 4. y" — My = ó(x), 
where A has a positive real part and a negative imaginary part. 
11.7 THE SOLUTION OF LAPLACE'S EQUATION ON THE UPPER HALF-PLANE 


In this section we shall use Fourier integrals and convolution to find the solution of 
Laplace's equation on the upper half-plane y > 0. We require that the solution remains 
bounded over the entire domain and specify it along the x-axis, u(z,0) = f(x). Under these 
conditions, we can take the Fourier transform of Laplace's equation and find that 


? gu 
Los Oy? 


© Ou 


ax e *9* dg = 0. (11.7.1) 
—oo OX 


e |" dx + 
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If everything is sufficiently differentiable, we may successively integrate by parts the first 
integral in Equation 11.7.1, which yields 


SEC deg d. 3u s ll m ðu ; 
v eure [ww d mue Rie Wr > Tp wr d 11.7.2 
I. agit T ma NN ae T ) 
= iw u(x, ye twp. — u? f u(x, ye?" dx 11.7.3) 
= —w*U(w,y), 11.7.4) 
where E 
U(w,y) = J u(x, y)e *^* dz. 11.7.5) 
'The second integral becomes 
PE u ; d? n ; d?U (w, y) 
Lao WE = — ic AD d = ee ako 
M yia dx ap is u(x, y)e : ap (11.7.6) 
along with the boundary condition that 
F(w) = U(w,0) zi f(a) *** dz. (11.7.7) 


Consequently, we reduced Laplace's equation, a partial differential equation, to an ordinary 
differential equation in y, where w is merely a parameter: 


2 
d"U (v; y) em w?U (tw, y) — 0, (11.7.8) 
dy? 

with the boundary condition U(w,0) = F(w). The solution to Equation 11.7.8 is 
U(w,y) = A(w)el*lY + B(uye-", 0< y. (11.7.9) 


We must discard the c4! term because it becomes unbounded as we go to infinity along 
the y-axis. The boundary condition results in B(w) = F(w). Consequently, 


U(w,y) = F(w)e "lv. (11.7.10) 


The inverse of the Fourier transform e~!“!¥ equals 


mes e ley gio dw = mus e" 9e? dw + az | e be dw 11.7.11) 
M E 1 qr oes 
=o e "yg ™? dw + — e "yg dw 11.7.12) 
2m Jo 2m Jo 
1 oo 
= Ji e ^? cos(wax) dw 11.7.13) 
7T Jo 
| lfexp(-uy) i P: 
= m p vm d [—y cos(w) + z sin(wx)] 11.7.14) 
y 0 
1 
ees 11.7.15) 


Tr? +Y? 
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Furthermore, because Equation 11.7.10 is a convolution of two Fourier transforms, its inverse 


is 
"CHEM iU 
u(z,y) = = CIE um dt. (11.7.16) 


Equation 11.7.16 is Poisson’s integral formula’ for the half-plane y > 0 or Schwarz’ integral 
formula.'® 


e Example 11.7.1 


As an example, let u(x,0) = 1 if |x| < 1 and u(#,0) = 0 otherwise. Then, 
jo 1 1— 1 

u(r,y) = / d dt = [a (=) + tan ( JI (11.7.17) 
TJ (=t)? +y? T y y 


Problems 


Find the solution to Laplace’s equation in the upper half-plane for the following boundary 
conditions: 


fil, O0<zz<l, 1 "ENG 
1. u(z,0) = ls otherwise. 2. u(z,0) = { T uen 
To, qom 0, 2o, r< =i: 
3. u(x, 0) = 4. u(x,0) = To, duse 
á We e 0, 1 «Sq. 
To, £ < ay, 
To, —-1«z «0, zd ay < $ < a2, 
5. u(x, 0) = [re ne 0<a<l, 6. u(z,0) = 2, 02 «X t € 43, 
0, otherwise. : í 


T An « dX. 


11.8 THE SOLUTION OF THE HEAT EQUATION 


We now consider the problem of one-dimensional heat flow in a rod of infinite length 
with insulated sides. Although there are no boundary conditions because the slab is of 
infinite extent, we do require that the solution remains bounded as we go to either positive 
or negative infinity. The initial temperature within the rod is u(z,0) = f(x). 

Employing the product solution technique of Section 8.3, we begin by assuming that 
u(x,t) = X(x)T(t) with 

T' - a2AT — 0, (11.8.1) 


15 Poisson, S. D., 1823: Suite du mémoire sur les intégrales définies et sur la sommation des séries. J. 
École Polytech., 19, 404—509. See pg. 462. 


16 Schwarz, H. A., 1870: Über die Integration der partiellen DifferentialgleichungO?u/8z? +0?u/dy? = 0 
für die Flàche eines Kreises. Vierteljahrsschr. Naturforsch. Ges. Zürich, 15, 113-128. 
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and 
X" EF AX =0. (11.8.2) 


Solutions to Equation 11.8.1 and Equation 11.8.2, which remain finite over the entire x- 
domain, are 
X(x) = E cos(kz) + F sin(kz), (11.8.3) 


and 
T(t) = C exp(- ?a?t). (11.8.4) 


Because we do not have any boundary conditions, we must include all possible values of 
k. Thus, when we sum all of the product solutions according to the principle of linear 
superposition, we obtain the integral 


utes f [A(k) cos(ka) + B(k) sin(kz)]e-* *^* dk. (11.8.5) 
0 
We can satisfy the initial condition by choosing 


A(k) = Ju f (x) cos(ka) da, (11.8.6) 


1 oo 
B(k) — JI f(x) sin(ka) da, (11.8.7) 
T Joo 
because the initial condition has the form of a Fourier integral 
f(z) = J [A(k) cos(kx) + B(k) sin(kz)] dk, (11.8.8) 
0 


when t = 0. 
Several important results follow by rewriting Equation 11.8.8 as 


gd): : | j | / ik (2 cule conten) de 1 : f£ysta ke yen (Ra) as en Batt ap, 


(11.8.9) 
Combining terms, 


prs 1 | Kl i ^^ #(€)feos(ké) cos(kz) + sin(k&) sin(ka)] penna (11.8.10) 
1 M 99 P o 
aos k(£ —a)]d£ e * ^ * dk. 8: 
=f [fe costtee - oase (118.11) 
Reversing the order of integration, 
u(x,t) = J HO) n confe — xe a] d£. (11.8.12) 


The inner integral is called the source function. We may compute its value through an 
integration on the complex plane; it equals 


d cos[k(£ — x)] exp( —k2a2t) dk = (ix) T |- Se] l (11.8.13) 
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if 0 « t. This gives the final form for the temperature distribution: 


indi sof FE) ep E] dé. (11.8.14) 


e Example 11.8.1 
Let us find the temperature field if the initial distribution is 
u(z,0) = iex -—, (11.8.15) 
'Then 


u(t) = = ME cx dé fo» EE ac} (11.8.16) 


oo oo ; T, z/2av/t 
m: J er a- [ eT ar| = — e" dr (11.8.17) 


7 Vm —a/2avt /2av/t 7 Vr —a/2avVt 
x /2av/t 
- E e^" dr 2 Ty et( t) (11.8.18) 
T Jo a 


where erf(-) is the error function. 


e Example 11.8.2: Kelvin's estimate of the age of the earth 


In the middle of the nineteenth century, Lord Kelvin!" estimated the age of the earth 
using the observed vertical temperature gradient at the earth's surface. He hypothesized 
that the earth was initially formed at a uniform high temperature Tọ and that its surface 
was subsequently maintained at the lower temperature of Ts. Assuming that most of the 
heat conduction occurred near the earth's surface, he reasoned that he could neglect the 
curvature of the earth, consider the earth's surface planar, and employ our one-dimensional 
heat conduction model in the vertical direction to compute the observed heat flux. 

Following Kelvin, we model the earth's surface as a flat plane with an infinitely deep 
earth below (z > 0). Initially the earth has the temperature To. Suddenly we drop the 
temperature at the surface to Ts. We wish to find the heat flux across the boundary at 
z = 0 from the earth into an infinitely deep atmosphere. 

The first step is to redefine our temperature scale v(z,t) = u(z,t) -- Ts, where v(z, t) is 
the observed temperature so that u(0,t) = 0 at the surface. Next, in order to use Equation 
11.8.14, we must define our initial state for z « 0. To maintain the temperature u(0, t) — 0, 
the initial temperature field f(z) must be an odd function, or 


u To — Ts, z 0, 
f(z) = E — Ty, z<0. (11.8.19) 


17 Thomson, W., 1863: On the secular cooling of the earth. Philos. Mag., Ser. 4, 25, 157-170. 
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From Equation 11.8.14, 


uen- s (as [E521 ae f eof- $52") a ars 


4a?nt 4a?t 


= (To — Ts) a(z) ; (11.8.21) 


following the work in the previous example. 
The heat flux q at the surface z = 0 is obtained by differentiating Equation 11.8.21 
according to Fourier’s law and evaluating the result at z = 0: 


= Ov _ K(Ts — To) 
a=- T dum 2 (11.8.22) 


The surface heat flux is infinite at t = 0 because of the sudden application of the temperature 
Ts at t = 0. After that time, the heat flux decreases with time. Consequently, the time t 
at which we have the temperature gradient Ov(0, t) /Oz is 


(To — Ts)? 


t= aa? [Bv(0, 0/82] 


(11.8.23) 


For the present near-surface thermal gradient of 25 K/km, Ty — Ts = 2000 K, and a? = 1 
mm?/s, the age of the earth from Equation 11.8.23 is 65 million years. 

Although Kelvin realized that this was a very rough estimate, his calculation showed 
that the earth had a finite age. This was in direct contradiction to the contemporary 
geological principle of uniformitarianism: that the earth's surface and upper crust had 
remained unchanged in temperature and other physical quantities for millions and millions 
of years. The resulting debate would rage throughout the latter half of the nineteenth 
century and feature such luminaries as Kelvin, Charles Darwin, Thomas Huxley, and Oliver 
Heaviside.!® Eventually Kelvin's arguments would prevail and uniformitarianism would 
fade into history. 

Today, Kelvin's estimate is of academic interest because of the discovery of radioactivity 
at the turn of the twentieth century. During the first half of the twentieth century, geologists 
assumed that the radioactivity was uniformly distributed around the globe and restricted to 
the upper few tens of kilometers of the crust. Using this model they would then use observed 
heat fluxes to compute the distribution of radioactivity within the solid earth.? Now we 
know that the interior of the earth is quite dynamic; the oceans and continents are mobile 
and interconnected according to the theory of plate tectonics. However, geophysicists still 
use measured surface heat fluxes to infer the interior?? of the earth. 


e Example 11.8.3 


So far we have shown how a simple application of separation of variables and the 
Fourier transform yields solutions to the heat equation over the semi-infinite interval (0, oo) 


18 See Burchfield, J. D., 1975: Lord Kelvin and the Age of the Earth. Science History Publ., 260 pp. 
19 See Slichter, L. B., 1941: Cooling of the earth. Bull. Geol. Soc. Am., 52, 561-600. 


20 Sclater, J. G., C. Jaupart, and D. Galson, 1980: The heat flow through oceanic and continental crust 
and the heat loss of the earth. Rev. Geophys. Space Phys., 18, 269—311. 
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via Equation 11.8.5. Can we still use this technique for more complicated versions of the 
heat equation? The answer is yes but the procedure is more complicated. We illustrate it 
by solving?! 

Ou Pu 207 u 


— 11.8.24 
ET Cap DE 0<4<w, 0<t, ( ) 
subject to the boundary conditions 
u(0,t) = f(t), lim |u(a,t)| < oo, 0<t, (11.8.25) 
lim u(x,t) 20, lim us(z,t 20, — 0«t, (11.8.26) 
and the initial condition 
u(x,0) = 0, 0 « x « oc. (11.8.27) 


We begin by multiplying Equation 11.8.24 by sin(ka) and integrating over x from 0 to 


af Utee Sin(kx) dx + a? n Use Sin(ka) dx = | uz sin(kax) da. (11.8.28) 
0 0 0 
Next, we integrate by parts. For example, 
n Use Sin(kx) dx = uz sin(kx)| — Zi Uz cos(ka) dx 11.8.29) 
0 0 0 
= -k f us cos(kx) da 11.8.30) 
0 
= —ku(x,t)cos(kx)| — ef u(x, t) sin(ka) da 11.8.31) 
0 0 
= kf(t) — k°U (k, t), 11.8.32) 
where " 
U(k,t) = n uta Saints) dis 11.8.33) 
0 


and the boundary conditions have been used to simplify Equation 11.8.29 and Equation 
11.8.31. Equation 11.8.33 is the definition of the Fourier sine transform. It and its math- 
ematical cousin, the Fourier cosine transform jo u(a,t) cos(ka) dz, are analogous to the 
half-range sine and cosine expansions that appear in solving the heat equation over the 
finite interval (0, L). The difference here is that our range runs from 0 to oo. 

Applying the same technique to the other terms, we obtain 


alk f (t) — k?U' (k,t)] + a? [kf (t) — k?U(k,t)] = U'(k,t) (11.8.34) 


with U(k,0) — 0, where the primes denote differentiation with respect to time. Solving 
Equation 11.8.34, 


a2 k? ak? ak i a2k?7 ak? a?k t a?k?r ak? 
et k°t/(1t+ak )U (k, t) E Te. [sme k?r/(1+ak?) qr |. TET [sme ker /(L+ak?) qu. 
(11.8.35) 


21 See Fetecáu, C., and J. Zierep, 2001: On a class of exact solutions of the equations of motion of a 
second grade fluid. Acta Mech., 150, 135-138. 


556 Advanced Engineering Mathematics with MATLAB 


Using integration by parts on the second integral in Equation 11.8.35, we find that 


1 2,2 2 1 n 2,2 2 
Lp- ila -a?k?t/(1+ak?) _ D (yg a? E T) Qa?) qu | 
Ulet) =; | HO - rov sp | to r 
(11.8.36) 
Because 9 [o 
u(x,t) = Ji U (k, t) sin(kx) dk, (11.8.37) 
0 
2 © si 
T tq | DR n 
T 0 k 
2 [^ sin(kx) _a?k?t/(1+ak?) 1 | Hr. sa ke Cai) 
= k € dk f(0) * soe : f (r)e dr 
(11.8.38) 
2 ?? sin(kx) —a?k?t/(1+ak?) 1 [r 2,2 2 
= Had RSE oo a k =F. a 7/(1+ak") . 
mo-z[ T. ak (0) + aa | (0e dr 
(11.8.39) 
Problems 


For Problems 1-4, find the solution of the heat equation 


Ou 20u 


8i ^ Bap —oo«m«oo, 0<t, 


subject to the stated initial conditions. 


"m — 
0, —œ <x <0, 

3 u(x, 0) = E Ü «a «b, 4. u(x, 0) = 6(z) 
0, b< r< 


oOverin, as a 1e e solution to Problem o cases involving the cooling of lava. 
Lovering?? has applied the solution to Problem 1 t involving the cooling of 1 


5. Solve the spherically symmetric equation of diffusion,?? 
Ou of@u 20u 
a D WB enc < t 
E? (Fe +i), O0<r<w, 0<t, 


22 Lovering, T. S., 1935: Theory of heat conduction applied to geological problems. Bull. Geol. Soc. 
Am., 46, 69-94. 


?3 See Shklovskii, I. S., and V. G. Kurt, 1960: Determination of atmospheric density at a height of 430 
km by means of the diffusion of sodium vapors. Am. Rocket Soc. J., 30, 662-667. 
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with u(r,0) = uo(r). 
Step 1: Assuming v(r,t) = r u(r, t), show that the problem can be recast as 


Ov ,0?v 
— = _—— < 
DI aU O0<r<w, Oct, 


with v(r, 0) = r uo(r). 


Step 2: Using Equation 11.8.14, show that the general solution is 


st) o oos | wfe] Md ap] CHE), odo 


Hint: What is the constraint on Equation 11.8.14 so that the solution remains radially 
symmetric? 


Step 3: For the initial concentration of 


No, 0 € r « ro, 
wt) = 0. To «T i 


show that 


«60 = boat BF) eer (7) m Co 627] 


where erf(-) is the error function. 


Further Readings 


Bracewell, R. N., 2000: The Fourier Transform and Its Applications. McGraw -Hill Book 
Co., 616 pp. This book presents the theory as well as a wealth of applications. 


Korner, T. W., 1988: Fourier Analyis. Cambridge University Press, 591 pp. Presents 
several interesting applications. 


Sneddon, I. N., 1995: Fourier Transforms. Dover, 542 pp. A wonderful book that illustrates 
the use of Fourier and Bessel transforms in solving a wealth of problems taken from the 
sciences and engineering. 


Titchmarch, E. C., 1948: Introduction to the Theory of Fourier Integrals. Oxford University 
Press, 391. A source book on the theory of Fourier integrals until 1950. 


Taylor & Francis 


Taylor & Francis Group 


http://taylorandfrancis.com 
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Chapter 12 
The Laplace Transform 


'The previous chapter introduced the concept of the Fourier integral. If the function is 
nonzero only when t > 0, a similar transform, the Laplace transform,! exists. It is particu- 
larly useful in solving initial-value problems involving linear, constant coefficient, ordinary, 
and partial differential equations. The present chapter develops the general properties and 
techniques of Laplace transforms. 


12.1 DEFINITION AND ELEMENTARY PROPERTIES 


Consider a function f(t) such that f(t) = 0 for t < 0. Then the Laplace integral: 


ED = F(s) = ra f(t)e^* dt (12.1.1) 


defines the Laplace transform of f(t), which we shall write £[f(t)] or F(s). The Laplace 
transform converts a function of t into a function of the transform variable s. 

Not all functions have a Laplace transform because the integral, Equation 12.1.1, may 
fail to exist. For example, the function may have infinite discontinuities. For this reason, 
f(t) = tan(t) does not have a Laplace transform. We can avoid this difficulty by requiring 
that f(t) be piece-wise continuous. That is, we can divide a finite range into a finite number 
of intervals in such a manner that f(t) is continuous inside each interval and approaches 
finite values as we approach either end of any interval from the interior. 


1 The standard reference for Laplace transforms is Doetsch, G., 1950: Handbuch der Laplace-Trans- 
formation. Band 1. Theorie der Laplace- Transformation. Birkhauser Verlag, 581 pp.; Doetsch, G., 1955: 
Handbuch der Laplace- Transformation. Band 2. Anwendungen der Laplace- Transformation. 1. Abteilung. 
Birkhauser Verlag, 433 pp.; Doetsch, G., 1956: Handbuch der Laplace- Transformation. Band 3. Anwen- 
dungen der Laplace- Transformation. 2. Abteilung. Birkháuser Verlag, 298 pp. 
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Another unacceptable function is f(t) = 1/t because the integral Equation 12.1.1 fails 
to exist. This leads to the requirement that the product t”|f(t)| is bounded near t = 0 for 
some number n « 1. 

Finally, | f (£)| cannot grow too rapidly or it could overwhelm the e~* term. To express 
this, we introduce the concept of functions of exponential order. By exponential order we 
mean that there exist some constants, M and k, for which |f(t)| € Me’ for all t > 0. 
Then, the Laplace transform of f(t) exists if s, or just the real part of s, is greater than k. 

In summary, the Laplace transform of f(t) exists, for sufficiently large s, provided f(t) 
satisfies the following conditions: 

e f (t) — 0 for t « 0, 

e f(t) is continuous or piece-wise continuous in every interval, 

e ("|f (t)| < oo as t > 0 for some number n, where n < 1, 


e e *v!|f(t)| < oo as t + oo, for some number so. The quantity so is called the abscissa 
of convergence. 


e Example 12.1.1 


Let us find the Laplace transform of 1, e^*, sin(at), cos(at), and t” from the definition 
of the Laplace transform. From Equation 12.1.1, direct integration yields 


S A 


oo —st 
L(1) -=f e * dt = — 5 =. 54 S > 0, (12.1.2) 
0 So. 9 
£ (e^t) x: gite rat x eS Ot dt (12.1.3) 
0 0 
e~ G-2t 99 1 
= = GN >0, (12.1.4) 
s—a | s-a 
oo "EL ee 
L{sin(at)] = n sin(at)e “dt = — =—|ssin(at) + a cos(at)] (12.1.5) 
0 SIRO 0 
a 
zr ELI (12.1.6) 
oo e- 5t 2 
Licos(at)] = | cos(at)e- **dt = Para [—scos(at) + asin(at)| ; (12.1.7) 
s 
Sa 30 (12.1.8) 
and 
L n\ _ TA n -std last : pem nus n! 0 12.1.9 
(t^) = s t"e *'dt = nle 2 cen. "gue s 0, (12.1.9) 


m=0 


where n is a positive integer. 
MATLAB provides the routine laplace to compute the Laplace transform for a given 
function. For example, 


The Laplace Transform 561 


>> syms anst 

>> laplace(1,t,s) 

ans = 

1/s 

>> laplace(exp(a*t),t,s) 
ans = 

1/ (s-a) 

>> laplace(sin(a*t),t,s) 
ans = 

a/(s^2*a^2) 

>> laplace(cos(a*t),t,s) 
ans = 

s/(s^2*a^2) 

>> laplace(t^5,t,s) 

ans - 

120/s^6 


'The Laplace transform inherits two important properties from its integral definition. 
First, the transform of a sum equals the sum of the transforms, or 


£|ei f (t) + eag(t)] = ee[f (0) + e£ [g(t)]. (12.1.10) 


This linearity property holds with complex numbers and functions as well. 
e Example 12.1.2 


Success with Laplace transforms often rests with the ability to manipulate a given 
transform into a form that you can invert by inspection. Consider the following examples. 


Given F(s) = 4/s?, then 


2 
F(s)=2x 3, and f(t)=20 (12.1.11) 
E 
from Equation 12.1.9. 
Given "nm j 
E E 
F(s) = = 12.1.12 
is) s241 Suec ee? ( ) 
then 
f(t) = cos(t) + 2sin(t) (12.1.13) 
by Equation 12.1.6, Equation 12.1.8, and Equation 12.1.10. 
Because i i j 
F(s) = = (12.1.14) 


s(s-1  s—-1 s 


by partial fractions, then 
f(t) 2e-1 (12.1.15) 
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by Equation 12.1.2, Equation 12.1.4, and Equation 12.1.10. 
MATLAB also provides the routine ilaplace to compute the inverse Laplace transform 
for a given function. For example, 


>> syms S t 

>> ilaplace(4/s^3,s,t) 

ans = 

2*t^2 

>> ilaplace((s42)/(s^2*1),s,t) 
ans - 

cos (t)*2*sin(t) 

>> ilaplace(1/(s*(s-1)),s,t) 
ans = 

-1+exp(t) 


The second important property deals with derivatives. Suppose f(t) is continuous and 
has a piece-wise continuous derivative f'(t). Then 


£f) is fi (the “dt = e-* F(t)| + d f (the “dt (12.1.16) 


by integration by parts. If f(t) is of exponential order, e~* f (t) tends to zero as t — oo, for 
large enough s, so that £[f'(t)] = sF(s) — f(0). Similarly, if f(t) and f'(t) are continuous, 
f" (t) is piece-wise continuous, and all three functions are of exponential order, then 


Lif” (t)] = sL[f'(t)] — (0) = s? F(s) — sf(0) — f"(0). (12.1.17) 


In general, 


Lf (0)] = s" F(s) — s" f(0) — + — sf (0) — FON) (12.1.18) 


on the assumption that f(t) and its first n— 1 derivatives are continuous, f?) (t) is piece-wise 
continuous, and all are of exponential order so that the Laplace transform exists. 
'The converse of Equation 12.1.18 is also of some importance. If 


u(t) f Fiore (12.1.19) 
then 
£[u(t)] = ru gon "ns ar| dt = — — [59 dr : + 3 f (t)e tdt, 
(12.1.20) 
and 


L£ nas ar| = F(s)/s, (12.1.21) 
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where u(0) — 0. 
Problems 


Using the definition of the Laplace transform, find the Laplace transform of the following 
functions. For Problems 1-4, check your answers using MATLAB. 


1. f(t) = cosh(at) 2. f(t) = cos?(at) 
3. f(t) = (¢+ 1)? 4 F) = (t+ 1)e-*t 
DOM x “oa oF 


Using your knowledge of the transform for 1, e^*, sin(at), cos(at), and t”, find the Laplace 
transform of 


7. f(t) = 2sin(t) — cos(2t) + cos(3) — t 8. f(t) =t— 2+ e?! — sin(5t) + cos(2). 
Find the inverse of the following transforms. Verify your result using MATLAB. 
9. F(s) = 1/(s +3) 10. F(s) mq 


11. F(s) = 1/(s? +9) 12. F(s) = (2s + 3)/(s? + 9) 


13. F(s) = 2/(s? + 1) — 15/s? + 2/(s + 1) — 6s/(s? + 4) 

14. F(s) = 3/s + 15/8? + (s + 5)/(s? + 1) — 6/(s — 2). 

15. Verify the derivative rule for Laplace transforms using the function f(t) = sin(at). 
16. Show that Li f(at)] = F (s/a) /a, where F(s) = £[f(t)]. 


17. Using the trigonometric identity sin? (x) = [1 — cos(2x)]/2, find the Laplace transform 
of f(t) = sin?[nt/ (2T))]. 


12.2 THE HEAVISIDE STEP AND DIRAC DELTA FUNCTIONS 
Change can occur abruptly. We throw a switch and electricity suddenly flows. In this 


section we introduce two functions, the Heaviside step and Dirac delta, that will give us the 
ability to construct complicated discontinuous functions to express these changes. 


| Heaviside step function 


We define the Heaviside step function as 


1, ta, 
H(t—a)-— { 0 Pg (12.2.1) 
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Largely a self-educated man, Oliver Heaviside (1850-1925) lived the life of a recluse. It was during 
his studies of the implications of Maxwell's theory of electricity and magnetism that he re-invented 
Laplace transforms. Initially rejected, it would require the work of Bromwich to justify its use. 
(Portrait courtesy of the Institution of Engineering and Technology Archives.) 


where a > 0. From this definition, 


oo —as 


£[H(t — a)] = f edt = Ê a 5»0 (12.2.2) 


a 


Note that this transform is identical to that for f(t) = 1 if a = 0. This should not surprise 
us. As pointed out earlier, the function f(t) is zero for all t < 0 by definition. Thus, 
when dealing with Laplace transforms, f(t) = 1 and H(t) are identical. Generally we will 
take 1 rather than H(t) as the inverse of 1/s. The Heaviside step function is essentially a 
bookkeeping device that gives us the ability to *switch on" and "switch off" a given function. 
For example, if we want a function f(t) to become nonzero at time t = a, we represent this 
process by the product f(t)H(t — a). On the other hand, if we only want the function to 
be “turned on" when a < t < b, the desired expression is then f(t)|H(t — a) — H(t — b)]. 
For t < a, both step functions in the brackets have the value of zero. For a « t « b, the 
first step function has the value of unity and the second step function has the value of zero, 
so that we have f(t). For t > b, both step functions equal unity so that their difference is 
Zero. 


e Example 12.2.1 


Quite often we need to express the graphical representation of a function by a math- 
ematical equation. We can conveniently do this through the use of step functions in a 
two-step procedure. The following example illustrates this procedure. 
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Figure 12.2.1: Graphical representation of Equation 12.2.5. 


Consider Figure 12.2.1. We would like to express this graph in terms of Heaviside 
step functions. We begin by introducing step functions at each point where there is a kink 
(discontinuity in the first derivative) or jump in the graph - in the present case at t = 0, 
t = 1, t = 2, and t = 3. These are the points of abrupt change. Thus, 


f(t) = ao(£) H(t) + as (t) H(t — 1) + ax(£) H(t — 2) + as(t)H (t — 3), (12.2.3) 


where the coefficients ao(t), a1(t), ... are yet to be determined. Proceeding from left to right 
in Figure 12.2.1, the coefficient of each step function equals the mathematical expression 
that we want after the kink or jump minus the expression before the kink or jump. As 
each Heaviside turns on, we need to add in the new t behavior and subtract out the old t 
behavior. Thus, in the present example, 


f(t) 2 (t—0)H(t) -(-0)H(t—1)- [(38—-t) - 11H(t—2) +[0- (3—0]H(t —3) (12.2.4) 


< f(t) =tH(t)— (t - 1)H(t - 1) - (t -2)H(t — 2) + (t - 3H (t — 3). (12.2.5) 


We can easily find the Laplace transform of Equation 12.2.5 by the “second shifting” theorem 
introduced in the next section. 


e Example 12.2.2 


Laplace transforms are particularly useful in solving initial-value problems involving 
linear, constant coefficient, ordinary differential equations where the nonhomogeneous term 
is discontinuous. As we shall show in the next section, we must first rewrite the nonhomo- 
geneous term using the Heaviside step function before we can use Laplace transforms. For 
example, given the nonhomogeneous ordinary differential equation: 


j j -J6 0<t<1 
y +3y e3- l8 1t (12.2.6) 


we can rewrite the right side of Equation 12.2.6 as 


y" + 3y +2y - t C tH(t- 1) 2 t — (t- 1)H(t — 1) - H(t — 1). (12.2.7) 


In Section 12.8 we will show how to solve this type of ordinary differential equation using 
Laplace transforms. 
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The Laplace Transforms of Some Commonly Encountered Functions 


f(t), t>0 F(s) 
1 
1. 1 = 
E 
1 
2 gru m 
sca 
1 —at 1 
3 p (1 € ) s(s EN. 
1 
4. .1 (g—bt | 5—at 
ale a (s + a)(s + b) 
E 
5. ga (0e ^ — ae *!) Emmere 
6. sin(at) s : s 
s?-Fa 
E 
T. cos(at) 2. qe 
s2 4+ 
8. sinh(at) 3 2 ; 
s?—a 
9. cosh(at) 5 B 3 
s?—a 
; 2as 
10. tsin(at) (Sra 
43 
11. 1 — cos(at) a2 + a) 
a? 
12. at — sin(at) ss + a) 
s? — a? 
13. tcos(at) (e2 F a7} 
2a3 
14. sin(at) — at cos(at) " i: DE 
s2 +a 
2 
15. tsinh(at) c La 
s?—a 
dpa 
16. t cosh(at) ^ 2d ay 
s?—a 
. 2a 
17. at cosh(at) — sinh(at) ae 
s?—a 
18. e^" sin(at) y 3 
s+b) +a 
b 
19. €^" cos(at) s B 
s+b) +a 
3,2 
20. (1 + a?t?) sin(at) — at cos(at) = 
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The Laplace Transforms of Some Commonly Encountered Functions (Continued) 


f(t), t» 0 F(s) 
4a? 
21. sin(at) cosh(at) — cos(at) sinh(at) 
st + 4a^ 
2a?s 
22. in(at) sinh(at 
sin(at) sinh(at) s4 1 4a4 
2 3 
23. sinh(at) — sin(at) "m = aa 
2 2 
24. cosh(at) — cos(at) m - — 
sin(at) — bsin(bt : 
"P -— 
SRI (s? + a?)(s? + 02) 
26. PSHE ORE) ee — 
ab? — a2) (s? + a?)(s? + 0?) 
2. cos(at) — cos(bt) à zp : 
D2 — a2 (s? + a?) (s2 + b?) 
. n! 
28. tn > 0 snal 
toleat 1 
29. aani 
9 men n (s +a)” 
(n es 1) — at —2 —at S 
: pog eue (stay 
! 
1. t? —at > =e 
3 € ,n 20 (s+a)rtt 
9nqn— (1/2) 
» >1 —[n+(1/2)] 
3 1985553 Dye = : 
1 
33. Jo(at) Js g a? 
1 
34. Io(at) Ja 


1 


1 
35. S erf(Vat ) =a 


1 "x 
36. a + Va erf(Vat ) 
37 : atorfe(a vt) : 
. —— — ae" "erfc(a 
vint at+J/s 
1 
“erfe(Vat 
38 e“ erfe(V/at ) RUM 
l 
39. (ec — eat) s—a—ws—b 


2 nt3 
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The Laplace Transforms of Some Commonly Encountered Functions (Continued) 


f(t), t>0 F(s) 
i E E 
40. —— + ae” terf(avt) ve 5 
at s—a 
1 E 
41. ——e™ (1 + 2at 
a ( is (s—a)/s—a 
l 2 1 
42. —e* terf(avt 
"ali: (avt) E 
43. L galt a >0 e72vas 
Tt? 
E aji l —2yas 
44. —e a20 € 


2 —ays 
46 2] Lew(- 5) —a erfe( =) ,a20 E 
t Wet sys 
be? S 
b*t+ab a > 

47 erf (bv/t + 5 =) + erfe( 4) a > 0 NUR 

48 ab b te (ot^ zx) ; e^ li 
dbi 2/0/07 Vs (b+ vs) 


Notes: 


Error function: erf(x er dy 
^ Va aA 


Complementary error function: erfc(z) = 1— erf (x) 


| Dirac delta function 


The second special function is the Dirac delta function or impulse function. We define 


it by 
5(t—a) = E. f p n E (12.2.8) 


where a > 0. 
A popular way of visualizing the delta function is as a very narrow rectangular pulse: 


-= im { lle | 0«[t—a| < e/2, (12.2.9) 


0, [t — a| > e/2, 


where € > 0 is some small number and a > 0. See Figure 12.2.2. This pulse has a width 
€, height 1/e, and its center at t — a so that its area is unity. Now, as this pulse shrinks 
in width (e — 0), its height increases so that it remains centered at t = a and its area 
equals unity. If we continue this process, always keeping the area unity and the pulse 
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A == E d 


S(t) 


1/€ 


Figure 12.2.2: The Dirac delta function. 


symmetric about t = a, eventually we obtain an extremely narrow, very large amplitude 
pulse at t = a. If we proceed to the limit, where the width approaches zero and the height 
approaches infinity (but still with unit area), we obtain the delta function ó(t — a). 

'The delta function was introduced earlier during our study of Fourier transforms. So 
what is the difference between the delta function introduced then and the delta function 
now? Simply put, the delta function can now only be used on the interval [0, oo). Outside 
of that, we shall use it very much as we did with Fourier transforms. 

Using Equation 12.2.9, the Laplace transform of the delta function is 


oo 1 ate/2 
£[o(t —a)] = Í ó(t — a)e-*'dt = lim — a * di (12.2.10) 
0 e>0 € a—c/2 
: 1 astes/2 as—es/2 
=lim —[e% —e (12.2.11) 
c0 ES 
e 9$ es | eg es es 
= li 14 | -—14 Har 12.2.12 
PET ES ( 2 8 T 2 8 ) 
= e748 (12.2.13) 


In the special case when a = 0, £[5(t)] = 1, a property that we will use in Section 12.9. 
Note that this is exactly the result that we obtained for the Fourier transform of the delta 
function. 

If we integrate the impulse function, 


t 
0, t<a, 
f ó(r—a)dr = { L isum (12.2.14) 


according to whether the impulse does or does not come within the range of integration. 


'This integral gives a result that is precisely the definition of the Heaviside step function, so 
that we can rewrite Equation 12.2.14: 


f 9e- 94r ne-a. (12.2.15) 
0 


570 Advanced Engineering Mathematics with MATLAB 


Consequently, the delta function behaves like the derivative of the step function, or 


a [o = ó(t — a). (12.2.16) 


Because the conventional derivative does not exist at a point of discontinuity, we can only 
make sense of Equation 12.2.16 if we extend the definition of the derivative. Here we 
extended the definition formally, but a richer and deeper understanding arises from the 
theory of generalized functions.? 


e Example 12.2.3 


Let us find the (generalized) derivative of 


f(t) = 3€? [H(t) - H(t — 1)]. (12.2.17) 
Proceeding formally, 
f'(t) = ec[H (t) — H(t — 1)] + 32? JEt) — 6(¢ — 1)] (12.2.18) 
= 6t [H (t) — H(t — 1)] + 0 — 36(t — 1) (12.2.19) 
6t (A(t) — H(t — 1)] — 35(t — 1), (12.2.20) 


because f(t)ó(t — to) = f(to)d(t — to). 


e Example 12.2.4 


MATLAB also includes the step and Dirac delta functions among its intrinsic functions. 
'There are two types of step functions. In symbolic calculations, the function is Heaviside 
while stepfunction is used in numerical calculations. For example, the Laplace transform 
of Equation 12.2.5 is 


>>syms s,t 
»»laplace(?t-(t-1)*Heaviside(t-1)-(t-2)*Heaviside(t-2)'... 
?+(t-3) *Heaviside(t-3)’,t,s) 
ans = 
1/s*2-exp(-s)/s*2-exp(-2*s)/s*2+exp(-3*s)/s*2 
In a similar manner, the symbolic function for the Dirac delta function is Dirac. There- 
fore, the Laplace transform of (t — 1)ó(t — 2) is 


>>syms s,t 
»»laplace(?(t-1)*Dirac(t-2)?,t,s) 
ans = 

exp (-2*s) 


? The generalization of the definition of a function so that it can express in a mathematically correct 
form such idealized concepts as the density of a material point, a point charge or point dipole, the space 
charge of a simple or double layer, the intensity of an instantaneous source, etc. 
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Sketch the following functions and express them in terms of the Heaviside step functions: 


0, 0<t<2 " 
tio - de-a 2«t«3 2 4er 
0, 3«t Ps 


0«ct«a 
a«t« 2a 
2a < t < 3a 
3a «t 


Rewrite the following nonhomogeneous ordinary differential equations using the Heaviside 


step functions: 


gw ew-[t D ra Ly dy 45 
5. y" + 4y' + 4y = bs uh : 6. y" + 3y + 2y 
T. y-a +2y={ h o 8. y" — 3y' + 2y 
9. y” +y = ids ee ~ 10. y” 4- 3 + 2y 


12.3 SOME USEFUL THEOREMS 


, 0<t<4 
, 4<t 
[0 O0O<t<1 
e, 1l<t 
10, 0«t«1 
10, 1«t 
E t, 0<t<a 
~ Jato), a<t 


Although at first sight there would appear to be a bewildering number of transforms 
to either memorize or tabulate, there are several useful theorems, that can extend the 


applicability of a given transform. 


| First shifting theorem 


Consider the transform of the function e~“ f(t), where a is 
definition, 


—st ,—at 
sto-a 


L [et fE] = 1 E 


or 
L [e ^*f(t)] = F(s + a). 


na f © etto FCE) dt, 


any real number. Then, by 


(12.3.1) 


(12.3.2) 


That is, if F(s) is the transform of f(t) and a is a constant, then F(s + a) is the transform 


of e~ f (t). 


e Example 12.3.1 


Let us find the Laplace transform of f (t) = e7% sin(bt). Because the Laplace transform 


of sin(bt) is b/(s? + b?), 
b 


L [e^ sin(bt)] = E 


(12.3.3) 
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where we simply replaced s by s + a in the transform for sin(bt). 
e Example 12.3.2 


Let us find the inverse of the Laplace transform 


F(s) = a. (12.3.4) 
Rearranging terms, 
F(s) = Eu __ 5+2 — sc3 1 2/2 (12.3.5) 
s?+6s+1 (s+3)?-—8 (s+3)?-8 2\/2(s+3)2-8 
Immediately, from the first shifting theorem, 
f(t) = e7% cosh(2V2t) — sinh (2V2 t). (12.3.6) 


2 


| Second shifting theorem 


The second shifting theorem states that if F(s) is the transform of f(t), then e^ 5 F(s) 
is the transform of f(t — b) H (t — b), where b is real and positive. To show this, consider the 
Laplace transform of f(t — b) H(t — b). Then, from the definition, 


E(f(t — b) H(t - b)] = E: f(t — b) H (t — b)e~* dt (12.3.7) 


d f(t — b)e * dt = d e 8 eT f(x) dg (12.3.8) 
=e my € ** f(x) dx (12.3.9) 


Or 


Lif (t — b) H(t — b)] =e" 5 F(s), (12.3.10) 


where we set x = t — b. This theorem is of fundamental importance because it allows us to 
write down the transforms for “delayed” time functions. That is, functions that “turn on” 
b units after the initial time. 


e Example 12.3.3 


Let us find the inverse of the transform (1 — e~*)/s. Since 


= (12.3.11) 


p G - =) copet G) -ya (=) Cog cec) (12.3.12) 


because C 1 (1/s) = f(t) = 1, and f(t — 1) 2 1. 
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e Example 12.3.4 


Let us find the Laplace transform of f(t) = (t? — 1)H (t — 1). 
We begin by noting that 


(? — 14H(t — 1) — [(£— 1 - 1? — 1]H(t — 1) (12.3.13) 
= [t - 1? + 2(t - 1)|H (t — 1) (12.3.14) 
= (t - 1? H(t — 1) + 2(t - 1)H(t — 1). (12.3.15) 


A direct application of the second shifting theorem then leads to 


cle? -18(- 1] = 2€ 26 7. 


s? E 


(12.3.16) 


e Example 12.3.5 


In Example 12.2.2 we discussed the use of Laplace transforms in solving ordinary 
differential equations. One further step along the road consists of finding Y (s) = Z£[y(t)]. 
Now that we have the second shifting theorem, let us do this. 

Continuing Example 12.2.2 with y(0) — 0 and y'(0) — 1, let us take the Laplace 
transform of Equation 12.2.7. Employing the second shifting theorem and Equation 12.1.18, 
we find that 


s?Y (s) — sy(0) — y’(0) + 3sY (s) — 3y(0) + 2Y (s) = Pb e (12.3.17) 
Substituting in the initial conditions and solving for Y (s), we finally obtain 
1 1 =g d = 
Y (s) = i E e (12.3.18) 


(s+2)(st+1)  s?(s+2)(s+1) ^ s2(s+2)(s +1) % s(s + 2)(s +1) 


Laplace transform of t" f(t) 


In addition to the shifting theorems, there are two other particularly useful theorems 
that involve the derivative and integral of the transform F(s). For example, if we write 


F(s) = Lf] = f f(t)e-*'dt (12.3.19) 
and differentiate with respect to s, then 
F'(s) = l —tf(t)e-*'dt = -Lit fC). (12.3.20) 
0 


In general, we have that 
F(? (s) = (DLE f(t)]. (12.3.21) 
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Laplace transform of f (t)/t 


Consider the following integration of the Laplace transform F(s): 


[^ aec f n f "al dz. (12.3.22) 


Upon interchanging the order of integration, we find that 
a-[ 5 FO stay, 


1 Foe F(t) te eas] dé = -f fa 


(12.3.23) 
Therefore, 
f F(z)dz =£ £] ; (12.3.24) 
e Example 12.3.6 
Let us find the transform of tsin(at). From Equation 12.3.20, 
. d : d a 2as 
Lit sin(at)] = EX xb E 5 | = (GP ay? (12.3.25) 


e Example 12.3.7 


Let us find the transform of [1 — cos(at)]/t. To solve this problem, we apply Equation 


12.3.24 and find that 
1— (at) cae? 
pease) EE £[l—cos(at]| dz = f ze. pda (12.3.26) 
b= 5 z 2 +a? 


t 
= In(z) — 4 In(22 + a?) j = in( 5) (12.3.27) 
= In(1) In( a) = (LX). (12.3.28) 


| Initial-value theorem 


Let f(t) and f’(t) possess Laplace transforms. Then, from the definition of the Laplace 
transform, 


i Fe dt = sF(s) — f(0). (12.3.29) 


Because s is a parameter in Equation 12.3.29 and the existence of the integral is implied 
by the derivative rule, we can let s — oo before we integrate. In that case, the left side of 
Equation 12.3.29 vanishes to zero, which leads to 


jim, sF(s) = f(0). (12.3.30) 


This is the initial-value theorem. 
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e Example 12.3.8 


Let us verify the initial-value theorem using f(t) = e°*. Because F(s) = 1/(s — 3), 
lims-yoo s/(s — 3) = 1. This agrees with f(0) = 1. 


In the common case when the Laplace transform is ratio to two polynomials, we can use 
MATLAB to find the initial value. This consists of two steps. First, we construct sF(s) by 
creating vectors that describe the numerator and denominator of sF (s) and then evaluate 
the numerator and denominator using very large values of s. For example, in the previous 
example, 
>>num = [1 0]; 
>>den = [1 -3]; 
>>initialvalue = polyval(num,1e20) / polyval (den, 1e20) 
initialvalue = 

1 


| Final-value theorem 


Let f(t) and f’(t) possess Laplace transforms. Then, in the limit of s > 0, Equation 
12.3.29 becomes 


i—oo i—oo 


[a f'(t) dt = lim | f'(r)dr = lim f(t) — f(0) = lim sF(s) — f(0). (12.3.31) 
0 0 


Because f (0) is not a function of t or s, the quantity f(0) cancels from Equation 12.3.31, 
leaving 
lim f(t) = lim sF(s). (12.3.32) 


t—oo s—0 


Equation 12.3.32 is the final-value theorem. It should be noted that this theorem assumes 
that limy+. f(t) exists. For example, it does not apply to sinusoidal functions. Thus, we 
must restrict ourselves to Laplace transforms that have singularities in the left half of the 
s-plane unless they occur at the origin. 

In the case when f(t) is a periodic function with a period T, Gluskin? showed that 


T 
lim sF(s) = 7 f f) dt. (12.3.33) 


e Example 12.3.9 
Let us verify the final-value theorem using f(t) = t. Because F(s) = 1/s?, 
lim sF(s) = lim 1/s = oc. (12.3.34) 
s—0 s—0 


The limit of f(t) as t > oo is also undefined. 


? Gluskin, E., 2003: Let us teach this generalization of the final-value theorem. Eur. J. Phys., 24, 
591—597 


576 Advanced Engineering Mathematics with MATLAB 


Just as we can use MATLAB to find the initial value of a Laplace transform in the case 
when F(s) is a ratio of two polynomials, we can do the same here for the final value. Again 
we define vectors num and den that give sF (s) and then evaluate them at s = 0. Using the 
previous example, the MATLAB commands are: 


>>num = [0 1 0]; 
>>den = [1 0 0]; 
>>finalvalue = polyval(num,0) / polyval(den,0) 
Warning: Divide by zero. 
finalvalue = 
NaN 


This agrees with the result from a hand calculation and shows what happens when the 
denominator has a zero. 


e Example 12.3.10 


Looking ahead, we will shortly need to find the Laplace transform of y(t), which is 
defined by a differential equation. For example, we will want Y(s) where y(t) is governed 
by 

y" + 2y' + 2y = cos(t) + 6(t — 7/2), y(0) = y'(0) = 0. (12.3.35) 


Applying Laplace transforms to both sides of Equation 12.3.35, we have that 
L(y”) + 2£(y) + 2£(y) = £[cos(t)] + L[5(t — 7/2), (12.3.36) 


or 


E 
s2-41 


s?Y (s) — sy(0) — y' (0) + 25Y (s) — 2y(0) + 2Y (s) = dp mds (12.3.37) 


Substituting for y(0) and y'(0) and solving for Y (s), we find that 


—sn/2 

s e 

Y (s) = | $ 12.3.3 
(s) (s2 + 1)(s2 +2s +2) s2?+2s+2 ( 8) 


Presently this is as far as we can go. 
How would we use MATLAB to find Y (s)? The following MATLAB script shows you 
how: 


clear 

^ define symbolic variables 

syms pi s t Y 

^4 take Laplace transform of left side of differential equation 

LHS = laplace (diff (diff (sym y(t)?)))42*diff(symC y(t)?)) ... 
*2*sym C y (t)?)); 

^ take Laplace transform of right side of differential equation 

RHS = laplace(cos(t)+’Dirac(t-pi/2)’,t,s); 

^ set Y for Laplace transform of y 

% and introduce initial conditions 

newLHS = subs(LHS,’laplace(y(t),t,s)’,’y(0)’,’D(y) (00? ,Y,0,0) ; 

^ solve for Y 

Y = solve(newLHS-RBHS,Y) 


It yields 
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y= 
(s*exp(-1/2*pi*s)*s^2*exp(-1/2*pi*s))/(s^443*s^242*s^342*s42) 


Problems 


Find the Laplace transform of the following functions and then check your work using 
MATLAB. 

1. f(t) = e * sin(2t) 2. f(t) = e?! cos(2t) 
3. f(t) 2 2 H(t — 1) 4. f(t) 2 e H(t — 3) 
5. f(t) = tet + sin(3t)e* + cos(5t)e?! 6. f(t) = tte% + sin(3t)e* + cos(At)e?! 


7. f(t) = tet + sin(2t)e* + cos(3t)e *' 8. f(t) =PH(t—1)+ e H(t — 2) 


9. f(t) = (? + 2)H(t — 1) + H(t - 2) 10. f(t) = (t+ 1? H(t — 1) + H(t- 2) 
ts ias on 12. f(t) = T E 


13. f(t) = te™* sin(2t) 


Find the inverse of the following Laplace transforms by hand and using MATLAB: 


14. F()-5 Tyi 15. F(s) = E 

16. F(s) = a = 13 17. F(s) = E HE 2 

ESUE (s 1? s x 2 uS (s : 2? s x 2 

MOS (s r 2p- 2 LE 5 apos = 

22. F(s) = ot 23. F(s) = IS 

24. F(s) — allis 25. F(s) — — m 

26. F(s) = i ! NE 27. F(s) = ee | z 

28. Find the Laplace transform of f(t) = tet [H (t—1)— H (t—2)] by using (a) the definition of 


the Laplace transform, and (b) a joint application of the first and second shifting theorems. 


29. Write the function 
0<t<a, 


a<t, 


in terms of Heaviside’s step functions. Then find its transform using (a) the definition of 
the Laplace transform, and (b) the second shifting theorem. 
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In Problems 30-33, write the function f(t) in terms of the Heaviside step functions and then 
find its transform using the second shifting theorem. Check your answer using MATLAB. 


t, 0<t<1 
E Marez asosi 1<t<2 
0, 22% 0, M 
<t< 
&  O<t<2 E es 
32. f(t)=4 4-t, 2<t<4 33. f(t) = i Sts 
1, 2<t<3 
0, 4<t 
0, 3«t 


Find Y (s) for the following ordinary differential equations and then use MATLAB to check 
your work. 


34. y" + 3y' + 2y = H(t — 1); y(0) = y'(0) =0 
35. y” +4y=3H(t—4); — y(0)— 1, y' (00-0 


36. y" + 4y' + Ay = tH (t — 2); y(0) =0, y'(0) 22 


37. y" + 3y' +2y = et H (t — 1); y(0) = y'(0) =0 


38. y” —3y' + 2y - e 'H(t-2; — y(0)—2, y'(0) 20 


39. y" — 3y' + 2y = P H(t — 1); y(0) — 0, y/(0) 25 


40. y" + y = sin(t)[1 — H(t — 7)}; y(0) = y'(0) =0 


41. y" + 3y' + 2y =t + [ae- 6-9 — t| H(t — a); y(0) = y'(0) = 0. 


For each of the following functions, find its value at t = 0. Then check your answer using 
the initial-value theorem by hand and using MATLAB. 


42. f(t) 2 t 43. f(t) = cos(at) 
44. f(t) = tet 45. f(t) — e' sin(3t) 
For each of the following Laplace transforms, state whether you can or cannot apply the 


final-value theorem. If you can, find the final value by hand and using MATLAB. Check 
your result by finding the inverse and finding the limit as t > oo. 


1 1 
1 E 
48. F(s) - —1 49. F() - 34 
2 2 
50. F()) = — r3 8L. F()= 3 — 3 
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52. Using the fact that 


g 
e *21- cf e "dn, 
0 


1. asx 1 | LA bn d 
X = x 
s^ PV s+b E o stb Par) | 


if x > 0. Therefore, using the fact? that 


show? that 


v/2 
gem = (<) 1, (2vat ) ; R(v) » —1, 
a 
and the first shifting theorem, show 


1 ax 
£-*|— exp[ — isis =1— p e^" Io (2 26n ) dn, 
E s+b 0 


where J,,(-) is a modified Bessel function of the first kind and order v introduced in Section 
6.5. 


12.4 THE LAPLACE TRANSFORM OF A PERIODIC FUNCTION 


Periodic functions frequently occur in engineering problems and we shall now show how 
to calculate their transform. They possess the property that f(t-- T) = f(t) for t > 0 and 
equal zero for t < 0, where T is the period of the function. 

For convenience, let us define a function x(t) that equals zero except over the interval 
(0, T) where it equals f(t): 


a(t) = { d os k * (12.4.1) 


By definition, 


oo T 2T (k+1)T 
F(s) = | Je “d= J f(te tdt «f f(t)e^*dt 4 --- «f fe dt 4---.. 
0 0 T kT 
(12.4.2) 
Now let z = t — kT, where k = 0,1,2,..., in the kth integral and F(s) becomes 


T T T 
F(s) =| f(z)e ** as | fee pee) ae f fete ie 9er aa scs. 
0 0 0 


(12.4.3) 
However, 


u(z) = f(z) = f(z +T) =... = f(z + kT) =... (12.4.4) 


because the range of integration in each integral is from 0 to T. Thus, F(s) becomes 


T T T 
F(s) = J z(z)e ** dete” f z(z)e ?* dz 4--- ehm f zx(z)e ?* dz---- (12.4.5) 
0 0 0 


4 Liaw, C. H., J. S. P. Wang, R. A. Greenkorn, and K. C. Chao, 1979: Kinetics of fixed-bed absorption: 
A new solution. AICHE J., 25, 376-381. 


5 Watson, E. J., 1981: Laplace Transforms and Applications. Van Nostrand Reinhold Co., p. 195. 
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Or 
F(s) £z (1 4 e 5T + e 2st Haami e EST Tee )X(s). (12.4.6) 


The first term on the right side of Equation 12.4.6 is a geometric series with common ratio 
es? Tf |e-57| < 1, then the series converges and 


X (s) l (12.4.7) 


e Example 12.4.1 


Let us find the Laplace transform of the square wave with period T: 


ft) = ra sop (12.4.8) 


By definition x(t) is 


h, 0<t<T/2, 
zs(!ü)-14-h T/2<t<T, (12.4.9) 
0, I x 
Then 
oo T/2 T 
X(s) = » z(t)e * dt = f he™* dt + pr (—h) e~ * dt (12.4.10) 
2 
— h —sT/2 -sT \ _ h —sT/2 2 
== (1-26 +e )-i(-e ) (12.4.11) 
and 


(peste) hase?) 
s(l—e-sT) ^ s(1-e-sT/2) 


If we multiply numerator and denominator by exp(sT'/4) and recall that tanh(u) = (e" — 
e ")/(e" -- e^"), we have that 


F(s)= 


(12.4.12) 


F(s)— I) ; (12.4.13) 


e Example 12.4.2 


Let us find the Laplace transform of the periodic function 


sin(2Qnt/T), 0<t<T/2, 
f(t) -l ( ; /T) ae oe (12.4.14) 
By definition x(t) is 
sin(Qnt/T),  0<t<T/2, 
x(t) ={ ( i /T) oc (12.4.15) 
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'Then A 
. f 2at\ at 2nT —sTJ2 
Hence, 
X(s) QnT 14-6577? 25 T 1 
F(s)= = = ; 12.4.17 
e) Toe FT 272 44? [eT 7 gh agi" qp-gm ( ) 
Problems 


Find the Laplace transform for the following periodic functions: 


Lf()-sm(), 0tzm f) = f+) 

Oe ccr. quoc 

Rel 0s f) = fü + 2a) 
1, 0<t<a, 

4f0-71 A f= FE +40) 
0, 3a « t « 4a, 


12.5 INVERSION BY PARTIAL FRACTIONS: HEAVISIDE'S EXPANSION THEOREM 


In the previous sections, we devoted our efforts to calculating the Laplace transform 
of a given function. Obviously, we must have a method for going the other way. Given a 
transform, we must find the corresponding function. This is often a very formidable task. 
In the next few sections we shall present some general techniques for the inversion of a 
Laplace transform. 

The first technique involves transforms that we can express as the ratio of two poly- 
nomials: F(s) — q(s)/p(s). We shall assume that the order of q(s) is less than p(s) and 
we have divided out any common factor between them. In principle we know that p(s) has 
n zeros, where n is the order of the p(s) polynomial. Some of the zeros may be complex, 
some of them may be real, and some of them may be duplicates of other zeros. In the case 
when p(s) has n simple zeros (nonrepeating roots), a simple method exists for inverting the 
transform. 

We want to rewrite F(s) in the form: 


a CREE) 
5—5 p(s) 


F(s) = (12.5.1) 


T T 
S— S1  $— $9 


where $1,52,...,5, are the n simple zeros of p(s). We now multiply both sides of Equation 
12.5.1 by s — s, so that 


(12.5.2) 
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If we set s = s,, the right side of Equation 12.5.2 becomes simply a,. The left side takes 
the form 0/0 and there are two cases. If p(s) = (s — s1)g(s), then a1 = q(si1)/9g(s1). If we 
cannot explicitly factor out s — s1, l'Hópital's rule gives 


= 2 1 
gi 9000]. Su ie OU S d) (12.5.3) 
son p(s) s>sı p' (s) p' (1) 

In a similar manner, we can compute all of the coefficients az, where k = 1,2,..., n. 
Therefore, 

£X PG = L7! Fal cope ( = eM m ) (12.5.4) 

p(s) 8—81  8— 89 8— Sn 
= ae?! + age??? +--+ a, ent, (12.5.5) 


This is Heaviside's expansion theorem, applicable when p(s) has only simple poles. 


e Example 12.5.1 


Let us invert the transform s/((s +2)(s? -- 1)]. It has three simple poles at s = —2 and 
s = +i. From our earlier discussion, q(s) = s, p(s) = (s--2)(s?-- 1), and p'(s) = 3s? -4s-- 1. 


Therefore, 
ge : — MÀ + : et 4 it (125 6) 
(s+2)(s2+1)| 12=8+1 -3+ 4i+1 -3 —4i+1 » 

2 —2t i it i —it 

= j- f : 12.5.7 
5^ -U dod ) 
2 a —2—4i a .—2+4i a 

Ed: i i 12.5.8 
5° "4416 IFI ) 
NOU 2 

Ee M sin(t) + 5 cos(t), 12.5.9) 


where we used sin(t) = 


e Example 12.5.2 


— e*t), and cos(t 


) = i(éft + e~*). 


Let us invert the transform 1/[(s — 1)(s — 2)(s — 3)]. There are three simple poles at 
sı = 1, s2 = 2, and s3 = 3. In this case, the easiest method for computing a1, a2, and a3 is 


ET ed -— 12.5.10) 
"eH G= (Gc) 2 bad 
s—2 
Ta teen (oe = De =D) um 
and 3 1 
s 
— ]i = 12.5.12 
s= Bs (¢—1)(s—2)(s—3) 2 
Therefore, 
1 a a a 
-1 Lord en aN cae S_| = Let e+ let, (12.5.18 
s A E] s1 aatan IS +a 
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Note that for inverting transforms of the form F(s)e ** with a > 0, you should use 
Heaviside's expansion theorem to first invert F(s) and then apply the second shifting the- 
orem. 

Let us now find the expansion when we have multiple roots, namely 

d(s) d(s) 


F(s) = 38) e are ee e EE (12.5.14) 


where the order of the denominator, m; + mo + --: + m4, is greater than that for the 
numerator. Once again we eliminated any common factor between the numerator and 
denominator. Now we can write F(s) as 


F(s) 2 V7 by cH ME (12.5.15) 


Multiplying Equation 12.5.15 by (s — s,)™*, 


(s — sx)"^*q(s) 
p(s) 


= akı + ak2(s — Sk) +: + akm, (s — sx) 71 


*(s— 8%)" l TE ciu. Ree (12.5.16) 
s — 8,)™ 8.—.85 


where we grouped together into the square-bracketed term all of the terms except for those 
with ax; coefficients. Taking the limit as s — sy, 


- Mk 
grai E as) 


Jim S (12.5.17) 


Let us now take the derivative of Equation 12.5.16, 


d [£ — sk)™*q(s) 
ds p(s) 


| = aya + 2aga(s — Sk) +--+ + (my — l)agm, (s — gy) s? 


eem i MEE EN m. [t (12.5.18) 


ds s — 8,)™ S — Sn, 


Taking the limit as s > sx, 


ox d [(s — sx)™q(s) 
ak2 = Jim d | $5) : (12.5.19) 
In general, 
1 d/^7 [(s— sk)"*q(s) 
;— li - 12.5.20 
aj = S, — T) deii | »s | ROT 
and by direct inversion, 
m AS akj Mk—j Skt 


k=1 j=1 
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e Example 12.5.3 


Let us find the inverse of 


E 
F(s) = ——————. 12.5.22 
(5) = Gx3y (1-1) ( ) 
We first note that the denominator has simple zeros at s = +i and a repeated root at 
s = —2. Therefore, 
F(s) 2 d d + = 12.5.23) 
og sti qd (s+ 2)?’ n 
where 
_}; ; — 1 
A- lim (s — i)F(s) = eg: 12.5.24) 
B= lim (s+ i)F(s) = da, 12.5.25) 
. d 2 uu" d E 3 
C= Jim, ae +2) ro) = Jim, ds (a a ;) ——34 12.5.26) 
and 
D= lim, (s+ 2? F(s) = -2. 12.5.27) 
s—>— 
Thus, 


f(t) = giae” + golge” — Det — 2te?* = 3 cos(t) + 3: sin(t) - $e? — Bte™. 
(12.5.28) 


In Section 12.9 we shall see that we can invert transforms just as easily with the residue 
theorem. Let us now find the inverse of 
cs + (ca — wd) cs + (ca — wd) 


Fre (sta+w2  (s-a-wi)(s- ad wi) VEN) 


by Heaviside's expansion theorem. Then 


c 4- di c — di NEN SL. d2 c9! n vc! 4 d2e- 8i 


= . — (12.5.30 
2(s--a—wi) 2(s+a+wi) 2(s-ca-—wi) 2(s+a+wi) ( ) 


F(s) = 


where 0 = tan !(d/c). Note that we must choose 0 so that it gives the correct sign for c 
and d. 
Taking the inverse of Equation 12.5.30, 


f(t) = 1/ + Pe atte 4 iye 4 reat wt — /e2 + d?e-t cos(wt + 9). 
(12.5.31) 
Equation 12.5.31 is the amplitude/phase form of the inverse of Equation 12.5.29. It is 
particularly popular with electrical engineers. 


e Example 12.5.4 


Let us express the inverse of 


8s—3 


> 12.5.32 
s2 + 4s +13 ( ) 


F(s) = 
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in the amplitude/phase form. 
Starting with 


= 4+ 19i 4 — 19i 
F(s)— imi M RE a (12.5.33) 
(s+2—3i)(s+2+3i) s+2-3i s+2+3i 
1017¢38:3675°% 10176 38.3675*i 
9 017e | 5.1017e =. (12.5.34) 
s+2— 31 s+2+ 32 
or 
f(t) = 5.101 7e 7 2t + 3it+38.3675° i d 5.1017e- 2t-31t—38.3675^i (12.5.35) 
= 10.2034e ?' cos(3t + 38.3675? ). (12.5.36) 


e Example 12.5.5: The design of film projectors 


For our final example we anticipate future work. The primary use of Laplace transforms 
is the solution of differential equations. In this example we illustrate this technique that 
includes Heaviside's expansion theorem in the form of amplitude and phase. 

This problem? arose in the design of projectors for motion pictures. An early problem 
was ensuring that the speed at which the film passed the electric eye remained essentially 
constant; otherwise, a frequency modulation of the reproduced sound resulted. Figure 
12.5.1(A) shows a diagram of the projector. Many will remember this design from their 
days as a school projectionist. In this section we shall show that this particular design filters 
out variations in the film speed caused by irregularities either in the driving-gear trains or 
in the engagement of the sprocket teeth with the holes in the film. 

Let us now focus on the film head - a hollow drum of small moment of inertia Jı. See 
Figure 12.5.1(B). Within it there is a concentric inner flywheel of moment of inertia J5, 
where Jz > Ji. The remainder of the space within the drum is filled with oil. The inner 
flywheel rotates on precision ball bearings on the drum shaft. The only coupling between the 
drum and flywheel is through fluid friction and the very small friction in the ball bearings. 
The flection of the film-loops between the drum head and idler pulleys provides the spring 
restoring force for the system as the film runs rapidly through the system. 

From Figure 12.5.1 the dynamical equations governing the outer case and inner flywheel 
are (1) the rate of change of the outer casing of the film head equals the frictional torque 
given to the casing from the inner flywheel plus the restoring torque due to the flection of 
the film, and (2) the rate of change of the inner flywheel equals the negative of the frictional 
torque given to the outer casing by the inner flywheel. 

Assuming that the frictional torque between the two flywheels is proportional to the 
difference in their angular velocities, the frictional torque given to the casing from the inner 
flywheel is B(w2—w 1), where B is the frictional resistance, w and w» are the deviations of the 
drum and inner flywheel from their normal angular velocities, respectively. If r is the ratio 
of the diameter of the winding sprocket to the diameter of the drum, the restoring torque 
due to the flection of the film and its corresponding angular twist equals K DA (rwo— w4) dr, 
where K is the rotational stiffness and wo is the deviation of the winding sprocket from its 


6 Cook, E. D., 1935: The technical aspects of the high-fidelity reproducer. J. Soc. Motion Pict. Eng., 
25, 289—312. 
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Figure 12.5.1: (A) The schematic for the scanning light in a motion-picture projector and (B) interior of 
the film drum head. 


normal angular velocity. The quantity rwo gives the angular velocity at which the film is 
running through the projector because the winding sprocket is the mechanism that pulls 
the film. Consequently, the equations governing this mechanical system are 


t 
j, =x | (rwo — wi) dr + B(w — w1), (12.5.37) 

0 

and d 
AUTE = -B(w — w1). (12.5.38) 


With the winding sprocket, the drum, and the flywheel running at their normal uniform 
angular velocities, let us assume that the winding sprocket introduces a disturbance equiv- 
alent to a unit increase in its angular velocity for 0.15 second, followed by the resumption 
of its normal velocity. It is assumed that the film in contact with the drum cannot slip. 
The initial conditions are w4(0) = w»(0) = 0. 

Taking the Laplace transform of Equation 12.5.37 and Equation 12.5.38 and using 
Equation 12.1.18, 


K K 
(n +B+ ) Q4(s) — BQ2(s) = TE Als) =rKL ff wo(T) ar| ; (12.5.39) 
s S 0 
and 
—B((s) + (Jos + B)Q2(s) = 0. (12.5.40) 
The solution of Equation 12.5.39 and Equation 12.5.40 for Q4(s) is 


rK (s + ao)Qo(s) 
Q (s) = 12.5.41 
1(s) Ji s +698? + bis + by’ ( ) 


where typical values” are 


rK B BK 
— = 90.8 = — = 1.47, b= 
Jı 2o J2 * O87 uada 


— 231, (12.5.42) 


7 Ji = 1.84 x 104 dyne cm sec? per radian, J2 = 8.43 x 10* dyne cm sec? per radian, B = 12.4 x 104 
dyne cm sec per radian, K = 2.89 x 109 dyne cm per radian, and r = 0.578. 


The Laplace Transform 587 


K B 
hc aur dud n 


= = 8.20. 12.5.43 
Jı JiJ2 ( ) 


The transform Q(s) has three simple poles located at sı = —1.58, s2 = —3.32 + 11.62, and 
S3 = —3.32 — 11.6i. 

Because the sprocket angular velocity deviation wo(t) is a pulse of unit amplitude and 
0.15 second duration, we express it as the difference of two Heaviside step functions 


wolt) = H(t) — H(t — 0.15). (12.5.44) 
Its Laplace transform is 
l 4 
Qo(s) = — — —e~ 9188 (12.5.45) 
S s 


so that Equation 12.5.41 becomes 


rK (s + ag) 
Jı s(s — s1)(s — s2)(s — sa 


Q1(s) = (1— 679195), (12.5.46) 


'The inversion of Equation 12.5.46 follows directly from the second shifting theorem 
and Heaviside's expansion theorem, or 


w(t) = Ko + Kie + K3e?* + Ke?! (12.5.47) 
RN [Ko 4+ K,e%¢-9-15) ES Koe82(¢—0.-15) + K3e83 9-15) H (t _ 0.15), 


where 
rK S + ao 
Ko= = 0.578, 12.5.48 
° 4 (s= 81)(s — s2)(s — ss) s=0 l 
K 
K A = 0.046, 12.5.49) 
Jı 8(s — 82)(s — 83) | s-s 
K 2 
Rec ZEE = 0.326e165^i. 12.5.50) 
J, s(s— s1)(s — 83) | 5-5, 
and 
K o; 
pel peu = 0.32667 195^1. (12.5.51) 
Ji s(s — s1)(s — 82) |s=s; 


Using Euler's identity cos(t) = (e'* + e7#)/2, we can write Equation 12.5.47 as 


(t) = 0.578 + 0.046e~*°8! + 0.652e~* 3"! cos(11.6t + 165?) 
— {0.578 + 0.046e 1-597015) + 0.652¢~ 3-92t-0-18) 
x cos[11.6(t — 0.15) + 165°]} H(t — 0.15). (12.5.52) 


Equation 12.5.52 is plotted in Figure 12.5.2. Note that fluctuations in w;(t) are damped 
out by the particular design of this film projector. Because this mechanical device dampens 
unwanted fluctuations (or noise) in the motion-picture projector, this particular device is 
an example of a mechanical filter. 
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Figure 12.5.2: The deviation w(t) of a film drum head from its uniform angular velocity when the sprocket 
angular velocity is perturbed by a unit amount for the duration of 0.15 second. 


Problems 


Use Heaviside's expansion theorem to find the inverse of the following Laplace transforms: 


1 s+3 
IUS a 2 FS) De 0) 
3. F(s) = ane T MIR. 


(s+ 2)(s 4- 1)(s — 3) (s? + 4)(s + 1) 


Find the inverse of the following transforms and express them in amplitude/phase form: 


1 1 
Buceo ess bp Esc. cR s 
iie no ER () - 2r 6.313 
2s— 5 1 
7. F(s) = 272 Si eee ae 
(s) s? + 16 (s) s(s? + 2s + 2) 
s+2 
eG Rear yar 


12.6 CONVOLUTION 


In this section we turn to a fundamental concept in Laplace transforms: convolution. 
We shall restrict ourselves to its use in finding the inverse of a transform when that transform 
consists of the product of two simpler transforms. In subsequent sections we will use it to 
solve ordinary differential equations. 


We begin by formally introducing the mathematical operation of the convolution prod- 
uct 


t t 
fit) a(t) - | f(e=a)g(v) de = [ regit 2) ae. (12.6.1) 
0 0 
In most cases the operations required by Equation 12.6.1 are straightforward. 


e Example 12.6.1 


Let us find the convolution between cos(t) and sin(t). 


cos(t) * sin(t) = f sin(t — x) cos(x) dz = Jd [sin(t) + sin(t — 2z)] dr (12.6.2) 
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t t 
= 2 sin(t) dx + / sin(t — 2x) dx (12.6.3) 
0 0 
1. t 1 t Tai 
= 5 sin(t) i + 4 cos(t — 2|; = 5tsin(t). (12.6.4) 


e Example 12.6.2 


Similarly, the convolution between t? and sin(t) is 


t? «sin(t) = fe — a)? sin(z) dx 12.6.5) 
É2-(t-zy cos(x)|, — 2 [ 8-2) cos(x) dx 12.6.6) 
0 
= ť —9(t-z) sin(«)|, — a sin(x) da 12.6.7) 
0 
= t? + 2cos(t) — 2 12.6.8) 


by integration by parts. 


e Example 12.6.3 


Consider now the convolution between e* and the discontinuous function H(t — 1) — 
H (t — 2): 


e x [H(t — 1) - H(t — 2)] = f e "LH(x — 1) — H(z — 2)] dx (12.6.9) 
= é T e *[H(z — 1) — H(z — 2)] dz. (12.6.10) 
0 


In order to evaluate the integral, Equation 12.6.10, we must examine various cases. If t « 1, 
then both of the step functions equal zero and the convolution equals zero. However, when 
1 « t « 2, the first step function equals one while the second equals zero as the dummy 
variable x runs between 1 and t. Therefore, 


t 
e «[H(t-1) - H(t-2)] = n e *dy = e^! — 1, (12.6.11) 
1 


because the portion of the integral from zero to one equals zero. Finally, when t > 2, the 
integrand is only nonzero for that portion of the integration when 1 < x < 2. Consequently, 


2 
e x [H(t —1)  H(t-2)] = e f e “de = e! — er? (12.6.12) 
1 
Thus, the convolution of e* with the pulse H(t — 1) — H(t — 2) is 


el _ 1, 1<t<2, (12.6.13) 
t—-1 _ et^? 2 < t. 


, 


0 0<t<1, 
^e) mea) ={ 
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MATLAB can also be used to find the convolution of two functions. For example, in 
the present case the commands 
syms x t positive 
int ("exp(t-x)*(Heaviside(x-1)-Heaviside(x-2))',x,0,t) 
yield 
ans = 
-Heaviside(t-1)-*Heaviside(t-1)*exp(t-1)-*Heaviside(t-2) 
-Heaviside(t-2)*exp(t-2) 


The reason why we introduced convolution stems from the following fundamental the- 
orem (often called Borel’s theorem? ). If 


w(t) = u(t) x v(t) (12.6.14) 


then 
W(s) = U(s)V (s). (12.6.15) 


In other words, we can invert a complicated transform by convoluting the inverses to two 
simpler functions. The proof is as follows: 


W(s) = [ n da! g "tdi 12.6.16) 
= [ f er) — retail dx 12.6.17) 
= [ u(x) n ofre ar dz 12.6.18) 
E n u(z)e*as| [mena =l vta); 12.6.19) 


where t£ — r 4- x. 


e Example 12.6.4 


Let us find the inverse of the transform 


S _ s 1 s : = . EDU 
(xD C3 Nue L{cos(t)|L[sin(t)] = £|[cos(t) x sin(t)] = £[5t sin(t)] 
(12.6.20) 
from Example 12.6.1. 
e Example 12.6.5 
Let us find the inverse of the transform 
1 1 a a 1 ; . 
TW = (a Ta x 3 a z) = qz ^ lsin(at)] £|sin(at)]. (12.6.21) 


8 Borel, É., 1901: Lecons sur les séries divergentes. Gauthier-Villars, p. 104. 
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Therefore, 
bee : Lf sinat )] sin(az) d. 12.6.22) 
= in[a(t — x)| sin(ax) dx .6. 
(s? + a2)? a2 Jo 

T f 1 ft 

= x: cos[a(t — 2x)] dz — T cos(at) dx 12.6.23) 

1 Ee Í : 

ia sin[a(t — 2z)] aa cos(at) a: ' 12.6.24) 
T 

= zg Sin(at) — at cos(at)]. 12.6.25) 


e Example 12.6.6 
Let us use the results from Example 12.6.3 to verify the convolution theorem. 
We begin by rewriting Equation 12.6.13 in terms of the Heaviside step functions. Using 
the method outline in Example 12.2.1, 
f(t) «g(t) = (e^! —1) H(t- 1) + (1- e?) H(t — 2). (12.6.26) 


Employing the second shifting theorem, 


Lif *g] = - — + - (12.6.27) 
e 3 e ?$5 1 e 3 e ?5 

H s(s—1) s(s—1) s-1 ( E E ) 2628) 

= £[e*|£[H (t — 1) - H(t — 2)] (12.6.29) 


and the convolution theorem holds true. If we had not rewritten Equation 12.6.13 in terms 
of step functions, we could still have found Z[f * g] from the definition of the Laplace 
transform. 


Problems 


Verify the following convolutions and then show that the convolution theorem is true. Use 
MATLAB to check your answer. 


lop t 2. 1 x cos(at) = sin(at)/a 

3.1«ef 2e —1 A txt =O /6 

5. tx sin(t) = t — sin(t) 6.txe'=e'—-t-1 

i asin = d i az) 8. t H(t—1) = 1t D2H(t — 1) 


9. H(t— a) x H(t— b) = (t-a—0t)H(t —a— b) 
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10. t«[H(t) - H(t- 2] = 


Use the convolution theorem to invert the following functions: 


1 1 
11. F(s) = 12. F(s) = ———— 
(5) s?(s — 1) (s) s? (s +a)? 
13. Given? 
a/s 
UE | = nar). Bie 
E 
show that 
a/s 1 a/s 
e e 
1 
s—1 (+5) s’ NEW 
and 
ca/5 vat 72 
Pm (: — ;) = [&(t) +e] « Io(2Vat ) =e + EU en(-F) I,(r) dr, 


where [,,(-) denotes a modified Bessel function of the first kind and order n, which was 
introduced in Section 6.5. There we will show that I(T) = I(T). 


14. Prove that the convolution of two Dirac delta functions is a Dirac delta function. 
12.7 INTEGRAL EQUATIONS 


An integral equation contains the dependent variable under an integral sign. The convo- 
lution theorem provides an excellent tool for solving a very special class of these equations, 
Volterra equation of the second kind:'° 


f(t) -f K{t, x, f (z)] dx = g(t), 0<t<T. (12.7.1) 


These equations appear in history-dependent problems, such as epidemics,!! vibration prob- 
lems,!? and viscoelasticity.'? 


9 Watson, op. cit., p. 195. 


0 Fock, V., 1924: Über eine Klasse von Integralgleichungen. Math. Z., 21, 161-173; Koizumi, S., 1931: 
On Heaviside's operational solution of a Volterra's integral equation when its nucleus is a function of (x — £). 
Philos. Mag., Ser. 7, 11, 432-441. 


1 Wang, F. J. S., 1978: Asymptotic behavior of some deterministic epidemic models. SIAM J. Math. 
Anal., 9, 529—534. 


2 Lin, S. P., 1975: Damped vibration of a string. J. Fluid Mech., 72, 787-797. 


3 Rogers, T. G., and E. H. Lee, 1964: The cylinder problem in viscoelastic stress analysis. Q. Appl. 
Math., 22, 117-131. 
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e Example 12.7.1 


Let us find f(t) from the integral equation 


Ttc u-i] f(a) sin(t — x) da. (12.7.2) 


The integral in Equation 12.7.2 is such that we can use the convolution theorem to find 
its Laplace transform. Then, because Z[sin(t)] = 1/(s? +1), the convolution theorem yields 


F(s) 


t 
L / f(x) sin(t — x) da! — SEDI (12.7.3) 
Therefore, the Laplace transform converts Equation 12.7.2 into 
4  3F(s) 
= : 12.7.4 
F()- 3 -LRA (12.7.4) 
Solving for F(s), 
A(s? +1) 
F(s) = ———. 12.7.5 
()=Sarrp (12.7.5) 
By partial fractions, or by inspection, 
1 3 
F(s) = ——— 12.7.6 
()7 (12.7.6) 
Therefore, inverting term by term, 
f(t) = t + 2 sin(2t). (12.7.7) 
Note that the integral equation 
t 
f()= s-s f f(t — x)sin(x) dx (12.7.8) 
0 
also has the same solution. 
e Example 12.7.2 
Let us solve the equation 
t 
ft) + o | f(r) dr = B — C cos(wt), f(0) — 0. (12.7.9) 
0 


Again the integral is one of the convolution type; it differs from the previous example 
in that it includes a derivative. Taking the Laplace transform of Equation 12.7.9, 
2p B C 
sF(s) - f(0) - ^ oa ^ (12.7.10) 


E s. s2-w2: 


Because f(0) = 0, Equation 12.7.10 simplifies to 


Cs? 


2 2 


(12.7.11) 
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Solving for F(s), 


B Cs? 
F(s) = ; 12.7.12 
(s) s+a2 (s? a2)(s2 +w?) ( ) 
Using partial fractions to invert Equation 12.7.12, 
B aC wC 
f(t) = (2 + J= =) sin(at) = iE = g? sin(wt). (12.7.13) 
e Example 12.7.3 
Let us solve!^ the integral equation 
" t 
Ü)- ———— - d z^ 12.7.14 
10 = gea | ft food ee (12.7.14) 
Taking the Laplace transform of Equation 12.7.14, we obtain 
F? 1 
A E (12.7.15) 


2(1+2a) s-41 


Solving for F(s) so that F(s) + 0 as s — oo, we have 


Poa 2at+1 /(2a+1)(st+1)—-2a 2a+1 v2a+1 /(2a+1)s+1 
Ln" a (Qa+1)\(s+1) a a s+1 l 
(12.7.16) 
Taking the inverse of Equation 12.7.16, 
2a+1 2a+1 
fd) = 281 gg — X98 1 p, (12.7.17) 
where g(t) is the inverse of the Laplace transform G(s), 
2 1 1 H1/(1+2 
oea ea 5 Ue (12.7.18) 
s+1 V/s - 1A/s 4 1/(2a +1) 
H(s) 
= y2a4d- 1 sH(s) + ———— 12.7.19 
SY TET l ) 
and j 
H(s)= : (12.7.20) 
Vs +1\/s + 1/(2a +1) 
Taking the inverse of H(s), we find that 
a+1 at 
h(t) = t) [i 12.7.21 
() exp( it) (ze) ( ) 


14 Hounslow, M. J., 1990: A discretized population balance for continuous systems at steady state. 
AICHE J., 36, 106-116. 
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and 


j a+1 a+1 at i a a+1 at 
re exp( x) n(m) "2a+1 exp( x) n (se) 
(12.7.22) 
where Jo(-) and I;(-) are modified Bessel functions of the first kind. See Section 6.5. 
Because sH (s) = £[h/(t)] +h(0) and A(0) = 1, h’(t) = £C 1 [sH(s)] —6(t) or £ 1 [sH (s)] 
= h'(t) + ó(t). Then, 


4 se) (12.7.23) 


a a+1 at 
= /2a +1 |&(t) 4 t) I 
bo zal sie) (ze) 


a a+1 at 
t) ; 12.7.24 
xcu mt) (seri) ( ) 
Finally, substituting Equation 12.7.24 into Equation 12.7.17, 
a+1 at at 
= ———— I F 12.7.2 
PO exp( it) Le: (aa) ( s 
Problems 


Solve the following integral equations: 


1. f(t) 2 142 f ft—az)e** dz 2. f(t) ad f(x) sin(t — x) dx 
3. sy =t+ f f(t — x)e * dx 4. ro =e - | f(t — z)e * dx 
9-e«[ f(x) sin(t — x) dx 6. f (t) -se-aj f(a) sin(t — x) dx 


7. f(t) =- f f(t — x)sinh(2x) dz 8. f (t) =142/ f(t — x) cos(x) dx 


9. f(t) =e 42 f enoui 10. f(£) =t? + f Tabuta 


11. f(t) ie ia) f(x) cos(t — x) dx 12. f(t) -6+4 | fe - nds 

13. f(t) = avt — [ Ex i 

14. Solve the following equation for f(t) with the condition that f(0) = 
f'(t)2t4 [ f(t — x) cos(x) dz. 

15. Solve the following equation for f(t) with the condition that f(0) = 


f'(t) = sin(t )+ f f a) cosl ) cos(x) da 
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16. During a study of nucleation involving idealized active sites along a boiling surface, 
Marto and Rohsenow!? solved the integral equation 


Ac pisc f Oa 
0 m2 


to find the position x(t) of the liquid/vapor interface. If A, B, and C are constants and 
x(0) = 0, find the solution for them. 


17. Solve the following equation for x(t) with the condition that x(0) = 0: 


1 t x'(T) 
z(t) +t = —= dr, 
(0 m Vt—T 
where c is constant. 


18. During a study of the temperature f(t) of a heat reservoir attached to a semi-infinite 
heat-conducting rod, Hubert? solved the integral equation 


f= 


pg 
vm 0 V i—T , 
where a and £ are constants and f(0) = 0. Find f(t) for him. Hint: 


a a ap 


g3/2( 31/2 + B) s(s = 82) s3/?(s = Bey 


19. During the solution of a diffusion problem, Zhdanov, Chikhachev, and Yavlinskii!" 
solved an integral equation similar to 


t 
| f(r) [1 — erf (avt — 7)] dr = at, 
0 
2 x 
where erf(z) = — n ev dy is the error function. What should they have found? Hint: 
T 


0 
You will need to prove that 


vt —a?t - a 


€ 


avr — ss-cai 


L£ | erf(av/t) — a aerilavt) + 


20. The Laguerre polynomial!’ 


ylt) = Ln(t) = ~- (t"e™),  n=0,1,2,3,... 


5 Marto, P. J., and W. M. Rohsenow, 1966: Nucleate boiling instability of alkali metals. J. Heat 
Transfer, 88, 183-193. 


6 Huber, A., 1934: Eine Methode zur Bestimmung der Wärme- und Temperaturleitfähigkeit. Monatsh. 
Math. Phys., 41, 35-42. 


7 Zhdanov, S. K., A. S. Chikhachev, and Yu. N. Yavlinskii, 1976: Diffusion boundary-value problem for 
regions with moving boundaries and conservation of particles. Sov. Phys. Tech. Phys., 21, 883-884. 


8 See Section 5.3 in Andrews, L. C., 1985: Special Functions for Engineers and Applied Mathematicians. 
MacMillan, 357 pp. 
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satisfies the ordinary differential equation 
ty" + (1— ty + ny = (ty) — ty’ + ny = 0, 


with y(0) = 1 and y'(0) = —n. 
Step 1: Using Equation 12.1.18 and Equation 12.3.20, show that the Laplace transformed 


version of this differential equation is 


n+l—s |n , n+1 J 
ausi e Y(s) Y(s), 


s—1 S 


Y'(s) 2 


where Y (s) is the Laplace transform of y(t). 


Step 2: Using Equation 12.3.20 and the convolution theorem, show that Laguerre polyno- 
mials are the solution to the integral equation 


ty(t) = (n+ i y(r)dr — net f y(r)e * dr. 


12.8 SOLUTION OF LINEAR DIFFERENTIAL EQUATIONS WITH 
CONSTANT COEFFICIENTS 


For the engineer, as it was for Oliver Heaviside, the primary use of Laplace transforms is 
the solution of ordinary, constant coefficient, linear differential equations. These equations 
are important not only because they appear in many engineering problems but also because 
they may serve as approximations, even if locally, to ordinary differential equations with 
nonconstant coefficients or to nonlinear ordinary differential equations. 

For all of these reasons, we wish to solve the initial-value problem 


d"y d"-ly dy 
dm Fai EIDEM) Toc sa "s Hany = f(t), t>0, (12.8.1) 
by Laplace transforms, where a1,a2,... are constants and we know the value of y, y',..., 


y» at t= 0. The procedure is as follows. Applying the derivative rule Equation 12.1.18 
to Equation 12.8.1, we reduce the differential equation to an algebraic one involving the 
constants a1,02,...,G4, the parameter s, the Laplace transform of f(t), and the values of 
the initial conditions. We then solve for the Laplace transform of y(t), Y (s). Finally, we 
apply one of the many techniques of inverting a Laplace transform to find y(t). 

Similar considerations hold with systems of ordinary differential equations. The Laplace 
transform of the system of ordinary differential equations results in an algebraic set of 
equations containing Y; (s), Yo(s), ..., Y, (s). By some method we solve this set of equations 
and invert each transform Y; (s), Yo(s),..., Y, (s) in turn to give yi(£), yo(t), .... ys (t). 

The following examples will illustrate the details of the process. 


e Example 12.8.1 
Let us solve the ordinary differential equation 


y" + 2y' = 8t, (12.8.2) 
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subject to the initial conditions that y'(0) — y(0) — 0. Taking the Laplace transform of 
both sides of Equation 12.8.2, 


L(y”) + 2£(y) = 8£(t), (12.8.3) 


or 
8 
2 = 
s^Y (s) — sy(0) — y'(0) + 2sY (s) — 2y(0) = EL (12.8.4) 
where Y (s) = £[y(t)]. Substituting the initial conditions into Equation 12.8.4 and solving 
for Y (s), 


V 8 d M © D _ (s+2)A+s(s+2)B+s8s?°(s+2)C+8s°D 
~ s8(s+2) s^ s ^5 s s42 s3(s + 2) ` 


(12.8.5) 
Matching powers of s in the numerators of Equation 12.8.5, C + D = 0, B+2C = 0, 
A+2B=0, and 2A = 8 or A = 4, B = —2,C = 1, and D = —1. Therefore, 
4 TI 1 


Y(s) = aaa (12.8.6) 


Finally, performing term-by-term inversion of Equation 12.8.6, the final solution is 


y(t) = 27? — 2t - 1— e^**, (12.8.7) 


We could have done the same operations using the symbolic toolbox with MATLAB. 
The MATLAB script 


clear 

^ define symbolic variables 

syms s t Y 

^4 take Laplace transform of left side of differential equation 
LHS = laplace(diff(diff(symC y(t)?))) s2*diff(symC y (t)?)))5; 

^ take Laplace transform of right side of differential equation 
RHS - laplace(8*t); 

% set Y for Laplace transform of y 

% and introduce initial conditions 

newLHS = subs(LHS,’laplace(y(t),t,s)’,’y(0)’,’D(y) (0)’,Y,0,0); 
^ solve for Y 

Y = solve(newLHS-RHS,Y); 

^ invert Laplace transform and find y(t) 

y = ilaplace(Y,s,t) 

yields the result 

y= 

il-exp(-2*t)-2*t*2*t^2 


which agrees with Equation 12.8.7. 
e Example 12.8.2 
Let us solve the ordinary differential equation 


y' +y = H(t) —- H(t- 1) (12.8.8) 
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with the initial conditions that y'(0) — y(0) — 0. Taking the Laplace transform of both 
sides of Equation 12.8.8, 


s?Y (s) — sy(0) — y' (0) + Y (s) = : — - (12.8.9) 


where Y (s) = £[y(t)]. Substituting the initial conditions into Equation 12.8.9 and solving 


or Y(s) 
"s Yo -(i =) G a) (12.8.10) 


Using the second shifting theorem, the final solution is 


y(t) = 1 — cos(t) — [1 — cos(t — 1)] H (t — 1). (12.8.11) 


We can check our results using the MATLAB script 


clear 

^ define symbolic variables 

syms s t Y 

^4 take Laplace transform of left side of differential equation 
LHS = laplace(diff(diff(sym(C y(t))))*syn C y(t)?)); 

^ take Laplace transform of right side of differential equation 
RHS = laplace(’Heaviside(t) - Heaviside(t-1)’,t,s); 

^ set Y for Laplace transform of y 

% and introduce initial conditions 

newLHS = subs(LHS,’laplace(y(t),t,s)’,’y(0)’,’D(y) (0)’,Y,0,0); 
^ solve for Y 

Y = solve(newLHS-RHS,Y) ; 

% invert Laplace transform and find y(t) 

y = ilaplace(Y,s,t) 

which yields 


y = 
1-cos(t)-Heaviside(t-1)+Heaviside(t-1)*cos(t-1) 


e Example 12.8.3 
Let us solve the ordinary differential equation 
y" +2y'+y= f(t) (12.8.12) 


with the initial conditions that y'(0) = y(0) = 0, where f(t) is an unknown function whose 
Laplace transform exists. Taking the Laplace transform of both sides of Equation 12.8.12, 


s?Y (s) — sy(0) — y'(0) + 2sY(s) — 2y(0) + Y (s) = F(s), (12.8.13) 


where Y (s) = £[y(t)]. Substituting the initial conditions into Equation 12.8.13 and solving 
for Y (s), 
1 


iu 


F(s). (12.8.14) 
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We wrote Equation 12.8.14 in this form because the transform Y (s) equals the product 
of two transforms 1/(s + 1)? and F(s). Therefore, by the convolution theorem we can 
immediately write 


y(t) = te™ x f(t) = i xe * f(t— x) da. (12.8.15) 


Without knowing f(t), this is as far as we can go. 


e Example 12.8.4: Forced harmonic oscillator 


Let us solve the simple harmonic oscillator forced by a harmonic forcing 
y" +wy = cos(wt), (12.8.16) 


subject to the initial conditions that y'(0) = y(0) = 0. Although the complete solution 
could be found by summing the complementary solution and a particular solution obtained, 
say, from the method of undetermined coefficients, we now illustrate how we can use Laplace 
transforms to solve this problem. 

Taking the Laplace transform of both sides of Example 12.8.16, substituting in the 
initial conditions, and solving for Y (s), 


Y(s) = (VE (12.8.17) 
and 
ies = sin(wt) * cos(wt) = + sin(ut). (12.8.18) 


Equation 12.8.18 gives an oscillation that grows linearly with time although the forcing 
function is simply periodic. Why does this occur? Recall that our simple harmonic oscillator 
has the natural frequency w. But that is exactly the frequency at which we drive the system. 
Consequently, our choice of forcing has resulted in resonance where energy continuously 
feeds into the oscillator. 


e Example 12.8.5 
Let us solve the system of ordinary differential equations: 
22 + y = cos(t), (12.8.19) 
and 
y' — 2x = sin(t), (12.8.20) 


subject to the initial conditions that z(0) = 0, and y(0) = 1. Taking the Laplace transform 
of Equation 12.8.19 and Equation 12.8.20, 


E 
and 
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after introducing the initial conditions. Solving for X(s) and Y (s), 


1 


X(s) = ——Há5 12.8.23 
()--Gri (12.8.23) 
and 9 
E E 
Y(s) = t 2 12.8.24 
Taking the inverse of Equation 12.8.23 and Equation 12.8.24 term by term, 
a(t) = 5 [tcos(t) — sin(t)], (12.8.25) 
and 
y(t) = tsin(t) + cos(t). (12.8.26) 


'The MATLAB script 


clear 

^ define symbolic variables 

syms s t X Y 

% take Laplace transform of left side of differential equations 

LHS1 = laplace(2*diff(sym(’x(t)’))+sym(’y(t)’)); 

LHS2 = laplace(diff(sym(’y(t)’))-2*sym(’x(t)’)); 

^ take Laplace transform of right side of differential equations 

RHS1 = laplace(cos(t)); RHS2 = laplace(sin(t)); 

^ set X and Y for Laplace transforms of x and y 

h and introduce initial conditions 

newLHS1 = subs(LHS1,’laplace(x(t),t,s)’,’laplace(y(t),t,s)’,... 
"x (0)? ,^y(0) ,X, Y,0,1); 

newLHS2 = subs(LHS2, ’laplace(x(t),t,s)’,’laplace(y(t),t,s)’,... 
»x(0)?,’y(0)’,X,Y,0,1); 

% solve for X and Y 

[X,Y] = solve(newLHS1-RHS1,newLHS2-RHS2,X,Y); 

% invert Laplace transform and find x(t) and y(t) 

x = ilaplace(X,s,t); y = ilaplace(Y,s,t) 

uses the symbolic toolbox to solve Equation 12.8.19 and Equation 12.8.20. MATLAB finally 

gives 

x= 

1/2*t*cos(t)-1/2*sin(t) 

y = 

t*sin(t)+cos(t) 


e Example 12.8.6 


Let us determine the displacement of a mass m attached to a spring and excited by 
the driving force 


PO = ma (1- 1.) etr. (12.8.27) 


The dynamical equation governing this system is 


t 
y esty A(1- enr, (12.8.28) 
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where w? = k/m and k is the spring constant. Assuming that the system is initially at rest, 
the Laplace transform of the dynamical system is 


A A 
? E w?)Y (s) = 12.8.29 
(P +08) = Ln UT (12.8.29) 
Or d P 
Y(s) = ; 12.8.30 
(s) (s2 +w?)(s+1/T) T(s? +w?)(s+1/T)? ( ) 
Partial fractions yield 
Y{)= A 1 s—1/T A 
S= RFI \sti/T stu?) Tw?+1/T?)? 
1/T? — w? 2 2 
,[Th-o^ — z/T | w +1/T? | cMT ] TT 
s? +w? s?tw2  (s+1/T)?  s4+1/T 
Inverting Equation 12.8.31 term by term, 
AT? sin(wt) AT? sin(wt) 
t) = ———_ =T t 1 272 
y(t) 14+ w?T? e panana wT | (1+ w?T?)? T diu. wT 


+2 d i cos(wt)| +(1+ TOAT eT}, (12.8.32) 


The solution to this problem consists of two parts. The exponential terms result from 
the forcing and will die away with time. This is the transient portion of the solution. The 
sinusoidal terms are those natural oscillations that are necessary so that the solution satisfies 
the initial conditions. They are the steady-state portion of the solution and endure forever. 
Figure 12.8.1 illustrates the solution when wT = 0.1, 1, and 2. Note that the displacement 
decreases in magnitude as the nondimensional frequency of the oscillator increases. 


e Example 12.8.7 


Let us solve the equation 
y” + 16y = ó(t — 7/4) (12.8.33) 


with the initial conditions that y(0) = 1, and y'(0) = 0. 
Taking the Laplace transform of Equation 12.8.33 and inserting the initial conditions, 


(s? -- 16)Y (s) =s+e—27/4, (12.8.34) 

Or 
Y 2 em 12.8.35 
(9 = x16 * x16 LABS) 


Applying the second shifting theorem, 
y(t) = cos(4t) + 1 sin[4(t — 7/4) H(t — 1/4) = cos(4t) — 1sin(4t) H(t — 1/4). (12.8.36) 


We can check our results using the MATLAB script 
clear 
^ define symbolic variables 
syms pi s t Y 
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Figure 12.8.1: Displacement of a simple harmonic oscillator with nondimensional frequency wT as a 
function of time t/T. The top frame shows the forcing function. 


^4 take Laplace transform of left side of differential equation 
LHS = laplace (diff (diff (sym(’y(t)’)))+16*sym(’y(t)’)); 
^4 take Laplace transform of right side of differential equation 
RHS = laplace(’Dirac(t-pi/4)’,t,s); 
^ set Y for Laplace transform of y 
^ and introduce initial conditions 
newLHS = subs(LHS,’laplace(y(t),t,s)’,’y(0)’,’D(y) (00^ , Y,1,0); 
^4 solve for Y 
Y = solve(newLHS-RHS,Y) ; 
% invert Laplace transform and find y(t) 
y = ilaplace(Y,s,t) 
which yields 
y= 
cos (4*t) -1/4*Heaviside(t-1/4*pi)*sin(4*t) 
We can also verify that Equation 12.8.36 is the solution to our initial-value problem by 
computing the (generalized) derivative of Equation 12.8.36, or 


y (t) = —4sin(4t) — cos(4t) H(t — 1/4) — 4 sin(4t)ó(t — 1/4) (12.8.37) 
= —4sin(4t) — cos(4t) H (t — 1/4) — 4 sin(r)ô(t — «/4) (12.8.38) 
= —Asin(4t) — cos(4t) H(t — 1/4), (12.8.39) 
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since f(t)d(t — to) = f(to)d(t — to). Similarly, 


y" (t) = —16 cos(4t) + 4sin(4t) H (t — 1/4) — cos(4t)d(t — 1/4) (12.8.40) 
= —16cos(4t) + 4sin(4t) H (t — 1/4) — cos(x)ó(t — 7/4) (12.8.41) 
= —16 cos(4t) + Asin(4t) H (t — 1/4) + ó(t — 1/4). (12.8.42) 


Substituting Equation 12.8.36 and Equation 12.8.42 into Equation 12.8.32 completes the 
verification. A quick check of y(0) and y'(0) also shows that we have the correct solution. 


e Example 12.8.8: Oscillations in electric circuits 


During the middle of the nineteenth century, Lord Kelvin!? analyzed the LCR electrical 
circuit shown in Figure 12.8.2, which contains resistance R, capacitance C, and inductance 
L. For reasons that we shall shortly show, this LCR circuit has become one of the quintessen- 
tial circuits for electrical engineers. In this example, we shall solve the problem by Laplace 
transforms. 

Because we can add the potential differences across the elements, the equation govern- 
ing the LCR circuit is 


Lo + RI + sf Idr = (12.8.43) 
where J denotes the current in the circuit. Let us solve Equation 12.8.43 when we close 


the circuit and the initial conditions are [(0) = 0 and Q(0) = —Qo. Taking the Laplace 
transform of Equation 12.8.43, 


Lx 0 
(zs +R+ a) KL- (12.8.44) 
Solving for I (s), 
T Qo wj Qo wj Qo 
I(s)— = = 12.8.45 
(s) Cs(Ls+R+1/Cs) s?-c2as-ctwà  (sco)-cwi-o? ( ) 
where a = R/(2L), and w = 1/(LC). From the first shifting theorem, 
wj Qo 
I(t) = —~e  sin(wt), (12.8.46) 
w 


where w? = wĝ — o? > 0. The quantity w is the natural frequency of the circuit, which 
is lower than the free frequency wo of a circuit formed by a condenser and coil. Most 
importantly, the solution decays in amplitude with time. 

Although Kelvin's solution was of academic interest when he originally published it, 
this radically changed with the advent of radio telegraphy?? because the LCR circuit de- 
scribed the fundamental physical properties of wireless transmitters and receivers.?! The 


19 Thomson, W., 1853: On transient electric currents. Philos. Mag., Ser. 4, 5, 393-405. 


20 Stone, J S., 1914: The resistance of the spark and its effect on the oscillations of electrical oscillators. 
Proc. IRE, 2, 307-324. 


21 See Hogan, J. L., 1916: Physical aspects of radio telegraphy. Proc. IRE, 4, 397-420. 
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Figure 12.8.2: Schematic of a LCR circuit. 


inescapable conclusion from numerous analyses was that no matter how cleverly the re- 
ceiver was designed, eventually the resistance in the circuit would dampen the electrical 
oscillations and thus limit the strength of the received signal. 

This technical problem was overcome by Armstrong,?? who invented an electrical circuit 
that used De Forest’s audion (the first vacuum tube) for generating electrical oscillations 
and for amplifying externally impressed oscillations by “regenerative action." The effect of 
adding the “thermionic amplifier" is seen by again considering the LRC circuit as shown in 
Figure 12.8.3 with the modification suggested by Armstrong.?? 

'The governing equations of this new circuit are 


dI dl, 
— Id M — 12.8.47 
nt enses PME (12.8.47) 
and 
dl, 


where the plate circuit has the current Ip, the resistance Ro, the inductance Lz, and the 
electromotive force (emf) of u fo Idr/C. The mutual inductance between the two circuits 
is given by M. Taking the Laplace transform of Equation 12.8.47 and Equation 12.8.48, 


LisI(s) + RI(s) + uc + MsI,(s) = a (12.8.49) 
and 
LasI,(s) + Rol, (s) + MsI(s) + alls) =0. (12.8.50) 


Eliminating Tp(s) between Equation 12.8.49 and Equation 12.8.50 and solving for I(s), 


I(s) = (Los + Ro)Qo 
(LiLo — M?)Cs? + (RL + RoLi)C s? + (La + CRRo — pM)s + Ro 


(12.8.51) 


22 Armstrong, E. H., 1915: Some recent developments in the audion receiver. Proc. IRE, 3, 215-247. 


?3 See Ballantine, S., 1919: The operational characteristics of thermionic amplifiers. Proc. IRE, 7, 
129-161. 
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Figure 12.8.3: Schematic of an LCR circuit with the addition of a thermionic amplifier. (From Ballantine, 
S., 1919: The operational characteristics of thermionic amplifiers. Proc. IRE, 7, 155.) 


For high-frequency radio circuits, we can approximate the roots of the denominator of 
Equation 12.8.51 as 

Ro 
Lo + CRRo 2 uM’ 


PES (12.8.52) 


and 
Ro Roly + RL» 


9237 9.5 +CRRo — uM) 2(l4La — M?) 


In the limit of M and Ro vanishing, we recover our previous result for the LRC circuit. 
However, in reality, Ro is very large and our solution has three terms. The term associated 
with s; is a rapidly decaying transient while the s2 and s3 roots yield oscillatory solutions 
with a slight amount of damping. Thus, our analysis shows that in the ordinary regenerative 
circuit, the tube effectively introduces sufficient "negative" resistance so that the resultant 
positive resistance of the equivalent LCR circuit is relatively low, and the response of an 
applied signal voltage at the resonant frequency of the circuit is therefore relatively great. 
Later, Armstrong?^ extended his work on regeneration by introducing an electrical circuit 
- the superregenerative circuit - where the regeneration is made large enough so that the 
resultant resistance is negative, and self-sustained oscillations can occur. It was this 
circuit?’ that led to the explosive development of radio in the 1920s and 1930s. 


+ iw. (12.8.53) 


e Example 12.8.9: Resonance transformer circuit 


One of the fundamental electrical circuits of early radio telegraphy?” is the resonance 
transformer circuit shown in Figure 12.8.4. Its development gave transmitters and receivers 
the ability to tune to each other. 

The governing equations follow from Kirchhoff's law and are 


dl di, 1 


t 
L M I,dr = E(t 12.8.54 
tM Ge + a nar BU. (12.8.54) 


24 Armstrong, E. H., 1922: Some recent developments of regenerative circuits. Proc. IRE, 10, 244—260. 
25 See Frink, F. W., 1938: The basic principles of superregenerative reception. Proc. IRE, 26, 76-106. 
26 Lewis, T., 1991: Empire of the Air: The Men Who Made Radio. HarperCollins Publishers, 421 pp. 


27 Fleming, J. A., 1919: The Principles of Electric Wave Telegraphy and Telephony. Longmans, Green, 
911 pp. 
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primary secondary 


Figure 12.8.4: Schematic of a resonance transformer circuit. 


and 
dli dl» 
M L 
di Em 


1 t 
+ RI, + | Idr — 0. (12.8.55) 
Co 0 


Let us examine the oscillations generated if initially the system has no currents or charges 
and the forcing function is E(t) = d(t). 
Taking the Laplace transform of Equation 12.8.54 and Equation 12.8.55, 


= = T 
Lıslı + MsI + — =1, (12.8.56) 
sCi 


and 


I 
MsI; + Losl2 + RI; + x =0. (12.8.57) 
SU2 


Because the current in the second circuit is of greater interest, we solve for Tọ and find that 


ET Ms’ 
I = 
a(s) Li La[(1 — k?)s* + 20425? + (w? + w2)s? + 2aw?s + w2w2]" 


(12.8.58) 


where a = R/(2L2), w? = 1/(L1C1), w2 = 1/(L2C2), and k? = M?/ (Lı Lə), the so-called 
coefficient of coupling. 

We can obtain analytic solutions if we assume that the coupling is weak (k? « 1). 
Equation 12.8.58 becomes 


TI = ME (12.8.59) 
i Li Lo(s? + w2)(s? 4-20 + w2): ane 
Using partial fractions and inverting term by term, we find that 
b(t) M 203 sin(wt) _ wi (wz — wi) cos(wit) (12.8.60) 


= T 
LiLo | (w2 — w?)2? + 4a?w? (w2 — w?)? + 402w? 
i aw3 — 3aw2w2 --4o3w2 _ sin(wt) wlw? — w?) + 4a? w? at 


t cos(wt) |, 
(w2 — wit)? + 4a2w? Ww (w3 — w2)? + 402v? k wt) 
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Figure 12.8.5: The resonance curve 1/4/(r? — 1)? + 0.01 for a resonance transformer circuit with r = 
w2/u1. 
where w? = w2 — a?. 

The exponentially damped solutions will eventually disappear, leaving only the steady- 
state oscillations that vibrate with the angular frequency w1, the natural frequency of the 
primary circuit. If we rewrite this steady-state solution in amplitude/phase form, the am- 
plitude is 

M 


Libo /(r? — 1)? + da? Jw? 


(12.8.61) 


where r = w3/w. As Figure 12.8.5 shows, as r increases from zero to two, the amplitude 
rises until a very sharp peak occurs at r = 1 and then decreases just as rapidly as we 
approach r = 2. Thus, the resonance transformer circuit provides a convenient way to tune 
a transmitter or receiver to the frequency w1. 


e Example 12.8.10: Delay differential equation 

Laplace transforms provide a valuable tool in solving a general class of ordinary dif- 
ferential equations called delay differential equations. These equations arise in such diverse 
fields as chemical kinetics?? and population dynamics.?? 

To illustrate the technique,?? consider the differential equation 


x’ = —azx(t— 1) (12.8.62) 


with x(t) = 1 — at for 0 < t < 1. Clearly, z(0) = 1. 
Multiplying Equation 12.8.62 by e^ 5* and integrating from 1 to oo, 


li (he dt = -a | a(t — 1)e™* dt (12.8.63) 
1 1 


28 See Roussel, M. R., 1996: The use of delay differential equations in chemical kinetics. J. Phys. Chem., 
100, 8323-8330. 


?9 See the first chapter of MacDonald, N., 1989: Biological Delay Systems: Linear Stability Theory. 
Cambridge University Press, 235 pp. 


30 See Epstein, I. R., 1990: Differential delay equations in chemical kinetics: Some simple linear model 
systems. J. Chem. Phys., 92, 1702-1712. 
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Figure 12.8.6: The solution to the delay differential equation, Equation 12.8.62, at various times t and 
values of a. 


oo 1 oo 
n z'(t)e ** dt — n a (e ues -a | z(r)e 8+) dr (12.8.64) 
0 0 0 

1 

sX(s)— 1+ af e * dt = —ae™® X (s) (12.8.65) 
0 

sX(s) —1— a e = —ae ?X(s) (12.8.66) 

E 


since z'(t) = —a for 0 < t < 1. Solving for X(s), 
X(s) = (1-- ae ?/s — a/s)/[s(1 + ae */s)]. (12.8.67) 


To facilitate the inversion of Equation 12.8.67, we expand its denominator in terms of a 
geometric series and find that 


X(s) - S (~a) e"s an 4 Soe _ So (aye bos unn. 
n=0 n=0 n=0 
(12.8.68) 
The first and third sums cancel, except for the n = 0 term in the first sum. Therefore, 
1 oo 
X(s) = = + Y (~a) e s jan? (12.8.69) 
P n=0 
and M 
z(t) 21 M7 a" Ht n)(t — n) (12.8.70) 
2-4 (n+1)! Í E 


Figure 12.8.6 illustrates Equation 12.8.70 as a function of time for various values of 
a. For 0 < a < el, x(t) decays monotonically from 1 to an asymptotic limit of zero. 
For e! < a < 7/2, the solution is a damped oscillatory function. If 7/2 < a, then x(t) 
is oscillatory with an exponentially increasing envelope. When a — 7/2, x(t) oscillates 
periodically. o 
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e Example 12.8.11 


Laplace transforms can sometimes be used to solve ordinary differential equations where 
the coefficients are powers of t. To illustrate this, let us solve 


y" + 2ty' — 4y — 0, y(0)=1, lim y(t) > 0. (12.8.71) 


We begin by taking the Laplace transform of Equation 12.8.71 and find that 


s?Y (s) — sy(0) — y'(0) 23 [sY (s) y(0)] — 4Y (s) = 0. (12.8.72) 


An interesting aspect of this problem is the fact that we do not know y'(0). To circumvent 
this difficulty, let us temporarily set y'(0) = —A so that Equation 12.8.72 becomes 


L (5 >) pon x (12.8.73) 
S 


2 2s 


Later on, we will find A. 
Equation 12.8.73 is a first-order, linear, ordinary differential equation with s as its 
independent variable. To find Y (s), we use the standard technique of multiplying it by its 


integrating factor, here u(s) — 53675 /^. and rewriting it as 
d 
us 40] = lA — 183e, (12.8.74) 


Integrating Equation 12.8.74 from s to oo, we obtain 
s*e-* "^y (s) = (s? 4e 9 )4 — A [acci + Vm erfc(s/2)| , (12.8.75) 
Or 


Y (s) = pan ANT E ai (12.8.76) 


s s 82 s3 


We must now evaluate A. From the final-value theorem, lim, ,55 y(t) = lims—o sY (s) = 
0. Therefore, multiplying Equation 12.8.76 by s and using the expansion for the comple- 
mentary error function for small s, we have that 
4 A Ar s? s 


14 ee. 12.8.77 
s s? 4 Vm a ( ) 


In order that lim,.,9 sY (s) 2 0, A = 4/y/r. Therefore, 


4 1 4 4 2 
= P 2). 12.8.7 
Y (s) z3 + 3 me ae erfc(s/2) ( 8) 


The final step is to invert Equation 12.8.78. Applying tables and the convolution 
theorem, 


3 4t 4 f E ETTE Labs ee 
y(t) 2 20 +1 Ju ait z)?e-* dx = (2? +1)[1 — erf(t)] m . (12.8.79) 
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Problems 


Solve the following ordinary differential equations by Laplace transforms. Then use MATLAB 
to verify your solution. 


1. y-2y-21-t; y(0)=1 


2. y'—4y -F3y —e5 y(0) =0,y/(0) =0 


T. yf +4y'+3y=e%  y(0 =0,y'(0) =2 


8. y”—4y' +5y=0; y(0) = 2,y/(0) = 4 
9. y'+y=tH(t-1); y(0)=0 


10. y” +3y+2y=HA(t—1); y(0) =0,y'(0) =1 


11. y”— 3y +2y=H(t—1); y(0)=0,y/(0) =1 
12. y”’+4y=3A(t—4); y(0)=1,y'(0) =0 


13. y” + 4y + 4y = 4H(t— 2); y(0) =0,y/(0) = 0 


14. y” +3y' +2y =e 1H(t—1); y(0) =0,y/(0) = 1 


15. y" —3y+2y=e""A(t— 2); y(0)=0,y'(0) =0 


16. y'—3y + 2y = H(t-1)- H(t-2)  y(0) =0,y/(0) =0 


n La J sin(t), 0xtzm, NA 

18. y il 0, EE y(0) = 0, y'(0) =0 
t, 0<t<a, 

19. e$ +20={ uota, "m" y(0) = 0,y'(0) — 0 
t/a, O<t<a, 

20. y” +wy = 4 1—(t—a)/(b—a), a<t<b,  y(0020,y'(0) =0 
0, b<t; 


21. y'"—2y' +y=3ő(t-2); y(0) =0,y'(0) =1 
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22. y” —5y'+4y=d(t—1); w(0)—0,y'(0) — 0 


23. y" + 5y’ + 6y —3ó0(t—2) — 40(£— 5); y(0)—y'(0)-0 


24. y" E wy' = Aó(t- 7) - BH(t—T); y(0) =y'(0) =0 
25. x'—2r+y=0, y —3r—4y—0; x(0)—1,9(0)—0 
26. z'—2y —1, wv +y-x=0; x(0)—y(0)—0 


27. aw’ +2e—-y'=0, 2’ +y+2=8#?; 2x(0)=y(0) =0 


28. a’ +3a-y=1, a +y +3x=0; «(0) = 2,y(0) =0 
29. Forster, Escobal, and Lieske?! used Laplace transforms to solve the linearized equa- 
tions of motion of a vehicle in a gravitational field created by two other bodies. A simplified 


form of this problem involves solving the following system of ordinary differential equations: 


xz" — 23 = Fi +z + 2y, Qa’ + y" = Fo + 2x + 3y, 


subject to the initial conditions that x(0) = y(0) = z'(0) = y'(0) = 0. Find the solution to 
this system. 


Use Laplace transforms to find the solution for the following ordinary differential equations: 


30. y" + 2ty' — 8y = 0, y(0)=1, y'(0)=0 

Step 1: Show that the Laplace transform for this differential equation is 2sY’(s) + (10 — 
s?YY (s) = —s. 

Step 2: Solve these first-order ordinary differential equations and show that Y (s) = 1/s + 
8/53 + 32/59 + Aes" /^ 35. 

Step 3: Invert Y (s) and show that the general solution is y(t) = 1 + 4t? + 4t*/3. 


31. y" — ty’ + 2y — 0, y(0 —-—1, y’(0) =0 

Step 1: Show that the Laplace transform for this differential equation is sY’(s) + (s? + 
3)Y (s) = —s. 

Step 2: Solve the first-order ordinary differential equations and show that Y (s) = (A — 
2)e7 5/2/53 2/53 — 1/s. 

Step 3: Invert Y (s) and show that the general solution is y(t) = t? — 1. 


32. ty" - (2—t)y -y =0 

Step 1: Show that the Laplace transform for this differential equation is s(s + 1)Y'(s) + 
2(2s + 1)Y (s) = 3y(0). 

Step 2: Solve the first-order ordinary differential equation and show that Y (s) = y(0)/ 
(s+ 1) + y(0)/[2(s + 1)?] + A/[s? (s + 1)?]. 

Step 3: Invert Y (s) and show that the general solution is y(t) = Ci(t + 2)e^* + Co(t — 2). 


31 Forster, K., P. R. Escobal, and H. A. Lieske, 1968: Motion of a vehicle in the transition region of the 
three-body problem. Astronaut. Acta, 14, 1—10. 
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33. ty" — 2(a + bt)y' + b(2a + bt)y = 0, a>0 

Step 1: Show that the Laplace transform for this differential equation is (s — b)?Y’(s) + 
2(1 + a)(s — b)Y (s) = (1 + 2a)y(0). 

Step 2: Solve the first-order ordinary differential equation and show that Y (s) = y(0)/(s — 
b) + A/(s — b)?*?«, 

Step 3: Invert Y (s) and show that the general solution is y(t) = Cie + C5t?«*1e^t, 


12.9 INVERSION BY CONTOUR INTEGRATION 


In Section 12.5 and Section 12.6 we showed how we can use partial fractions and 
convolution to find the inverse of the Laplace transform F(s). In many instances these 
methods fail simply because of the complexity of the transform to be inverted. In this 
section we shall show how we can invert transforms through the powerful method of contour 
integration. Of course, the student must be proficient in the use of complex variables. 

Consider the piece-wise differentiable function f(x), which vanishes for x < 0. We can 
express the function e~% f(x) by the complex Fourier representation of 


f(zr) = 0 d a n e “ f(t)e *** dt| dw, (12.9.1) 


2v —oo 0 


for any value of the real constant c, where the integral 


r= f e “| f(t)| dt (12.9.2) 


exists. By multiplying both sides of Equation 12.9.1 by e** and bringing it inside the first 
integral, 


1 S ; ve . 
f(z) = = | pierda ff fie ee a| dw. (12.9.3) 
2T J. o6 0 
With the substitution z = c + wi, where z is a new, complex variable of integration, 
1 ctoot oo 
fou zl e? / fte a| dz. (12.9.4) 
2m Je ooi 0 


The quantity inside the square brackets is the Laplace transform F(z). Therefore, we can 
express f(t) in terms of its transform by the complex contour integral 


ctoot 
f = a | F(z)e* dz. (12.9.5) 


This line integral, the Bromwich integral;?? runs along the line x = c parallel to the imagi- 
nary axis and c units to the right of it, the so-called Bromwich contour. We select the value 


32 Bromwich, T. J. PA., 1916: Normal coordinates in dynamical systems. Proc. London Math. Soc., 
Ser. 2, 15, 401-448. 
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An outstanding mathematician at Cambridge University at the turn of the twentieth century, 
Thomas John l'Anson Bromwich (1875-1929) came to Heaviside's operational calculus through 
his interest in divergent series. Beginning a correspondence with Heaviside, Bromwich was able to 
justify operational calculus through the use of contour integrals by 1915. After his premature death, 
individuals such as J. R. Carson and Sir H. Jeffreys brought Laplace transforms to the increasing 
attention of scientists and engineers. (Portrait courtesy of the Royal Society of London.) 


of c sufficiently large so that the integral, Equation 12.9.2, exists; subsequent analysis shows 
that this occurs when c is larger than the real part of any of the singularities of F(z). 

We must now evaluate the contour integral. Because of the power of the residue 
theorem in complex variables, the contour integral is usually transformed into a closed 
contour through the use of Jordan's lemma. See Section 11.4, Equations 11.4.9 and Equation 
11.4.10. The following examples will illustrate the proper use of Equation 12.9.5. 


e Example 12.9.1 


Let us invert 
(12.9.6) 


From Bromwich’s integral, 


1 c+ooi (t—3)z 1 (t—3)z 1 (t—3)z 
50 zl - dz= j - dz f - dz, (12.9.7) 


-æi Z?(z — 1) 21i Jo z?(z — 1) ami Jo, z2(z — 1) 


where Cg is a semicircle of infinite radius in either the right or left half of the z-plane and 
C is the closed contour that includes Cg and Bromwich's contour. See Figure 12.9.1. 
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Figure 12.9.1: Contours used in the inversion of Equation 12.9.6. 


Our first task is to choose an appropriate contour so that the integral along Cg vanishes. 
By Jordan's lemma this requires a semicircle in the right half-plane if t— 3 < 0 and a 
semicircle in the left half-plane if t— 3 > 0. Consequently, by considering these two separate 
cases, we force the second integral in Equation 12.9.7 to zero and the inversion simply equals 
the closed contour. 

Consider the case t < 3 first. Because Bromwich's contour lies to the right of any 
singularities, there are no singularities within the closed contour and f(t) = 0. 

Consider now the case t > 3. Within the closed contour in the left half-plane, there is 
a second-order pole at z = 0 and a simple pole at z = 1. Therefore, 


f(t) = Res ee en bauhes = a (12.9.8) 
7 z?(z—1) z?^(z—1) |’ i 
where 
(t-3)2 d (t—3)z (t — 3) (t—3)z (t—3)z 
€ e e e 
Res | ————-;0| = lim — | z? ——__| = li = t 
=| « N x el? 2-1) im, | z-1 ad 
(12.9.9) 
and 
(t—3)z (t—3)2 
seit eee 7 irata = £ — ,t-3 
Res| 7 = 2 = lim (2 DzG Ee (12.9.10) 
Taking our earlier results into account, the inverse equals 
f(t) = [e7 — (t - 3) - 1] H- 3), (12.9.11) 


which we would have obtained from the second shifting theorem and tables. 
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* 3ni/a A 

* 2Ti/a 

e Ti/a 

+ Lodi 
$- mi/a (60) 
$ -2 Ti/a 

e-3ni/a i 


Figure 12.9.2: Contours used in the inversion of Equation 12.9.12. 
e Example 12.9.2 


For our second example of the inversion of Laplace transforms by complex integration, 
let us find the inverse of 


1 
PS) > SEDEM Ne (12.9.12) 
where a is real. From Bromwich’s integral, 
1 c+oot et? 


Here c is greater than the real part of any of the singularities in Equation 12.9.12. Using 


the infinite product for the hyperbolic sine,?? 
e et 
i = . (12.9.14) 
zsinh(az) az?[1 + a?z?/r?][1 + a?z?2/(4r?)][1 + a?22/(972)] - - - 
Thus, we have a second-order pole at z = 0 and simple poles at zn = -nmi/a, where 


m= l2 aye 

We can convert the line integral Equation 12.9.13, with the Bromwich contour lying 
parallel and slightly to the right of the imaginary axis, into a closed contour using Jordan’s 
lemma through the addition of an infinite semicircle joining ?oo to —ioo, as shown in Figure 
12.9.2. We now apply the residue theorem. For the second-order pole at z = 0, 


e 1 d tee d zet” 
-0| =— hii a EL SET. 6: 
EET |j | 11259 dz | zsinh(az) | 230 dz E ea) 
sia | lg He eee a quaeub 
250 |sinh(az) —sinh(az) sinh" (az) 


33 Gradshteyn, I. S., and I. M. Ryzhik, 1965: Table of Integrals, Series and Products. Academic Press, 
Section 1.431, Formula 2. 
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after using sinh(az) = az + O(z?). For the simple poles zn = -En7i/a, 
e (z " Zn je” et? 
Res | —————-; z, | = lim —————— = li - 12.9.17 
m | sinh(az)’ 3 | atin z sinh(az) itn sinh(az) + az cosh(az) ( ) 


| exp(anzit/a) 
S (12.9.18) 


because cosh(+nri) = cos(nr) = (—1)”. Thus, summing up all of the residues gives 


f(t) = : z D (—1)" experi) »» (—1)" Fep Pee (12.9.19) 
= - + : S 2 sin(nzt/a). (12.9.20) 


In addition to computing the inverse of Laplace transforms, Bromwich’s integral places 
certain restrictions on F(s) in order that an inverse exists. If a denotes the minimum value 
that c may possess, the restrictions are threefold.?^ First, F(z) must be analytic in the 
half-plane x > o, where z = x + iy. Second, in the same half-plane it must behave as z~*, 


where k > 1. Finally, F(x) must be real when x > a. 
e Example 12.9.3 


Is the function sin(s)/(s? + 4) a proper Laplace transform? Although the function 
satisfies the first and third criteria listed in the previous paragraph on the half-plane x > 2, 
the function becomes unbounded as y — +00 for any fixed x > 2. Thus, sin(s)/(s? + 4) 
cannot be a Laplace transform. 


e Example 12.9.4 


An additional benefit of understanding inversion by the residue method is the ability 
to qualitatively anticipate the inverse by knowing the location of the poles of F(s). This 
intuition is important because many engineering analyses discuss stability and performance 
entirely in terms of the properties of the system's Laplace transform. In Figure 12.9.3 we 
have graphed the location of the poles of F(s) and the corresponding f(t). The student 
should go through the mental exercise of connecting the two pictures. 


Problems 


Use Bromwich's integral to invert the following Laplace transforms: 


s+1 1 


LFG)— Gy oye 3) 


34 For the proof, see Churchill, R. V., 1972: Operational Mathematics. McGraw-Hill, Section 67. 
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s-plane fit) 
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s-plane fit) 


| s-plane Kt) 
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s-plane fit) 


Tk 


x 
t 
x 
s-plane Kt) 
x 
t 
x 


Figure 12.9.3: The correspondence between the location of the simple poles of the Laplace transform F(s) 
and the behavior of f(t). 
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1 1 
3. F(s) = s(s — 2j 4. F(s) = s(s + a)2(s? + b?) 
e? 1 
SC 5 S.P) TE e) 
1 1 
12d ar ren pom eT PME secu 


9. Consider a function f(t) that has the Laplace transform F(z), which is analytic in the 
half-plane Re(z) > so. Can we use this knowledge to find g(t), whose Laplace transform 
G(z) equals F[o(z)], where (z) is also analytic for Re(z) > so? The answer to this question 
leads to the Schouten??-Van der Pol’? theorem. 


Step 1: Show that the following relationships hold true: 


oo ctoot 
G(z) = Fly(z)] = f fre" dr, and = g(t) = / Fly(z)let dz. 


Step 2: Using the results from Step 1, show that 


oo 1 ctoot 
g(t) =|} FCT) EI g verge ie dr. 
0 At Jc —ooi 


This is the Schouten-Van der Pol theorem. 


Step 3: If G(z) = F(4/z) show that 


g(t) — wal Tf(r) ew(- 4) dr. 


Hint: Do not evaluate the contour integral. Instead, ask yourself: What function of time 
has a Laplace transform that equals e^ * 7, where is a parameter? Then use tables. 


12.10 THE SOLUTION OF THE WAVE EQUATION 
The solution of linear partial differential equations by Laplace transforms is the most 
commonly employed analytic technique after separation of variables. Because the transform 


consists solely of an integration with respect to time, the transform U(x, s) of the solution 
of the wave equation u(x, t) is 


U (x, s) =| u(x, te * dt, (12.10.1) 
0 


assuming that the wave equation only varies in a single spatial variable x and time t. 


35 Schouten, J. P., 1935: A new theorem in operational calculus together with an application of it. 
Physica, 2, 75-80. 


36 Van der Pol, B., 1934: A theorem on electrical networks with applications to filters. Physica, 1, 
521-530. 
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Partial derivatives involving time have transforms similar to those that we encountered 
in the case of functions of a single variable. They include 


£ [ui (x, t)] = sU (x, s) — u(z, 0), (12.10.2) 


and 
Llu (c, t)] = s?U (x, s) — su(z, 0) — uz (ax, 0). (12.10.3) 


These transforms introduce the initial conditions via u(z,0) and u;(x,0). On the other 
hand, derivatives involving x become 


£[us (a, t)] = 4 {Liu(2, t)]} = aea, (12.10.4) 
and " P 
Lluss(s,t)] = 4 lule, £)]) = EM (12.10.5) 


Because the transformation eliminates the time variable, only U (x, s) and its derivatives 
remain in the equation. Consequently, we transform the partial differential equation into a 
boundary-value problem involving an ordinary differential equation. Because this equation 
is often easier to solve than a partial differential equation, the use of Laplace transforms 
considerably simplifies the original problem. Of course, the Laplace transforms must exist 
for this technique to work. 

The following schematic summarizes the Laplace transform method: 


partial differential equation 


+ initial conditions solution to original problem 


| 


Laplace transform Inverse transform 


t 


ordinary differential equation solution of boundary-value 
+ boundary conditions problem 


In the following examples, we illustrate transform methods by solving the classic equa- 
tion of telegraphy as it applies to a uniform transmission line. The line has a resistance R, 
an inductance L, a capacitance C, and a leakage conductance G per unit length. We denote 
the current in the direction of positive x by I; V is the voltage drop across the transmission 
line at the point x. The dependent variables J and V are functions of both distance x along 
the line and time t. 

To derive the differential equations that govern the current and voltage in the line, 
consider the points A at x and B at x + Az in Figure 12.10.1. The current and voltage 
at A are I(x,t) and V(z,t); at B, I + Sl Ax and V + WV Aa. Therefore, the voltage drop 
from A to B is -X Ar and the current in the line is J + 2L Ag. Neglecting terms that are 
proportional to (Az)?, 


+ boundary conditions 


OI OV 
(5 + rt) At = — 9, Ae: (12.10.6) 
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A H B 
—- OOO 
I L Ax R Ax r+ dt ax 
òx 


a eo oV 
V CAx____ 2 G Ax VES Ax 


ôl 
| 6x 
e . 


x K x+Ax 


Figure 12.10.1: Schematic of a uniform transmission line. 


The voltage drop over the parallel portion HK of the line is V while the current in this 
portion of the line is -2l Az. 'Thus, 


aV ar 
(65; je cv) Az. Ac. (12.10.7) 


Therefore, the differential equations for J and V are 


al av 

L RI x (12.10.8) 
s av 8l 

CAPS GV =- (12.10.9) 


Turning to the initial conditions, we solve these simultaneous partial differential equa- 
tions with the initial conditions 
I(z,0) = Io(a), (12.10.10) 


and 
V(z,0) = Vo(z) (12.10.11) 


for 0 « t. There are also boundary conditions at the ends of the line; we will introduce 
them for each specific problem. For example, if the line is short-circuited at x = a, V = 0 
at x = a; if there is an open circuit at x = a, I = 0 at x = a. 

To solve Equation 12.10.8 and Equation 12.10.9 by Laplace transforms, we take the 
Laplace transform of both sides of these equations, which yields 


(Ls + R)I(a,s) = up + LIo(a), (12.10.12) 
and Fe 
(Cs + G)V (x, s) = a + CVo(z). (12.10.13) 


Eliminating I gives an ordinary differential equation in V 


E 
depu eom. C(Ls + R)Vo(z), (12.10.14) 
da? dx 
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where q? = (Ls + R)(Cs + G). After finding V, we may compute J from 


1 dV , Lly(z) 


T= . 
Is+Rdx Ls+R 


(12.10.15) 


At this point we treat several classic cases. 
e Example 12.10.1: The semi-infinite transmission line 


We consider the problem of a semi-infinite line 0 < x with no initial current and charge. 
The end x = 0 has a constant voltage E for 0 < t. 
In this case, 
eV a 
qz tV=0, Ozn (12.10.16) 


'The boundary conditions at the ends of the line are 
V(0,t) = E, 0 « t, (12.10.17) 


and V(z,t) is finite as x — oo. The transform of these boundary conditions is 


V(0,s) = E/s, and lim V(a,s) — 0. (12.10.18) 


Z— 00 


The general solution of Equation 12.10.16 is 
V(a,s) = Ae ** + Bet". (12.10.19) 


The requirement that V remains finite as x — oo forces B = 0. The boundary condition at 
x = 0 gives A = E/s. Thus, 


V(z,s) = = exp |- Vs R(Cs +G) (12.10.20) 


We discuss the general case later. However, for the so-called “lossless” line, where R = G = 
0, 


E 
V(z,s) = — exp(—sz/c), (12.10.21) 
E 
where c = 1/V LC. Consequently, 
V(z,t) = EH(t = 3 (12.10.22) 
C 


where H(t) is Heaviside’s step function. The physical interpretation of this solution is as 
follows: V (x,t) is zero up to the time z/c at which time a wave traveling with speed c from 
x = 0 would arrive at the point x. V (x,t) has the constant value E afterwards. 

For the so-called “distortionless” line?" R/L = G/C = p, 


V(z,t) = Ee-**'* p (t = 3 (12.10.23) 
C 


37 Prechtl and Schürhuber (Prechtl, A., and R. Schürhuber, 2000: Nonuniform distortionless transmission 
lines. Electr. Eng. [Berlin], 82, 127—134) generalized this problem to nonuniform transmission lines. 
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In this case, the disturbance not only propagates with velocity c but also attenuates as we 
move along the line. 

Suppose now, that instead of applying a constant voltage E at x = 0, we apply a 
time-dependent voltage, f(t). The only modification is that in place of Equation 12.10.20, 


V(a,s) = F(s)e **. (12.10.24) 
In the case of the distortionless line, q = (s + p)/c, this becomes 


V(v,s) = F(s)e- 6*9/c (12.10.25) 


and venz PN _ =) H(t 2 3 : (12.10.26) 


Thus, our solution shows that the voltage at x is zero up to the time z/c. Afterwards V (x, t) 
follows the voltage at x = 0 with a time lag of a/c and decreases in magnitude by e~??/°. 


e Example 12.10.2: The finite transmission line 


We now discuss the problem of a finite transmission line 0 < x < l with zero initial 
current and charge. We ground the end x = 0 and maintain the end xz = l at constant 
voltage E for 0 « t. 

'The transformed partial differential equation becomes 


eV = 
3a q?V =0, O<a<l. (12.10.27) 
The boundary conditions are 
V(0,t)=0, and V(Lt)- E, O0<t. (12.10.28) 


The Laplace transform of these boundary conditions is 


V(0,s)=0, and V(l,s)- Ejs. (12.10.29) 
The solution of Equation 12.10.27 that satisfies the boundary conditions is 


= E sinh(qz) 
V ——————. 12.10.30 
(2,8) s sinh(ql) ( ) 
Let us rewrite Equation 12.10.30 in a form involving negative exponentials and expand the 
denominator by the binomial theorem, 


PN E 1 — e-?d 
V(x, s) = x xm pem 
E 
= ee a9 = pum) (1 us e 2d + etd Je 3 (12.10.32) 
E 
= E [e «0-2 — e-4(L*x) 4 e-a(3l-2) _ g-a(3Lez) 4, ‘|. (12.10.33) 


S 


In the special case of the lossless line where q = s/c, 


V(x s) - E [o-*-2)« = e s(ltx)/c E e 5Gl-2)/c m e 5QGUEz)/c HERES ]. (12.10.34) 
S 
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direct " twice thrice reflected 
E  ———— | =- — ar 
i ] í Loye 
(L-x)/c (I+x)/c (3l-x)/c (3l+x)/c 


Figure 12.10.2: The voltage within a lossless, finite transmission line of length l as a function of time t. 


Or 


V (z,t) - E|n(: —) a 2) a(t x) a(t E) al 
c c c c 

(12.10.35) 

We illustrate Equation 12.10.35 in Figure 12.10.2. The voltage at x is zero up to the 
time (l — z)/c, at which time a wave traveling directly from the end x = l would reach the 
point zx. The voltage then has the constant value E up to the time (l + z)/c, at which time 
a wave traveling from the end x = l and reflected back from the end x = 0 would arrive. 
From this time up to the time of arrival of a twice-reflected wave, it has the value zero, and 
so on. 


e Example 12.10.3: The semi-infinite transmission line reconsidered 


In the first example, we showed that the transform of the solution for the semi-infinite 
line is 
EN E 4 
V(z,s)- —e **, (12.10.36) 
E 


where q? = (Ls + R)(Cs+G). In the case of a lossless line (R = G = 0), we found traveling 
wave solutions. 

In this example, we shall examine the case of a submarine cable,?? where L = G = 0. 
In this special case, 


E 
V(z,s) = —e-*V s/s, (12.10.37) 


where & = 1/(RC). From a table of Laplace transforms,?? 


Equation 12.10.37 and find that 


we can immediately invert 


V(a,t)=E erfe( (12.10.38) 


g 
2 rt) 
where erfc is the complementary error function. Unlike the traveling wave solution, the 


voltage diffuses into the cable as time increases. We illustrate Equation 12.10.38 in Figure 
12.103. 


38 First solved by Thomson, W., 1855: On the theory of the electric telegraph. Proc. R. Soc. London, 
Ser. A, 7, 382-399. 


39 See Churchill, R. V., 1972: Operational Mathematics. McGraw-Hill Book, Section 27. 
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V(x,tyE 


DE 10 100 1000 
X 


Figure 12.10.3: The voltage within a submarine cable as a function of distance for various values of kt. 
e Example 12.10.4: A short-circuited, finite transmission line 


Let us find the voltage of a lossless transmission line of length l that initially has the 
constant voltage E. At t = 0, we ground the line at x = 0 while we leave the end x = l 
insulated. 

'The transformed partial differential equation now becomes 


dV s sE 


V= 12.10.3 
dr? c ve Cc? ( B) 
where c = 1/V LC. The boundary conditions are 
V(0, s) = 0, (12.10.40) 
and m 
= 1 dV(l,s) 
= = 12.10.41 
T(,s)=-p- 2" =0 ) 
from Equation 12.10.15. 
The solution to this boundary-value problem is 
= his — 
Vus ac c (12.10.42) 


E s cosh(sl/c) 


The first term on the right side of Equation 12.10.42 is easy to invert and the inversion 
equals E. 'The second term is much more difficult to handle. We will use Bromwich's 
integral. 

In Section 12.9 we showed that 


_ı f cosh[s(l — z)/]| | 1  f****eosh[z(l — z)/c]e'* 
E { s cosh(sl/c) \ x z cosh(zl/c) a 


(12.10.43) 


—ooí 


To evaluate this integral we must first locate and then classify the singularities. Using the 
product formula for the hyperbolic cosine, 


c utm a Az? (1-2)? 

242 F 9c2n2 Je 

= — cT " 12.10.44 
z cosh(zl/c) 2[1 + an (2 4 422 i ( ) 


9c? 12 


cosh[z(L— x)/c] | [1+ 
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This shows that we have an infinite number of simple poles located at z = 0, and z, = 
+(2n — 1)zci/ (20), where n = 1,2,3,.... Therefore, Bromwich’s contour can lie along, and 
just to the right of, the imaginary axis. By Jordan's lemma we close the contour with a 
semicircle of infinite radius in the left half of the complex plane. Computing the residues, 


cosh[z(|l— z)/c]e* |)  ,, cosh[z(l— z)/cje^ — 
s { z cosh(zl/c) 0] 20 cosh(zl /c) =1, (12.10.45) 
and 
cosh[z (1 — z)/c]e'^ m (z _ Zn) cosh[z(1 m x) /cjet* 
ER { zZ cosh(zl/c) ; en} - I Zz cosh(zl/c) (12.10.46) 


. cosh[(2n — 1)z(l — x)i/ (21)] exp|-E(2n — 1)mcti/(21)] 
[((2n — 1)ri/2] sinh[(2n — 1)ri/2] 


(12.10.47) 
2(—1)" (2n — 1)n(l — x) (2n — 1)reti 
(2n - Ds zi 2l | iR | 2l | l 
(12.10.48) 


Summing the residues and using the relationship that cos(t) = (e** + e~)/2, 


V.) - E zh : D (-1)*t cos [= 1)r(l — 2 cos [S pen) 


= opc 2l 2l 
(12.10.49) 
4E  (-1)"*1 2n — 1)n(0 — 2n — 1)rct 
-2y CU cos! E Jat 2] cos |£ i j )ne I (12.10.50) 
n=1 
An alternative to contour integration is to rewrite Equation 12.10.42 as 
a E e7 s/c [1 £ pP VA 
V(z,s) = r t i323 (12.10.51) 
E —sc/c —s(21—2)/c —s(2l--x)/c 
=— i-e —e +e +] (12.10.52) 
E 
so that 
2l — 2l 
Vat) = E[ H(t 3 (i z) | a(t =) e (12.10.53) 
c c c 


e Example 12.10.5: The general solution of the equation of telegraphy 


In this example we solve the equation of telegraphy without any restrictions on R, C, 
G, or L. We begin by eliminating the dependent variable I(x, t) from the set of equations, 
Equation 12.10.8 and Equation 12.10.9. This yields 
av OPV 


OV 
CLP +(GL+ RC) >, +RGV = Dat (12.10.54) 
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We next take the Laplace transform of Equation 12.10.54 assuming that V(x,0) = f(x), 
and V;(r,0) = g(a). The transformed version of Equation 12.10.54 is 


d?V = 
e [CLs? + (GL + RC)s + RG|V = —CLg(z) — (CLs + GL + RC) f(x), (12.10.55) 
ia 2V 2 2 
dV (s+p)*-—o*, glx) s | 2p 
da? ucc mE at a) Te), (12.10.56) 


where c? = 1/LC, p= (RC + GL)/2, and o = c?(RC — GL)/2. 

We solve Equation 12.10.56 by Fourier transforms (see Section 11.6) with the require- 
ment that the solution dies away as |x| — oo. The most convenient way of expressing this 
solution is the convolution product (see Section 11.5) 


Vides [zo " (: " z ræ) Ps eel A Claw OS (12.10.57) 


24/ (s + p)? — c? 


From a table of Laplace transforms, 


s2 — a2 


co [s ERI = (av=?) na - 9. (12.10.58) 


where b > 0 and Jo( ) is the zeroth order modified Bessel function of the first kind. Therefore, 
by the first shifting theorem, 


exp [-lal s + 07 = o/c] 
Jeri a 


= e” To [o g- (e/o? a(t = zI) . (12.10.59) 


Using Equation 12.10.59 to invert Equation 12.10.57, we have that 


V(z,t) = de-"'g(z) * Io [ovt — oP] H(t — lal/c) 
de" fi) + È Ule P E} Hl- ijo 
+ £e7? f(x) « lov? = (x/c | H(t — lal/c) 


+ de" f(e et) + f(e- et). (12.10.60) 


The last term in Equation 12.10.60 arises from noting that sF(s) = L[f(t)] + f(0). If we 
explicitly write out the convolution, the final form of the solution is 


V(a,t) = ge" [f(a + ct) + f(x — ct)] 


Hae” f. o) + 2pf (no levem == f« dn 


—ct 


Lege i. DRE leva - e - f 4] dn. (12.10.61) 


x—ct 
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5 


0 ne 


Dis 4910 


Figure 12.10.4: The evolution of the voltage with time given by the general equation of telegraphy for 
initial conditions and parameters stated in the text. 


There is a straightforward physical interpretation of the first line of Equation 12.10.61. 
It represents damped progressive waves; one is propagating to the right and the other to 
the left. In addition to these progressive waves, there is a contribution from the integrals, 
even after the waves pass. These integrals include all of the points where f(x) and g(x) 
are nonzero within a distance ct from the point in question. This effect persists through all 
time, although dying away, and constitutes a residue or tail. Figure 12.10.4 illustrates this 
for p= 0.1, 0 = 0.2, and c= 1. This figure was obtained using the MATLAB script: 


^ initialize parameters in calculation 
clear; dx = 0.1; dt = 0.5; rho over c = 0.1; sigma over c = 0.2; 
X-[-10:dx:10]; T = [0:dt:10]; 4 compute locations of x and t 
for j-1:length(T); t = T(j); 
for i=1:length(X); x = X(i); 
XX(i,j) = x; TT(3,j) = t; detai = 0.05 % set up grid 
^4 compute characteristics x*ct and x-ct 
characteristic.1 = x - t; characteristic 2-2 x + t; 
^ compute first term in Equation 12.10.61 
F = inline(’stepfun(x,-1.0001)-stepfun(x,1.0001)’); 
u(i,j ) = F(characteristic 1) + F(characteristic 2); 
% find the upper and lower limits of the integration 
upper = characteristic_2; lower = characteristic_1; 
if t > 0 & upper > -1 & lower < 1 
if upper > 1 upper = 1; end 
if lower < -1 lower = -1; end 
^ set up parameters needed for integration 
interval = upper-lower; 
NN = interval / deta_i; 
if mod(NN,2) > O NN = NN + 1; end; 
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deta - interval / NN; 
^ compute integrals in Equation 12.10.61 by Simpson's rule 
^ sumi deals with the first integral while sum2 is the second 
sumi = 0; sum2 = 0; eta = lower; 
for k - 0:2:NN-2 
arg = sigma over c * sqrt(t*t-(x-eta)*(x-eta)); 
sumi = sumi + besseli(0,arg) ; 
if (arg == 0) 
sum2 = sum2 + 0.5 * sigma_over_c * t; 
else 
sum2 = sum2 + t * besseli(1,arg) / arg; end 
eta = eta + deta; 
arg = sigma over c * sqrt(t*t-(x-eta)*(x-eta)); 
sumi = sumi + 4*besseli(0,arg) ; 
if (arg == 0) 
sum2 = sum2 + 4 * 0.5 * sigma over.c * t; 
else 
sum2 = sum2 + 4 * t * besseli(1,arg) / arg; end 
eta = eta + deta; 
arg = sigma over c * sqrt(t*t-(x-eta)*(x-eta)); 
sumi = sumi + besseli(O,arg); 
if (arg == 0) 
sum2 = sum2 + 0.5 * sigma over c * t; 
else 
sum2 = sum2 + t * besseli(i1,arg) / arg; end 
end 
u(i,j) = u(i,j) + 2 * rho.over.c * deta * sumi / 3 
+ sigma over. c * deta * sum2 / 3; 
end 
^ multiply final answer by damping coefficient 
u(i,j) = 0.5 * exp(-rho_over_c * t) * u(i,j); 
end;end; 
^4 plot results 
mesh(XX,TT,real(u)); colormap spring; 
xlabel(’DISTANCE’ ,’Fontsize’ ,20); ylabel(C TIME? ,?Fontsize?,20) 
zlabel(’SOLUTION’ ,?Fontsize',20) 


We evaluated the integrals by Simpson's rule for the initial conditions f(x) = H(x +1) — 
H (x — 1), and g(x) = 0. If there was no loss, then two pulses would propagate to the left 
and right. However, with resistance and leakage the waves leave a residue after their leading 
edge has passed. 


Problems 


1. Use transform methods to solve the wave equation 


u Ou 


OB Og? O0<a<il, 0 « t, 


with the boundary conditions u(0,t) = u(1,t) = 0, 0 < t, and the initial conditions u(x, 0) = 
0, w(0,t) =10<a<1. 
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2. Use transform methods to solve the wave equation 


u u 

= = = 0 1, O<t 

ETE og» «rcli, <t, 
with the boundary conditions u(0,t) = uz(1,t) = 0, 0 < t, and the initial conditions 
u(z,0) = 0, u(0,t) 22,0 «€ x « I. 
3. Use transform methods to solve the wave equation 


u Ou 

—— = a 0 1, O<t 

ETE og «rcli, <t, 
with the boundary conditions u(0, t) = u(1,t) = 0, 0 < t, and the initial conditions u(x, 0) = 
sin(zz), usx, 0) = —sin(nz), 0< z « 1. 
4. Use transform methods to solve the wave equation 


u u 

Sm. E gu 0«rc«a, 0 « t, 
with the boundary conditions u(0, t) = sin(wt), u(a,t) = 0, 0 < t, and the initial conditions 
u(x,0) = u(z,0) 20,0 <x <a. Assume that wa/c is not an integer multiple of 7. Why? 


5. Use transform methods to solve the wave equation 


0? 9? 
am as O<a<L, Oct, 


with the boundary conditions uz(0,t) = — f (t), u;(D,t) = 0, 0 < t, and the initial conditions 
u(r,0) = uw(r,0 = 0, 0 < « « L. Hint: Invert the Laplace transform following the 
procedure used in Example 12.10.2. 


6. Use transform methods to solve the wave equation 
— q'(t), a«z«b, 0<t, 


with the boundary conditions u(a,t) = 0, uz(b,t) = 0, 0 < t, and the initial conditions 
u(r,0) = 0, u(z,0) = —q(0), a < x < b. Hint: To find U(z, s), express both U(z, s) and 
the right side of the ordinary differential equation governing U(x,s) in an eigenfunction 
expansion using sin{ (2n + 1)r(x — a)/[2(b — a)]). These eigenfunctions satisfy the boundary 
conditions. 


7. Use transform methods to solve the wave equation 


a? 9? 
5B ag = te 0<ar<w, 0<t, 


with the boundary conditions 


u(0,t)=1—e*, lim |u(z,t)| ~ x”, n finite, 0<t, 
z—0o0 


The Laplace Transform 


and the initial conditions u(z,0) = 0, u;(z,0) = z, 0 < x < oc. 


8. Use transform methods to solve the wave equation 


Qu Ou 
aB ~ pga = te 0«z «oo, 0 « t, 
T 


with the boundary conditions 
u(0,t) =cos(t), lim |u(z,t)| ~ x”, n finite, 0 « t, 
z—00 
and the initial conditions u(z,0) = 1, u;(z, 0) = 0, 0 < x < oc. 


9. Use transform methods to solve the wave equation 


u Ou 


Be gg 0<a<L, 0 « t, 
T 


with the boundary conditions 


O^u(L,t) k Ou(L,t) 
u(0,t) = 0, og ids ss dh 0 «t, 
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and the initial conditions u(z,0) = u;(z,0) = 0,0 < x < L, where k, m, and g are constants. 


10. Use transform methods^? to solve the wave equation 


u pac 


= 0 «1, 0«t 
88  ° Oz a), TEM ce Su 


with the boundary conditions 


lim lu(z,t) «oo, u(1,t) = Asin(wt), 0 « t, 
cz 


and the initial conditions u(z,0) = u;(z,0) = 0, 0 < x < 1. Assume that 2w Z cn, where 


Jo(Gn) = 0. Hint: The ordinary differential equation 
d ( dU 5 


U(x,s) = al (3v) + c2Ko(=vz) ; 


has the solution 


where Jo(x) and Ko(x) are modified Bessel functions of the first and second kind, respec- 


tively. Note that J,(iz) = i^I,(z) and I,(tz) = i"J,(z) for complex z. 


11. A lossless transmission line of length Z has a constant voltage E applied to the end 
x = 0 while we insulate the other end [V;(£,t) = 0]. Find the voltage at any point on the 


line if the initial current and charge are zero. 


40 Suggested by a problem solved by Brown, J., 1975: Stresses in towed cables during re-entry. J. Spacecr. 


Rockets, 12, 524—527. 
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12. Solve the equation of telegraphy without leakage 


Ou Ou Oru 
subject to the boundary conditions u(0, t) = 0, u(£, t) = E, 0 < t, and the initial conditions 
u(z,0) = u;(z,0) 20, 0 « x< £. Assume that 4r? L/CR?E > 1. Why? 


13. The pressure and velocity oscillations from water hammer in a pipe without friction?! 
are given by the equations 

Op 0u aad ðu 1 Op 

— = -pí >, n — =--—, 

at 8x at pda 
where p(r,t) denotes the pressure perturbation, u(x,t) is the velocity perturbation, c is 
the speed of sound in water, and p is the density of water. These two first-order partial 
differential equations can be combined to yield 

op — 40?p 


08 ^ Aa? 
Find the solution to this partial differential equation if p(0,t) — po, and u(L,t) — 0, and 
the initial conditions are p(z,0) = po, p(x, 0) = 0, and u(a,0) = uo. 


14. Use Laplace transforms to solve the wave equation*? 


u (s ,20u 2u 


aa | ; «r«oo, 0<t, 
Ot? ôr? rr :j MNT a 


subject to the boundary conditions that 


u(a,t) = A (1 — e7*/*) HG), lim u(rif) 30, — 0«t, 


Toco 


and the initial conditions that u(r,0) = uz(r,0) = 0, a < r < oo. Hint: The homogeneous 

solution to the ordinary differential equation 
dy m 2dy 2y 
d? rdr r? 


u cosh(br) ^ sinh(br) 1 1 E" 
SEIT cus po tO W.tgx]t ^ 


b’y =0 


is 


15. Use Laplace transforms to solve the wave equation? 


0? (ru) 2g O? (ru) 
Ot? Or? " 


a«r«oo, Oc«t, 


^! See Rich, G. R., 1945: Water-hammer analysis by the Laplace-Mellin transformation. Trans. ASME, 
67, 361—376. 


42 Wolf, J. P., and G. R. Darbre, 1986: Time-domain boundary element method in visco-elasticity with 
application to a spherical cavity. Soil Dynam. Earthq. Eng., 5, 138-148. 


43 Originally solved using Fourier transforms by Sharpe, J. A., 1942: The production of elastic waves by 
explosion pressures. I. Theory and empirical field observations. Geophysics, 7, 144—154. 
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subject to the boundary conditions that 
,(( O7u " 2 Ou 
PO Aor? ^ 3r Or 


where a > 0, and the initial conditions that u(r,0) = u;(r,0) = 0, a < r < oo. 


= poe “H(t), lim u(r,t) > 0, 0 « t, 


T— 00 


r=a 


16. Consider a vertical rod or column of length L that is supported at both ends. The 
elastic waves that arise when the support at the bottom is suddenly removed are governed 
by the wave equation ^^ 


u u 

B aa O0<a<L, 0<t, 

where g denotes the gravitational acceleration, c? = E/p, E is Young's modulus, and p is the 
mass density. Find the wave solution if the boundary conditions are u,(0,t) = uz(L, t) = 0, 
0 « t, and the initial conditions are 


gx? — Ou(x, 0) 


u(a, 0) = 202 ? Ot 


= 0, 0 «€ x « L. 


17. Use Laplace transforms to solve the hyperbolic equation 


u  O?u 
———-4H1-20 0 « 1, O<t 
OB ap * , qom Jy Sb 


subject to the boundary conditions that u,(0,t) = 0, uz(1,t) = 1, 0 < t, and the initial 
conditions that u(a#,0) = u;(z,0) 2 0, 0 «€ z « 1. 


18. Solve the telegraph-like equation ?? 


LM Ou _ 2 Oru, ðu 
Ot? Ot — Ox? Ox 


subject to the boundary conditions 


Ou(0,t) _ : 
De T —uod(t), Jim u(x,t) > 0, 0<t, 


and the initial conditions u(x,0) = uo, u;(z,0) = 0, 0 < z < oo, with ac > k. 


Step 1: Take the Laplace transform of the partial differential equation and boundary con- 
ditions and show that 


utes), UES) (PH) gia (E) 


dx? dx c? c? 


^4 See Hall, L. H., 1953: Longitudinal vibrations of a vertical column by the method of Laplace transform. 
Am. J. Phys., 21, 287-292. 


45 See Abbott, M. R., 1959: The downstream effect of closing a barrier across an estuary with particular 
reference to the Thames. Proc. R. Soc. London, Ser. A, 251, 426-439. 
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with U'(0, s) = —uo, and lim; ,s5 U(x, s) > 0. 


Step 2: Show that the solution to the previous step is 


exp E (s+ E) 4 eje 


? 


U(z,s)— Ot age 9/2 
E 


2 /(s+%)2+a2/c 
where 4a? = o? c? — k? > 0. 


Step 3: Using the first and second shifting theorems and the property that 


r (vara) - e no [SED] 
0 


t — 72 
show that 
t A(av? — 7? 
u(x,t) = uo + uoce? A(t — z/c) |e- 9*7? — a : wrest ) peer 2p ; 
a/c VP — T2 


19. As an electric locomotive travels down a track at the speed V, the pantograph (the 
metallic framework that connects the overhead power lines to the locomotive) pushes up 
the line with a force P. Let us find the behavior*® of the overhead wire as a pantograph 
passes between two supports of the electrical cable that are located a distance L apart. We 
model this system as a vibrating string with a point load: 


u u P s(t = 


= L t. 
ETE mp up 0crz«L, 0< 


Let us assume that the wire is initially at rest [u(z, 0) = u;(z,0) = 0 for 0 < x < L] and 
fixed at both ends [u(0, t) = u(L,t) = 0 for 0 < t]. 


Step 1: 'lake the Laplace transform of the partial differential equation and show that 


dU P 
ce (x, 8) $ 207900 


2 = 
s^U(z,s)— d 


Step 2: Solve the ordinary differential equation in Step 1 as a Fourier half-range sine series 


NTE 


U(a,s) = 2 B,(s) sin(277) ; 


where 


= 2P6,, 1 1 n,—Ls/V 
Bao zy ap perra ee] L C16. 


46 See Oda, O., and Y. Ooura, 1976: Vibrations of catenary overhead wire. Q. Rep., (Tokyo) Railway 
Tech. Res. Inst., 17, 134—135. 
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Qan = nzc/ L and B, = nzV/L. This solution satisfies the boundary conditions. 


Step 3: By inverting the solution in Step 2, show that 


u(z, t) 


_ 2P ` e V ee as (=) 


an, — Bn C an 5 L 


2 a(t 2) Dr sia (25) S e V inot AM) 


pL L an — Ba c an — Ba 


n=1 


or 


2P 3 m V c = (je) 


a Lon BR c o$—PBi 


L 
Heal DESC (50 toram 


m=i 


The first term in both summations represents the static uplift on the line; this term dis- 
appears after the pantograph passes. The second term in both summations represents the 
vibrations excited by the traveling force. Even after the pantograph passes, they continue 
to exist. 


20. Solve the wave equation 


18u u Louz i ó(r — a) TR obs 
— T a 
c0t? Or? rOr r? o2 ^C x , ! 


where 0 « a « a, subject to the boundary conditions 


Ou(a, t) 


h 
ar + gelat) = 0, 0 « t, 


lim |u(r, t)| < oo, 
r—0 
and the initial conditions u(r,0) = uj(r,0) 20,0 € r « a. 


Step 1: Take the Laplace transform of the partial differential equation and show that 


@U(r,s) | 1dU(r, s) Brod ó(r — a) 
} = ao Pd 
dr? r dr (5 n =) UNSS) saz’ a 
with di " 
ftd bas OO a ease. 
r—0 T a 


Step 2: Show that the Dirac delta function can be reexpressed as the Fourier-Bessel series 


= 2a > B2 Ji (Bna/a) 
a? L (BR +h? — 1)? (Bn) 


(r — a) Ji(Bnr/a), OSr<a, 


where 8, is the nth root of 8J1(8) +h J,(8) = BJo(B) + (h — 1)J1(8) = 0 and Jo( ), Ji() 
are the zeroth and first-order Bessel functions of the first kind, respectively. 


636 Advanced Engineering Mathematics with MATLAB 


Step 3: Show that the solution to the ordinary differential equation in Step 1 is 


22 A( (Bna/a) J 1(Bnr/a) $ 
& (B2 +h? — 1) JE (Ba) 5? + c? 5; [a? 


Note that this solution satisfies the boundary conditions. 


Step 4: Taking the inverse of the Laplace transform in Step 3, show that the solution to 
the partial differential equation is 


wot = ZS aes erra [s (8). 


21. Solve the hyperbolic equation 


+u=0, 0 « z,t, 


subject to the boundary conditions u(0, t) = e~*, lim, 400 u(x,t) — 0,0 < t, and u(z, 0) = 1, 
lim; 55 |u(a, t)| < Me*5, 0 < k, M,2,t. 


Step 1: 'lake the Laplace transform of the partial differential equation and show that 


dU 
QUE) +U =0, U(0,s) = lim U(z,s) > 0. 


dx S 1 3 2—00 


Step 2: Show that 


Step 3: Using tables and the convolution theorem, show that the solution is 


t 
u(x,t) = Jo(2Vzt) — e= f e Jo(2y aT ) dr 
0 
where Jo(-) is the Bessel function of the first kind and order zero. 


22. Solve the hyperbolic equation 


0?u Ou Ou 
b— = b, x,t 
Anat a re 0, 0<a,b,2,t, 


subject to the boundary conditions u(0, t) = e**, lim, +. u(x,t) — 0, 0 < t, and the initial 
conditions u(z, 0) = 1, limp+oo |u(z, t)| < Me™, 0 < k, M,t, a. 


Step 1: 'Take the Laplace transform of the partial differential equation and show that 


1 
+asU =a, U(0,s) = , lim U(z,s) > 0. 


Ss—c @—00 


(s 4- b) 


dU (x, s) 
dx 
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Step 2: Show that 


1 ni b 
U(r,s)—---4 = oo (2). 


Step 3: Using tables, the first shifting theorem, and the convolution theorem, show that 
the solution is 


t 
u(z,t)—1- aim] asf (2v ber ) dr, 


0 
where Io(-) is the modified Bessel function of the first kind and order zero. 


12.11 THE SOLUTION OF THE HEAT EQUATION 


In the previous section we showed that we can solve the wave equation by the method 
of Laplace transforms. This is also true for the heat equation. Once again, we take the 
Laplace transform with respect to time. From the definition of Laplace transforms, 


Llu(a,t)] = U (x, s), (12.11.1) 
E [us(z, t)] = sU (x, s) — u(x,0), (12.11.2) 

and à 
£[uss (2,t)] = COD (12.11.3) 


We next solve the resulting ordinary differential equation, known as the auxiliary equation, 
along with the corresponding Laplace transformed boundary conditions. The initial condi- 
tion gives us the value of u(x,0). The final step is the inversion of the Laplace transform 
U(a,s). We typically use the inversion integral. 


e Example 12.11.1 


To illustrate these concepts, we solve a heat conduction problem?" in a plane slab of 
thickness 2L. Initially the slab has a constant temperature of unity. For 0 < t, we allow 
both faces of the slab to radiatively cool in a medium that has a temperature of zero. 

If u(x,t) denotes the temperature, a? is the thermal diffusivity, h is the relative emis- 
sivity, t is the time, and z is the distance perpendicular to the face of the slab and measured 
from the middle of the slab, then the governing equation is 


Ou ,9^u 

a L L, O<t 12.11.4 
or ^ gg <r<L, 0<t, ( ) 

with the initial condition 
u(x,0)=1, -L«zc«L, (12.11.5) 

and boundary conditions 

L,t —L,t 

DuC) + hu(L,t) — 0, a d Fhu(—-L,t) 20, Oct. (12.11.6) 


47 Goldstein, S., 1932: The application of Heaviside's operational method to the solution of a problem 
in heat conduction. Z. Angew. Math. Mech., 12, 234-243. 
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Taking the Laplace transform of Equation 12.11.4 and substituting the initial condition, 


d? 
a? U (2, s) 


de sU (x, s) = —1. (12.11.7) 
x£ 


If we write s = a?g?, Equation 12.11.7 becomes 


d?U (x, s) 


1 
2 
qd U(x,s)— Saa (12.11.8) 


From the boundary conditions, U (x, s) is an even function in x and we may conveniently 
write the solution as 


1 
U(a,s) = — + Acosh(qz). (12.11.9) 
E 
From Equation 12.11.6, 
r h 
qAsinh(qL) + — + hAcosh(qL) = 0, (12.11.10) 
E 
and 
1 h cosh 
U(x, s) = CORB UN) (12.11.11) 


s  s[gsinh(qL) + hcosh(qL)] 


The inverse of U(z,s) consists of two terms. The first term is simply unity. We will 
invert the second term by contour integration. 

We begin by examining the nature and location of the singularities in the second term. 
Using the product formulas for the hyperbolic cosine and sine functions, the second term 
equals 


n(i+ Se) (1 ' gx) 


(12.11.12) 
sjer (1+ oe) (1 + aki) eh (1 E) (1 + spa) «| 


Because q? = s/a?, Equation 12.11.12 shows that we do not have any ys in the transform 
and we need not concern ourselves with branch points and cuts. Furthermore, we have only 
simple poles: one located at s — 0 and the others where 

q sinh(qL) + h cosh(qL) = 0. (12.11.13) 


If we set q = ià, Equation 12.11.13 becomes 


h cos(AL) — Asin(AL) = 0, or AL tan(AL) = hL. (12.11.14) 


From Bromwich’s integral, 


X h cosh(qx) 1 f h cosh(qx)e* 
i = d 12.11.15 
{ s[qsinh(qL) + hcosh(qL)| j 21i Jo z|g sinh(qL) + h cosh(qL)] a ) 
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0.6 
gaans? 


2 1 


Figure 12.11.1: The temperature within the portion of a slab 0 < z/L < 1 at various times a?t/ L? if the 
faces of the slab radiate to free space at temperature zero and the slab initially has the temperature 1. The 
parameter hL — 1. 


where q = z!/?/a and the closed contour C consists of Bromwich's contour plus a semicircle 


of infinite radius in the left half of the z-plane. The residue at z = 0 is 1 while at zn = —a?A2, 
h cosh(qa:)e'* .. h(z - a232) cosh(qx)e* 
R EN l : a 12.11.16 
sl z[qsinh(qL) + h cosh(qL)] e" Pers z[qsinh(qL) + hcosh(qL)| ( ) 
S Tin h cosh(qx)et? 
© z>zn z|(1 + hL) sinh(qL) + qL cosh(qL)]/(2a2q) 


(12.11.17) 
B 2ha? Ani cosh(iA,z) exp(—A2a?t) 
7 Eg) AL isin) + tA, Deos D] 
(12.11.18) 


2h cos(A,2) exp(—a?A2t) 
An[(1 + AL) sin(AnL) + An L cos(A4L)]. 


(12.11.19) 

Therefore, the inversion of U (x, s) is 

2 cos(Àn z) exp(—a? A2 t) 
t)=1—41-2h 12.11.20 
unt) { > Xn[( + AL) sin(A4L) + An L cos(AnL)] J’ ( ) 
or 
= cos(Anz) exp(—a? M t) 

=2 ; 12.11.21 
uen ean 2. XI + AE) siu(As D) + MbeosDu D] ( ) 


We can further simplify Equation 12.11.21 by using h/A, = tan(A, L). This yields hL = 
A, Ltan(A4 L). Substituting these relationships into Equation 12.11.21 and simplifying, 


9^. sin(A4 D) cos(A42) exp(—a? Mt) 
t)=2 n 12.11.22 
Jt > An L + sin(An L) cos(AnL) ( ) 


Figure 12.11.1 illustrates Equation 12.11.23. It was created using the MATLAB script 
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clear 
hL = 1; m = 0; M = 100; dx = 0.05; dt = 0.05; 
^ create initial guess at zeron 
zero = zeros(length(M)); 
for n = 1:10000 
ki = O.1*n; k2 = O.1*(n*1); 
prod = ki * tan(k1); y1 = hL - prod; y2 = hL - k2 * tan(k2); 
if (yl*y2 <= O & prod < 2 & m < M) m = m*1; zero(m) = ki; end; 
end; 
^4 use Newton-Raphson method to improve values of zero_n 
for n = 1:M; for k = 1:10 
f = hL - zero(n) * tan(zero(n)); 
fp = - tan(zero(n)) - zero(n) * sec(zero(n))^2; 
zero(n) = zero(n) - f / fp; 
end; end; 
^4 compute Fourier coefficients 
for m - 1:M 
a(m) = 2 * sin(zero(m)) / (zero(m) + sin(zero(m))*cos(zero(m))); 
end 
^ compute grid and initialize solution 
X = [0:dx:1]; T = [0:dt:2]; 
u 7 zeros(length(T),length(X)); 
XX = repmat(X,[length(T) 1]); TT = repmat(T’,[1 length(X)]); 
% compute solution from Equation 12.11.22 
for m = 1:M 
u = u + a(m) * cos(zero(m)*XX) .* exp(-zero(m)*zero(m)*TT) ; 
end 
surf(XX,TT,u) 
xlabel(’distance’,’Fontsize’,20); ylabel(’time’, ’Fontsize’ ,20) 
zlabel(’U(X,T)’,’Fontsize’ ,20) 


e Example 12.11.2: Heat dissipation in disc brakes 


Disc brakes consist of two blocks of frictional material known as pads that press against 
each side of a rotating annulus, usually made of a ferrous material. In this problem we deter- 
mine the transient temperatures reached in a disc brake during a single brake application.*® 
If we ignore the errors introduced by replacing the cylindrical portion of the drum by a 
rectangular plate, we can model our disc brakes as a one-dimensional solid, which friction 
heats at both ends. Assuming symmetry about x — 0, the boundary condition there is 
u4(0,t) = 0. To model the heat flux from the pads, we assume a uniform disc deceleration 
that generates heat from the frictional surfaces at the rate N(1— Mt), where M and N are 
experimentally determined constants. 

If u(x,t), k, and a? denote the temperature, thermal conductivity, and diffusivity of 
the rotating annulus, respectively, then the heat equation is 

Ou 8u 


l—-—a-É- L t 12.11.2 
or^ 5 Bp O0<a<L, 0<t, ( 3) 


48 From Newcomb, T. P., 1958: The flow of heat in a parallel-faced infinite solid. Brit. J. Appl. Phys., 
9, 370-372. See also Newcomb, T. P., 1958/59: Transient temperatures in brake drums and linings. Proc. 
Inst. Mech. Eng., Auto. Div., 227-237; Newcomb, T. P., 1959: Transient temperatures attained in disk 
brakes. Brit. J. Appl. Phys., 10, 339-340. 
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with the boundary conditions 


Qu(L,t) . 
On ~ 


=0, N(l1-Mt), Oct. (12.11.24) 


The boundary condition at x = L gives the frictional heating of the disc pads. 
Introducing the Laplace transform of u(x,t), defined as 


U (z,, 8) = u(a,t)e** dt, (12.11.25) 
0 


the equation to be solved becomes 


aU S 


vr E aU =0, (12.11.26) 


subject to the boundary conditions that 


wes) =0, and ate) - zG =): (12.11.27) 
The solution of Equation 12.11.26 is 
U(a,s) = Acosh(qx) + Bsinh(qz), (12.11.28) 
where q = s!/?/a. Using the boundary conditions, the solution becomes 
U(z,s) = ME =) ME (12.11.29) 


It now remains to invert the transform, Equation 12.11.29. We will invert cosh(qz)/ 
[sqsinh(qL)]; the inversion of the second term follows by analog. 

Our first concern is the presence of s!/? because this is a multivalued function. However, 
when we replace the hyperbolic cosine and sine functions with their Taylor expansions, 
cosh(qx)/[sq sinh(qL)] contains only powers of s and is, in fact, a single-valued function. 

From Bromwich’s integral, 


c4-ooí tz 
ral cosh(qx) |- 1 i cosh(qz)e dz, (12.11.30) 


sqsinh(qL)|  2ri J. 4; zqsinh(qL) 


where q = z1/? /a. Just as in the previous example, we replace the hyperbolic cosine and 
sine with their product expansion to determine the nature of the singularities. The point 
z = 0 is a second-order pole. The remaining poles are located where zl Lja = nri, or 
Zn = —n?^m?a?/L?, where n = 1,2,3,.... We have chosen the positive sign because z!/? 
must be single-valued; if we had chosen the negative sign the answer would have been the 
same. Our expansion also shows that the poles are simple. 

Having classified the poles, we now close Bromwich’s contour, which lies slightly to the 
right of the imaginary axis, with an infinite semicircle in the left half-plane, and use the 
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residue theorem. The values of the residues are 


h tz 1 d _ 2 h tz 
gq | CER l = | ten icu oe (12.11.31) 
zqsinh(qL) 1! 230 dz zqsinh(qL) 
h tz 
Sac J zeka e (12.11.32) 
z20 dz qsinh(qL) 
zz? Per 
a? d zji [ete 2! sts | 
E li 12.11.33 
= tim, Ef zt LE. j ( ) 
œ, d zz? zp 
-a g ltte zat ] (12.11.34) 
a? g* p 
= t 12.11. 
L { m 2a? a}. ( 3B) 
and 
cosh(qx)e!* .  cosh(gr) , Z— Zn 
Sy | S| ——e"* |] 1 — 12.11. 
es E sinh(qL)’ i oon. zq M S sinh(qL) ( 36) 
h tz 
= i << Sone (12.11.37) 
z>zn zq cosh(qL)L/(2a?q) 
hi JE cada ure 
ME (nzzi/ L) exp(—n Ta t/ L*) (12.11.38) 
(—n?22?a? / L?) cosh(nzi)L/(2a?) 
2L(—1)" —n3g2a2t/ L2 
= 7m cos(na/L)e oe (12.11.39) 
When we sum all of the residues from both inversions, the solution is 
aN x? D? 2LN = (—1)” —n?r?a?t/L? 
u(x,t) = " te 2»? az] rur. cos(nma/L)e / 
a^;NM(t? tx? tL? " z* r? L? i TL* 
KL |2 2a? 6a?  24a*  12a^ — 360a^ 
2L*N M «5 (-1)" 
= cos(nzz/ L)e "7 «UL. (12.11.40) 


a? k^ nt 
n=1 


Figure 12.11.2 shows the temperature in the brake lining at various places within the 
lining [r' = z/L] if a? = 3.3 x 10? cm?/sec, & = 1.8 x 10? cal/(cm sec?C), L = 0.48 cm, 
and N = 1.96 cal/(cm? sec). Initially the frictional heating results in an increase in the disc 
brake’s temperature. As time increases, the heating rate decreases and radiative cooling 
becomes sufficiently large that the temperature begins to fall. 


e Example 12.11.3 


In the previous example we showed that Laplace transforms are particularly useful 
when the boundary conditions are time dependent. Consider now the case when one of the 
boundaries is moving. 
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TEMPERATURE (°C) 


TIME (SECONDS) 


Figure 12.11.2: Typical curves of transient temperature at different locations in a brake lining. Circles 
denote computed values while squares are experimental measurements. (From Newcomb, T. P., 1958: The 
flow of heat in a parallel-faced infinite solid. Brit. J. Appl. Phys., 9, 372 with permission.) 


We wish to solve the heat equation 


ðu u 
subject to the boundary conditions 
u(z,t)| p- = f(t), and lim u(z,t) 290, 0<t, (12.11.42) 
xd z—00 
and the initial condition 
u(r,0)—0,  0«mcr«oo. (12.11.43) 


This type of problem arises in combustion problems where the boundary moves due to the 
burning of the fuel. 

We begin by introducing the coordinate 7 = x — fit. Then the problem can be refor- 
mulated as 


Ou Ou 20u 
= , 0<n<w, 0<t, 12.11.44 
Ot 555 7 On? Hohe ( ) 
subject to the boundary conditions 
u(0,t)=f(), lim wt) +0,  0«t, (12.11.45) 
7]—0o 


and the initial condition 
u(n,0)=0,  0«m«oo. (12.11.46) 
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Taking the Laplace transform of Equation 12.11.44, we have that 


@U(n,s) , B dU(ms) s 
dn? ^a? dm a? 


U (n, s) — 0, (12.11.47) 


with 
U(0,s) = F(s), and lim U (n, s) 0. (12.11.48) 
n— o0 


The solution to Equation 12.11.47 and Equation 12.11.48 is 


2 
U(n, 8) EC on ys | B) (12.11.49) 


Because 


A E E TE (12.11.50) 
j a 4a? J)’ 
where 
= t 2 n , Bvt 
$(n,t) 2 ile enfer 4 - 3") + e8"/2a ene «E. 12.11.51 
(n) = 5 | 2av/t 2a 2avyt 2a ( ) 
and à P 
erfe(x) = 1 — = | e^" dn, (12.11.52) 
VT Jo 
we have by the convolution theorem that 
MET 
u(n, t) = e-?n/2e ji f(t — 7)9(n, 7) dr, (12.11.53) 
0 
Or ] 
u(r,t)— e Be Bt) /2a* | f(t — 7) ®(a — Br,7) dr. (12.11.54) 
0 
Problems 
1. Solve a ga 
u u , 
BE Hu a^(u — To), 0c«rc«l, 0<t, 


subject to the boundary conditions u,(0,t) = uz(1,t) = 0, 0 < t, and the initial condition 
u(z,0) =0,0<a<1. 


2. Solve ^ E 
u u 
a Jx?’ 0<a<il, 0<t, 


subject to the boundary conditions u,(0,t) = 0, u(1,t) = t, 0 < t, and the initial condition 
u(z,0)—0,0 «x « I. 


3. Solve 5 E 
u U 
pope pears 1 t 
2r O0<a<l, 0<t, 
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subject to the boundary conditions u(0,t) = 0, u(1,t) = 1, 0 < t, and the initial condition 
u(z,0)—0,0 «€ x « I. 


4. Solve 
Ou 0?u j 
Ot Ax?’ 


subject to the boundary conditions us (—4,t) = 0, us (4,t) = ô(t), 0 € t, and the initial 
condition u(x, 0) = 0, -3 « x « 1. 


5. Solve 
Ou ou, 
ðt ðr? ^" 
subject to the boundary conditions u(0,t) = u(1,t) = 0, 0 < t, and the initial condition 
u(z,0)—0,0 «x « I. 


6. Solve? 


0«c«rx«oo, 0<t, 


subject to the boundary conditions 


u(0,7) 21, lim u(z,t) > 0, 0 « t, 


LoCo 


and the initial condition u(x,0) = 0, 0 < z < oo. Hint: Use tables to invert the Laplace 
transform. 


7. Solve 
Pu = ou 0<a< O0<t 
8t Oa?’ VS 


subject to the boundary conditions 


du(0, t 
Cut i, iim u(x,t) +0, O<t, 
Ox z— 00 


and the initial condition u(x,0) = 0, 0 < z < oo. Hint: Use tables to invert the Laplace 
transform. 


8. Solve 
Ou Pu 
Ot Ox?’ 


subject to the boundary conditions 


0«c«r«oo, Oct, 


u(0,7) 21, lim u(z,t) > 0, 0 « t, 


Tz—00 


and the initial condition u(z,0) = e77, 0 < x < oo. Hint: Use tables to invert the Laplace 
transform. 


49 Tf u(x,t) denotes the Eulerian velocity of a viscous fluid in the half space x > 0 and parallel to the wall 
located at z — 0, then this problem was first solved by Stokes, G. G., 1850: On the effect of the internal 
friction of fluids on the motions of pendulums. Proc. Cambridge Philos. Soc., 9, Part II, [8]-[106]. 
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9. Solve " E j 
ENDE d iilis on 
a 7l ag ^ u F + dul, 0<zr<œ, 0<t, 


where ó is a constant, subject to the boundary conditions 
u(0,t) = uo, lim u(x,t) > 0, 0 « t, 
z—:00 
and the initial condition u(z,0) = 0, 0 < z < oo. Note that 
1 E 
E È exp (-2aV/s 482 ) = jean ( 5. + svt) + je erte( 5 — evt) ; 
S vt vt 
where erfc(-) is the complementary error function. 


10. During their modeling of a chemical reaction with a back reaction, Agmon et al.°° 


solved 
Ou u 


—-a47—-, 
Ot Ox? 
subject to the boundary conditions 


0«zx«oo, 0<t, 


t 
Kg + au, (0, t) + ana | us (0, T) dr = &,u(0, t), 
0 


lim u(x,t) — 0, 0 « t, 


z— Co 


and the initial condition u(z,0) = 0, 0 < x < oo, where kq and «, denote the intrinsic 
dissociation and recombination rate coefficients, respectively. What should they have found? 


11. Solve?! P » 
A: 7 au 7 Bt 0«cz«oo, 0<t, 


subject to the boundary conditions 


pu(0,t) — u,(0,t) = e(07-85.— lim u(a,t) > 0, 0<t, 


wL— Co 


and the initial condition u(z,0) = 0, 0 < x < oo, where 8, p, and c are constants and 


c £p. 


12. Solve B E 
u —42 u —ka 
ar "aa^ ] 0«z«oo, Oct, 
subject to the boundary conditions 
Ou(0, t 
EUM —0, lim u(x,t) = uo, 0 « t, 
HH 1—00 


50 Agmon, N., E. Pines, and D. Huppert, 1988: Geminate recombination in proton-transfer reactions. 
II. Comparison of diffusional and kinetic schemes. J. Chem. Phys., 88, 5631—5638. 


51 Saidel, G. M., E. D. Morris, and G. M. Chisolm, 1987: Transport of macromolecules in arterial wall 
in vivo: A mathematical model and analytic solutions. Bull. Math. Biol., A9, 153-169. 
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and the initial condition u(z,0) = uo, 0 < x < oo. 


13. Solve 
Ou u 


Ot ^ Ax 
subject to the boundary conditions u(0,t) = t, u(L,t) = 0, 0 < t, and the initial condition 
u(x,0)=0,0< x< L. 


— P, O0<a<L, 0<t, 


14. Solve T ga 
u 2 u 
— —a^—.ck L k 
DE ir t u, O0<a<L, 0<k,t, 
subject to the boundary conditions u(0,t) = u(L,t) = To, 0 < t, and the initial condition 
u(r,0) = To, 0 € £ < L. 


15. An electric fuse protects electrical devices by using resistance heating to melt an en- 
closed wire when excessive current passes through it. A knowledge of the distribution of 
temperature along the wire is important in the design of the fuse. If the temperature rises 
to the melting point only over a small interval of the element, the melt will produce a 
small gap, resulting in an unnecessary prolongation of the fault and a considerable release 
of energy. Therefore, the desirable temperature distribution should melt most of the wire. 
For this reason, Guile and Carne?? solved the heat conduction equation 
2 

ot ca! a(l tau), -L<a<lL, 0<t, 
to understand the temperature structure within the fuse just before meltdown. The second 
term on the right side of the heat conduction equation gives the resistance heating, which is 
assumed to vary linearly with temperature. If the terminals at x = +L remain at a constant 
temperature, which we can take to be zero, the boundary conditions are u(— L, t) = u(L, t) = 
0, 0 < t. The initial condition is u(z, 0) = 0, —L < x < L. Find the temperature field as a 
function of the parameters a, q, and a. 


16. Solve?? 
ðu u 20u 


ðt ðr? T r Or’ 
subject to the boundary conditions 


O<r<l, 0<t, 


Ou(1, t) 
Ór 
and the initial condition u(r,0) = 0, 0 € r « 1. Hint: Use the new dependent variable 

v(r,t) = ru(r, t). 


lim |u(r, t)| < oo, =1, 0 « t, 
r—0 


17. Solve? 
ðu (u  20uY | a? O? (ru) 
ae. (Sat 2a) q(t) = — 72 + q(t), b<r<w, Oc«t, 


52 Guile, A. E., and E. B. Carne, 1954: An analysis of an analogue solution applied to the heat conduction 
problem in a cartridge fuse. AIEE Trans., Part 1, 72, 861-868. 


53 See Reismann, H., 1962: Temperature distribution in a spinning sphere during atmospheric entry. J. 
Aerosp. Sci., 29, 151-159. 


54 See Frisch, H. L, and F. C. Collins, 1952: Diffusional processes in the growth of aerosol particles. J. 
Chem. Phys., 20, 1797—1803. 
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subject to the boundary conditions 


Ór r—0o 


Ou(b, t : 
u( , ) = u(b, t lim u(r, t) = up «f q(T) dr, 0 « t, 
0 


and the initial condition u(r,0) = uo, b < r < oc. 


18. Consider?? a viscous fluid located between two fixed walls x = +L. At x = 0 we 
introduce a thin, infinitely long rigid barrier of mass m per unit area and let it fall under 
the force of gravity, which points in the direction of positive x. We wish to find the velocity 
of the fluid u(x,t). The fluid is governed by the partial differential equation 


Ou 3u 


zl ey L t 
En Vau O0<a<L, 0<t, 


subject to the boundary conditions 


Ou(0, t 2u Ou(0, t 
upeo, PO PHO ocr 


and the initial condition u(z, 0) — 0, 0 < x < L. 


19. Consider a viscous fluid located between two fixed walls x = +L. At x = 0 we 
introduce a thin, infinitely long rigid barrier of mass m per unit area. The barrier is acted 
upon by an elastic force in such a manner that it would vibrate with a frequency w if the 
liquid were absent. We wish to find the barrier's deviation from equilibrium, y(t). The fluid 
is governed by the partial differential equation 


Ou 3u 


ae L t. 
ET E O0<a<L, 0< 


'The boundary conditions are 


dy 


dy 
dt? H Ox ~ 


=0, —=u/(0,t t 
? dt u(0, t), 0 « t, 


u(L,t) =m 

and the initial conditions are u(x,0) = 0, 0 < x < L, and y(0) = A, y'(0) = 0. 
20. Solve?" 

du _ pu 

Ot Oa?” 


subject to the boundary conditions u,(0,t) = 0, a?u, (L,t) + au(L,t) = F, 0 < t, and the 
initial condition u(z,0) =0,0<a< L. 


O0<a<L, 0<t, 


55 See Havelock, T. H., 1921: The solution of an integral equation occurring in certain problems of viscous 
fluid motion. Philos. Mag., Ser. 6, 42, 620—628. 


56 See Havelock, T. H., 1921: On the decay of oscillation of a solid body in a viscous fluid. Philos. Mag., 
Ser. 6, 42, 628-634. 


97 See McCarthy, T. A., and H. J. Goldsmid, 1970: Electro-deposited copper in bismuth telluride. J. 
Phys. D, 3, 697—706. 
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21. Solve 5 o 
u u 
—— < 1 <t 
or^ Oa?’ O<a<l, 0<t, 
subject to the boundary conditions 
Ou(1, t) Ou(1, t) 
t) — — u(1,t —ó(t «t 
u(0) 20, 3a) -u(0 | e RE = s(t), 0st, 


and the initial condition u(z, 0) 2 0, 0 € x « 1. 
22. Solve?? the partial differential equation 


Qu ðu  O?u 
fr ~ Ox?’ 


O0<a<l, 0<t, 


where V is a constant, subject to the boundary conditions 
u(0,t)=1, w(Lt)-0, 0<t, 
and the initial condition u(z, 0) 2 0, 0 «€ x « 1. 


23. Solve?? the partial differential equation 


o?u ðu Qu 
aot ^ oe Pas = 0O<a<w, 0<a,b,t, 


subject to the boundary conditions 


u(0,t)=1, lim u(z,t) > 0, 0<t, 


LoCo 


and the initial condition w,(x,0) + au(z,0) = 0, 0 < x < oo. To invert the Laplace 
transform, you may want to review Problem 52 at the end of Section 12.3. 


24. Solve 


r Or "Or Ot 


subject to the boundary conditions 


210-3 PW dt. dusem ex 


lim |u(r,t)) « oo, u(a,t) = 0, 0 x t, 
r—0 


and the initial condition u(r, 0) = 0, 0 € r < a, where ó(t) is the Dirac delta function. Note 
that J4,(iz) =i"I,(z) and L,(iz) = 1*J4(z) for all complex z. 


25. Solve " ied 5 
u u 
Ai c SU caig < 
T -x(r ) + #0), O<r<a, 0<t, 


58 See Yoo, H., and E.-T. Pak, 1996: Analytical solutions to a one-dimensional finite-domain model for 
stratified thermal storage tanks. Sol. Energy, 56, 315-322. 


59 See Liaw, C. H., J. S. P. Wang, R. A. Greenhorn, and K. C. Chao, 1979: Kinetics of fixed-bed 
absorption: A new solution. AICHE J., 25, 376-381. 
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subject to the DO undar y conditions 
lim ult t CO U\a t = 0 0 t 
0 | ( E )| < ? ( d ) ? < ? 


and the initial condition u(r,0) = 0, 0 <r « a. Note that J,(iz) = i"In(z) and In(iz) = 
i” Jn(z) for all complex z. 


26. Solve 
Ou 24 Ou 


= Pr 


Ot ror 


subject to the boundary conditions 


Ji O0O<r<a, Oct, 


lim |u(r,t)| « oo, u(a,t) = e-*/7e, 0 « t, 
r—0 


and the initial condition u(r,0) = 1, 0 € r < a. Note that J,(iz) = i^I,(z) and I,(iz) = 
i” Jn(z) for all complex z. 


27. Solve 
ðu v (55 1 ðu 


at NOr? rr 


subject to the boundary conditions 


J 0<r<b, 0<t, 


lim ju(r,t)| «oo, u(b,t)= kt, 0<t, 
and the initial condition u(r,0) = 0, 0 <r « b. Note that J4(iz) = i” In(z) and L,(iz) = 
i? Jn(z) for all complex z. 


28. Solve the nonhomogeneous heat equation for the spherical shell9? 


2 
aus 20u :J acudir 


E Or? rör rt 
subject to the boundary conditions 


Ou(a, t) 
Or 


= u(B,t) — 0, 0 « t, 
and the initial condition u(r,0) 20, a <r < B. 


Step 1: By introducing v(r,t) = r u(r,t), show that the problem simplifies to 


Ov o (0v A 
a | (+5) a «r « f, 0 « t, 


subject to the boundary conditions 


Ov(a,t)  v(o,t) 
Ór 


= v(B,t) — 0, 0 « t, 


60 See Malkovich, R. Sh., 1977: Heating of a spherical shell by a radial current. Sov. Phys. Tech. Phys., 
22, 636. 
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and the initial condition 
v(r,0) — 0, a «r «f. 


Step 2: Using Laplace transforms and variation of parameters, show that the Laplace 
transform of u(r, t) is 


A { sinh[g(8 — r)] T aq cosh(q7) + sinh (qn) d "a sinh(qn) d \ 
URE 
0 0 


US srq | aq cosh(q£) + sinh(q£) (a +n) (r 4-9)? 


where q = /s/a, and £ = 8 — a. 


Step 3: 'Take the inverse of U(r, s) and show that 


«ost = AG 3) k iG | J] 


20? SS sin, (8 — r)] exp(—a? nt) i: sin(y£n) g \ 
r£? sin? (y,£)(8 + a2(42) ds (e=) 


n=0 


where Yn is the nth root of ay + tan(¢y) = 0, and ô = 1 + a/£. 
12.12 THE SUPERPOSITION INTEGRAL AND THE HEAT EQUATION 


In our study of Laplace transforms, we showed that we can construct solutions to 
ordinary differential equations with a general forcing f(t) by first finding the solution to 
a similar problem where the forcing equals Heaviside’s step function. Then we can write 
the general solution in terms of a superposition integral according to Duhamel’s theorem. 
In this section we show that similar considerations hold in solving the heat equation with 
time-dependent boundary conditions or forcings. 

Let us solve the heat condition problem 


ðu 0^8 


Bu CSRS h 0<t, 12.12.1 
ðt | Ox? j ( ) 
with the boundary conditions 
u(0,t)=0, u(L,t)= f(t), 0<t, (12.12.2) 
and the initial condition 
u(z,0)=0, O<a<L. (12.12.3) 


The solution of Equation 12.12.1 through Equation 12.12.3 is difficult because of the 
time-dependent boundary condition. Instead of solving this system directly, let us solve the 
easier problem 


ðA 0A 


with the boundary conditions 
A(0,t)=0, A(L,t)=1, 0<t, (12.12.5) 


and the initial condition 
A(z,0)=0, O<a<L. (12.12.6) 
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Separation of variables yields the solution 


TL 


2 Pe —1)” 2224 
A(z,t) = T to = sin( 2) ew( = = ) (12.12.7) 


Consider the following case. Suppose that we maintain the temperature at zero at the 
end « = L until t = 7; and then raise it to the value of unity. The resulting temperature 
distribution equals zero everywhere when t < 7; and equals A(x,t — 71) for t > 7;. We have 
merely shifted our time axis so that the initial condition occurs at t = 71. 

Consider an analogous, but more complicated, situation of the temperature at the end 
position z = L held at f(0) from t = 0 to t = r1 at which time we abruptly change it by 
the amount f(71) — f(0) to the value f(7,). This temperature remains until t = 72 when 
we again abruptly change it by an amount f(72) — f(1). We can imagine this process 
continuing up to the instant t = 7,. Because of linear superposition, the temperature 
distribution at any given time equals the sum of these temperature increments: 


u(x,t) = f(O)A(@, t) + [f(1) — FOJA, t — 1) + [f(2) — f()] AG, t — 72) 
Tec (f(t) — f(™m—-1)|A(2,t — Tn), (12.12.8) 


where Tn is the time of the most recent temperature change. If we write 
Afk = f (Tk) — f(Tk-1), and ATk = Tk — Tk—1; (12.12.9) 


Equation 12.12.8 becomes 


u(x,t) = f(0)A(z, t) + X` Alz, t — n) An. (12.12.10) 
k=1 


Consequently, in the limit of At, — 0, Equation 12.12.10 becomes 
t 
u(a,t) = f(0)A(z, t) «f A(z,t — 7)f'(7) dr, (12.12.11) 
0 


assuming that f(t) is differentiable. Equation 12.12.11 is the superposition integral. We can 
obtain an alternative form by integration by parts: 


u(x,t) = f(t)A(z, 0) -f fo Poa a — n) dr, (12.12.12) 


or 


u(x,t) = f (t) A(x, 0) «f f f(r ae aa dr, (12.12.13) 


because 
OA(x,t — T) OA(a,t — T) 
= i .12.14 
ÓT Ot (12:12:14) 
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To illustrate?! the superposition integral, suppose f(t) = t. Then, by Equation 
12.12.11, 


«o7 [T2 ' 39 = (22) e] p n| bar (12.12.15) 


t 21? S(-1)" ?n2g?t 
mmy = sin(“**) | ew( == H (12.12.16) 


e Example 12.12.1: Temperature oscillations in a wall heated by an alternating current 


In addition to finding solutions to heat conduction problems with time-dependent 
boundary conditions, we can also apply the superposition integral to the nonhomogeneous 
heat equation when the source depends on time. JeglicÓ? used this technique in obtaining 
the temperature distribution within a slab heated by alternating electric current. If we 
assume that the flat plate has a surface area A and depth L, then the heat equation for the 
plate when electrically heated by an alternating current of frequency w is 


Ou 20u 2q 
a = 


à ° Bat = opm wt,  O«s«L OX<t, (12.12.17) 
p 


where q is the average heat rate caused by the current, p is the density, Cp is the specific 
heat at constant pressure, and a? is the diffusivity of the slab. We will assume that we 
insulated the inner wall so that 
Ou(0, t 
QUOD 5 quse (12.12.18) 
Ox 


while we allow the outer wall to radiatively cool to free space at the temperature of zero or 


NLUD 


Apt hu(L,t)=0,  O«t, (12.12.19) 


where « is the thermal conductivity and h is the heat transfer coefficient. The slab is 
initially at the temperature of zero or 


u(z, 0) = 0, 0 « x « L. (12.12.20) 


To solve the heat equation, we first solve the simpler problem of 


0A  ,0A 


61 This occurs, for example, in McAfee, K. B., 1958: Stress-enhanced diffusion in glass. I. Glass under 
tension and compression. J. Chem. Phys., 28, 218-226. McAfee used an alternative method of guessing 
the solution. 


$2 Jeglic, F. A., 1962: An analytical determination of temperature oscillations in a wall heated by 
alternating current. NASA Tech. Note No. D-1286. In a similar vein, Al-Nimr and Abdallah (Al-Nimr, 
M. A., and M. R. Abdallah, 1999: Thermal behavior of insulated electric wires producing pulsating signals. 
Heat Transfer Eng., 20(4), 62-74) found the heat transfer with an insulated wire that carries an alternating 
current. 
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Table 12.12.1: The First Six Roots of the Equation kn tan(k,) = h* 
h* ky ko k3 k4 ks ke 


0.001 0.03162 3.14191 6.28334 9.42488 12.56645 15.70803 
0.002 0.04471 3.14223 6.28350 9.42499 12.56653 15.70809 
0.005 0.07065 3.14318 6.28398 9.42531 12.56677 15.70828 
0.010 0.09830 3.14477 6.28478 9.42584 12.56717 15.70860 
0.020 0.14095 3.14795 6.28637 9.42690 12.56796 15.70924 
0.050 0.22176 3.15743 6.29113 9.43008 12.57035 15.71115 
0.100 0.31105 3.17310 6.29906 9.43538 12.57432 15.71433 
0.200 0.43284 3.20393 6.31485 9.44595 12.58226 15.72068 
0.500 0.65327 3.29231 6.36162 9.47748 12.60601 15.73972 
1.000 0.86033 3.42562 6.43730 9.52933 12.64529 15.77128 
2.000 1.07687 3.64360 6.57833 9.62956 12.72230 15.83361 
5.000 1.31384 4.03357 6.90960 9.89275 12.93522 16.01066 
10.000 1.42887 4.30580 7.22811 10.20026 13.21418 16.25336 
20.000 1.49613 4.49148 7.49541 10.51167 13.54198 16.58640 

oo 1.57080 4.71239 7.853899 10.99557 14.13717 17.27876 


with the boundary conditions 


JA(0,t) _ 0, OME +hA(L,t)=0, 0<t, (12.12.22) 
Ox Ox 


and the initial condition 
A(x,0) = 0, O0<a<L. (12.12.23) 


The solution A(z,t) is the indicial admittance because it is the response of a system to 
forcing by the step function H(t). 

We solve Equation 12.12.21 through Equation 12.12.23 by separation of variables. We 
begin by assuming that A(x,t) consists of a steady-state solution w(x) plus a transient 
solution v(z, t), where 


a^w"(zx) — —1, w'(0)=0, x«w'(L)+hw(L) — 0, (12.12.24) 
Ov ,90?v | Ow(0,t) Ov(L, t) 
= = 0 ——— + hv(L,t) =0 12.12.25 
=! pe? oe » Ke + ho(L,t) = 0, ( ) 
and 
v(z, 0) = —w(z). (12.12.26) 
Solving Equation 12.12.24, 
L- x? KL 
Turning to the transient solution u(x,t), we use separation of variables and find that 
= knz a?k2t 
t) = Ch a = |, 12.12.28 
v(z, t) 2. cos( L Jew( T2 ) ( ) 


where k, is the nth root of the transcendental equation: kn tan(kn) = hL/« = h*. Table 
12.12.1 gives the first six roots for various values of hL/k. 
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Our final task is to compute Ch. After substituting t = 0 into Equation 12.12.28, we 
are left with a orthogonal expansion of —w(z) using the eigenfunctions cos(k,z/L). From 
Equation 6.3.4, 


ies Jo, —w(@) cos(kna/L)dx ^ — —L sin(kn)/(a2k3) —— L? sin(k,,) 
" Jor cos? (kina /L) dz Lkn + sin(2kn)/2]/(2kn) TR + sin(2k,)/2]' 
(12.12.29) 
Combining Equation 12.12.28 and Equation 12.12.29, 
21? C sin(k,) cos(k n/L) .. a?k?t 
t)= ny). 12.12.30 
UCET 2^ ik, + sin(2kn)/] I? ( ) 


Consequently, A(x, t) equals 


A(z, t) = 


L? — x? " kL — 2L? & sin(k,) cos(kn v/L) | ( a^ k21 


. 12.12.31 
24? ha? a? A+ Kk. + sin(Qkn)/2] rE) ( ) 


We now wish to use the solution Equation 12.12.31 to find the temperature distribution 
within the slab when it is heated by a time-dependent source f(t). As in the case of time- 
dependent boundary conditions, we imagine that we can break the process into an infinite 
number of small changes to the heating, which occur at the times t = 71, t = T2, etc. 
Consequently, the temperature distribution at the time t following the change at t = 7, and 
before the change at t = Tn+1 is 


u(a,t) = f(0)A(x, t) DX E jM Ay ot (12.12.32) 


where 
Afk = f (tr) = f (a) and ATk — Tk — Tk—1- (12.12.33) 


In the limit of At, — 0, 


i A(a,t — 
u(x,t) = f(O) A(z, t) «f A(z,t — r)f'(r) dr = f(t) A(z,0) + f(r) AIA a 
0 
(12.12.34) 
In our present problem 
= 2q +2 / mu | 2qu | E 
f(t) 00, AL sin* (wt), and f'(t) — PIED sin(2wt). (12.12.35) 
Therefore, 
2qu Be. ID? —az? «KL 
u(z,t) = AA sin(aor)| 542 + hee 
opo es sin(k,) knz a?k2 (t — 7) 
—-—_—_ 12.12.36 
a Kis + sin(2kn)/2 o(p) eo je aae 
q L-z kL t 
= 2 12.12.37 
zl 2a? i xs) SZC) ( ) 


656 Advanced Engineering Mathematics with MATLAB 


SOLUTION 


Figure 12.12.1: The nondimensional temperature a? ApCpu(z, t)/qL within a slab that we heat by alter- 
nating electric current as a function of position z/L and time a?t/ L? when we insulate the x = 0 end and 


let the x = L end radiate to free space at temperature zero. The initial temperature is zero, hL/« = 1, and 
2/([2,)) = 
a? [(L?w) — 1. 


AL?qu | 45. sin(kn) exp(—a?k2t/L?) nae knz [e s a*k2 7 d- 
a?pCpAL £— k3 [ky + sin(2kn)/2] L j Jo TOAT, P T 


qL L—a? R 
= {| 212 TAL [1 — cos(2wt)] 
p 


4sin(k,) cos(k,x/L) 
Žž, k2 [kn + sin(2kn)/2][4 + a*k2 /(L*0?)] 


n=1 ^ 


2 k2 224 
x E sin(2wt) — 2 cos(2wt) + 2exp (T ) \ (12.12.38) 
Figure 12.12.1 illustrates Equation 12.12.38 for AL/& = 1, and a?/(L?w) = 1. This figure 
was created using the MATLAB script 


clear 
asq_over_omegaL2 = 1; h_star = 1; m = 0; M = 10; 
dx = 0.1; dt = 0.1; 
^ create initial guess at kn 
zero = zeros(length(M)); 
for n - 1:10000 
ki = O.1*n; k2 = O.1*(n*1); 
prod = ki * tan(k1); 
yi = h_star - prod; y2 = h.star - k2 * tan(k2); 
if (yl*y2 <= O & prod < 2 & m < M) m = m*1; zero(m) = k1; end; 
end; 
^ use Newton-Raphson method to improve values of kn 
for n = 1:M; for k = 1:10 
f = h.star - zero(n) * tan(zero(n)); 
fp = - tan(zero(n)) - zero(n) * sec(zero(n))^2; 
zero(n) = zero(n) - f / fp; 
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end; end; 
^4 compute grid and initialize solution 
X = [0:dx:1]; T = [0:dt:6]; 
tempi = (0.5 + 1/h_star)*ones(1,length(X)) - 0.5*X.*X; 
temp2 = ones(1,length(T)) - cos(2*T); 
u = tempi? * temp2; 
XX = XY? * ones(1,length(T)); 
TT = ones(1,length(X))? * T; 
^ compute solution from Equation 12.12.38 
for m - 1:M 
xtemp1 = zero(m) * zero(m); 
xtemp2 = 4 + asq over omegal2*asq over omegaL2*xtempi*xtemp1; 
xtemp3 = asq.over.omegaL2 * xtempi; 
xtemp4 = zero(m) + sin(2*zero(m))/2; 
xtemp5 = asq over omegaL2 * xtemp1; 
aaaaa = 4 * sin(zero(m)) / (xtemp1 * xtemp2 * xtemp4); 
u = u - aaaaa * cos(zero(m)*X)’ 
* (xtemp5 * sin(2*T) - 2 * cos(2*T) + 2 * exp(-xtemp5 * T)); 
end 
surf (XX, TT,u) 
xlabel(’DISTANCE’ ,’Fontsize’ ,20); ylabel(’TIME’, ’Fontsize’ ,20) 
zlabel(’SOLUTION’ , ’Fontsize’ ,20) 
The oscillating solution, reflecting the periodic heating by the alternating current, rapidly 
reaches equilibrium. Because heat is radiated to space at x = L, the temperature is maxi- 
mum at x = 0 at any given instant as heat flows from x = 0 to x = L. 


e Example 12.12.2 


Consider the following heat conduction problem with time-dependent forcing and/or 
boundary conditions: 


^ = a? L(u) + f(P,t), Ü «t (12.12.39) 
B(u) = g(Q,t), 0 « t, (12.12.40) 
and 

u(P,0) = h(P), (12.12.41) 

where 

o ðu o ðu o Ou 
L(u) = Co 1 Cı Ər (K: a) 1 C^» Axe (x a) C3 zs (ss a) ; (12.12.42) 
Ou Ou Ou 

B = + 12.12.43 
(u) Co + C1 Uri + C2 Ba» C3 Bus? ( ) 


P denotes an arbitrary interior point at (#1, 22,273) of a region R, and Q is any point on 
the boundary of R. Here c;, Ci, and K; are functions of x1, £2, and x3 only. 

Many years ago, Bartels and Churchill® extended Duhumel’s theorem to solve this heat 
conduction problem. They did this by first introducing the simpler initial-boundary-value 


$3 Bartels, R. C. F., and R. V. Churchill, 1942: Resolution of boundary problems by the use of a 
generalized convolution. Am. Math. Soc. Bull., A8, 276-282. 
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problem: 
a =a?L(v)+f(P,ti),  0«t, (12.12.44) 
Biv) =9(Q,t1),  0«t, (12.12.45) 
and 
v(P,0) = h(P), (12.12.46) 


which has a constant forcing and boundary conditions in place of the time-dependent ones. 
Here tı denotes an arbitrary but fired instant of time. Then Bartels and Churchill proved 
that the solution to the original problem is given by the convolution integral 


t 
u(P,t) — x|/ v(P,t—T,Tr)dr|. (12.12.47) 


To illustrate®* this technique, let us solve 


ðu 4/0. 20u a? 0? (ru) 
rim (5: | a) = ope a<r<fB, 0<t, (12.12.48) 
subject to the boundary conditions 
o t 
ula, t) = ue, i ) =0, 0<t, (12.12.49) 
T 
and the initial condition u(r,0) = ug, o < r < f. 
We begin by solving the alternative problem 
Ov o [8v 20v a? 0? (rv) 
— = = t . E 
> e (Siem) -— Rao e«r«B, O«t (12.12.50) 
subject to the boundary conditions 
1 O tt 
v(a, t, t) 2 uge ^, DAS) =0, 0<t, (12.12.51) 
r 


and the initial condition v(r,0,7^) = uo, œ < r < 8, or equivalently 


ðw „[/@w 20w a? 6? (rw) 
— = | = t 12.12.52 
Ot e (53 Lx) r Or ^" Spr SS ( 92) 
subject to the boundary conditions 
o Bt 
w(a, t,t’) = 0, DIIS) — 0, 0 « t, (12.12.53) 
r 


and the initial condition w(r, 0, t’) = uo(1— g 9). a<r « B, where v(r,t, t") = uge + 
w(r, t, t’). 


64 See Reiss, H., and V. K. LaMer, 1950: Diffusional boundary value problems involving moving bound- 
aries, connected with the growth of colloidal particles. J. Chem. Phys., 18, 1—12. 
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'The heat condition problem Equation 12.12.52 and Equation 12.12.53 can be solved 
using separation of variables. Following Example 8.3.6, we find that 


oo 


1— =et ) _ i 
TT = Sl : : 2» aL : — 9) eiie (12.12.54) 
ni n 


where kn is the nth root of Bk = tan[k(6 —o)], and 2cn = {8 sin?[k, (8 —o)] — o). Therefore, 


auo 0 t = sin[k,(r — o)] |. 9242 CEN 
DNE Lent a?ka (t=T 12.12. 
uat) = 2987. u Buc I dr (12.12.55) 
auo sin[kn(r — o)] 8 [ 243 (4.7) 242 (t—7) 
= OR tT) Lg 9 Rn UT) OF ef 12.12.56 
T 2 Kitr Ot | Jo s Í E ( ) 
_ auo < sinfkn(r—-a)] 8 | 1— eT kat get. eTa knt (12.12.57) 
m = kn€n Ot a*k2 a?k2 — c Us 
acug xc sin[k,(r — o)] e-* st — eet 
= 12.12.58 
r > kn€n c—a?k2  ' ( ) 
and the final answer is 
E arcup e? k2t u e & dit 
u(x,t) = ue + n (c PIA, sin[kn(r — a)]. (12.12.59) 
Problems 


1. Solve the heat equation®° 


Ou 20u 


ae. o<r<L, df 
8t | 8g 


subject to the boundary conditions u(0,t) = u(L,t) = f(t), 0 « t, and the initial condition 
u(x,0)=0,0< x< L. 
Step 1: First solve the heat conduction problem 


ðA _ 29A 


a ETE O0<a<L, 0<t, 


subject to the boundary conditions A(0,t) = A(L,t) = 1, 0 « t, and the initial condition 
A(2,0) 20, 0 « z « L. Show that 


4 sin|(2n — l)nz/L| | 429m 12421/12 
A(a,t) 21 = [( nat) pe (2n—1) t/L . 


nci 


5 See Tao, L. N., 1960: Magnetohydrodynamic effects on the formation of Couette flow. J. Aerosp. Sci., 
27, 334—338. 
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Step 2: Use Duhamel's theorem and show that 


L? L 


Ana? = . [Gn — Tz] aion pner f° 2(95 132424 /12 
u(x,t) = — So (2n 1)sin] | e7% onc x t/L 1 f(r)e* QncD s ee dqr, 

nci 0 
2. A thermometer measures temperature by the thermal expansion of a liquid (usually 
mercury or alcohol) stored in a bulb into a glass stem containing an empty cylindrical 
channel. Under normal conditions, temperature changes occur sufficiently slowly so that 
the temperature within the liquid is uniform. However, for rapid temperature changes 
(such as those that would occur during the rapid ascension of an airplane or meteorological 
balloon), significant errors could occur. In such situations the recorded temperature would 
lag behind the actual temperature because of the time needed for the heat to conduct in or 
out of the bulb. During his investigation of this question, McLeod? solved 


LN "x (ra): 0€r«b, 0<t, 


Ot rOr\ Or 


subject to the boundary conditions lim,_,9 |u(r,t)| < co, and u(b, t) = y(t), 0 < t, and the 
initial condition u(r, 0) = 0, 0 <r < b. The analysis was as follows: 


Step 1: First solve the heat conduction problem 


A 2 A 
à a ZG J O<r<b, Oct, 


Ot r Or\ Or 


subject to the boundary conditions lim,.,o |A(r, t)| < oo, and A(b,t) = 1, 0 < t, and the 
initial condition A(r,0) = 0, 0 € r < b. Show that 


A(r, t) i1) S "E nT (b) e e 
1( 


where Jo(kn) = 0. 
Step 2: Use Duhamel’s theorem and show that 
2a? o kn Jo(knr/b) i —a?k? (t-7)/W? 
u(r, t) = m 206) f e(r)e dr. 


n=1 


Step 3: If y(t) = Gt, show that 


Jo( (k nt /b) b2 —a?k?t/b2 
unt) =26) E reae 2 -1). 


McLeod found that for a mercury thermometer of 10-cm length a lag of 0.01? C would 
occur for a warming rate of 0.032°C s^! (a warming gradient of 1.9°C per thousand feet 


66 McLeod, A. R., 1919: On the lags of thermometers with spherical and cylindrical bulbs in a medium 
whose temperature is changing at a constant rate. Philos. Mag., Ser. 6, 37, 134-144. See also Bromwich, 
T. J. PA., 1919: Examples of operational methods in mathematical physics. Philos. Mag., Ser. 6, 37, 
407—419; McLeod, A. R., 1922: On the lags of thermometers. Philos. Mag., Ser. 6, A3, 49-70. 
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and a descent of one thousand feet per minute). Although this is a very small number, 
when he included the surface conductance of the glass tube, the lag increased to 0.85?C. 
Similar problems plague bimetal thermometers®’ and thermistors?? used in radiosondes 
(meteorological sounding balloons). 


3. A classic problem®? in fluid mechanics is the motion of a semi-infinite viscous fluid that 
results from the sudden movement of the adjacent wall starting at t = 0. Initially the fluid 
is at rest. If we denote the velocity of the fluid parallel to the wall by u(x,t), the governing 
equation is 

Qu 0?u 

8t Ax?’ 
with the boundary conditions u(0,t) = V(t), limz.. u(x,t) — 0, 0 < t, and the initial 
condition u(x,0) = 0, 0 < x < oc. 


0«cr«oo, 0<t, 


Step 1: Find the step response by solving 


OA OPA 


Sr t xen 0 «€ x « oo, 0 « t, 


subject to the boundary conditions 


A(0,t)=1, lim A(x,t) > 0, 0<t, 


LoCo 


and the initial condition A(z,0) = 0, 0 < z < oo. Show that 


HH 2 = 2 
A(a,t = erte( 5) -+f e " dm, 
(t) 2/vt VT Jaja vi 3 


where erfc(-) is the complementary error function. Hint: Use Laplace transforms. 
Step 2: Use Duhamel's theorem and show that the solution is 


t Em! 4 2 oo 2 
u(z,t) = | Vit D) ans 1 pr) dr — J v(: a ;) e dn. 
0 2VTvT T Ja /Avt Avr 


4. During their study of the propagation of a temperature step in a nearly supercritical van 
der Waals gas, Zappoli and Durand-Daubin"? solved 


Ou 20u 


Bi ^ pr? 0 «€ x « oo, 0 « t, 


with the boundary conditions u(0,t) = uo — 2f (t), lims u(x,t) + 0, 0 < t, and the 
initial condition u(z,0) = 0, 0 < x < oo, where up is a constant. 


97 Mitra, H., and M. B. Datta, 1954: Lag coefficient of bimetal thermometer of chronometric radiosonde. 
Indian J. Meteorol. Geophys., 5, 251—261. 


$8 Badgley, F. I., 1957: Response of radiosonde thermistors. Rev. Sci. Instrum., 28, 1079-1084. 


$9 This problem was first posed and partially solved by Stokes, G. G., 1850: On the effect of the internal 
friction of fluids on the motions of pendulums. Proc. Cambridge Philos. Soc., 9, Part II, [8]-[106]. 


70 Zappoli, B., and A. Durand-Daubin, 1994: Heat and mass transport in a near supercritical fluid. Phys. 
Fluids, 6, 1929-1936. 


662 Advanced Engineering Mathematics with MATLAB 


Step 1: Find the step response by solving 
OA 207A 
Pap Bee ts 
Ot Ox?” 


subject to the boundary conditions A(0,t) = 1, lim,-,~ A(z,t) > 0, 0 < t, and the initial 
condition A(z,0) = 0, 0 < z < oo. Show that 


0<xr<ow, 0<t, 


A(a,t) = ere er dn, 


T oo 
za) 7 Tae 
where erfc(-) is the complementary error function. Hint: Use Laplace transforms. 


Step 2: Use Duhamel’s theorem and show that the solution is 


(x,t) e «( 7 ) : d s(t z’ ) -n q 
u(x,t) = uo erfe( —— | — 2— — —— Je : 
i 2avt 3n z/(2avt) 4a?n? 1 


5. Solve the heat equation 
Qu _ Pu 
Ot Ox?’ 


subject to the boundary conditions u(0,t) = f(t), u;(1,t) = —hu(1,t), 0 < t, and the initial 
condition u(z,0) 20,0 « x « 1. 


O0<a<l, Oct, 


Step 1: First solve the heat conduction problem 
OA | 0? A 
Ot | Ox?’ 


subject to the boundary conditions A(0,t) = 1, A,(1,t) = —AA(1,t), 0 < t, and the initial 
condition A(z,0) = 0, 0 < x < 1. Show that 


O0<a<l, 0<t, 


ha z k2 +h? 2 
Alx, t)=1 2 ie in(knx)e nt 
CE Tan S22 p eva ne Ne 


where kn is the nth root of kcot(k) = —h. 
Step 2: Use Duhamel's theorem and show that 


(k2 +h?) e kant ker 
o o1 RR nlk nk Je [to efa dr. 


12.13 THE SOLUTION OF LAPLACE’S EQUATION 


Laplace transforms are useful in solving Laplace’s or Poisson’s equation over a semi- 
infinite strip. The following problem illustrates this technique. 
Let us solve Poisson’s equation within a semi-infinite circular cylinder 


10 f Qu Ou 2 
= = < 
25r (a) + D zrl b), O<r<a, 0«z«oo, (12.13.1) 


The Laplace Transform 663 


subject to the boundary conditions 
u(r,0) 20, lim u(r, z)|< oo, O<r<a, (12.13.2) 
z= 00 


and 
u(a, z) = 0, 0<z< œ, (12.13.3) 


where 0 < b < a. This problem gives the electrostatic potential within a semi-infinite cylin- 
der of radius a that is grounded and has the charge density of n(z) within an infinitesimally 
thin shell located at r = b. 

Because the domain is semi-infinite in the z direction, we introduce the Laplace trans- 
form 


U(r, s) z u(r, z) e ?? dz. (12.13.4) 
0 
Thus, taking the Laplace transform of Equation 12.13.1, we have that 
dU (r, s) 2 _2 
PUE b d | c s*U(r,s) — su(r,0) — u;(r,0) = y N(s)ótr — b). (12.13.5) 


Although u(r,0) = 0, uz(r,0) is unknown and we denote its value by f(r). Therefore, 
Equation 12.13.5 becomes 


+ s?U(r,s) = f(r) + 2 N(s)ó(r —b, 0<r<a, (12.13.6) 


ld E 3 2 


dr 


rdr 


with limo |U (r, s)| < oo, and U (a, s) = 0. 
To solve Equation 12.13.6 we first assume that we can rewrite f(r) as the Fourier-Bessel 
series 


= S d stel (12.13.7) 


n=1 


where kn is the nth root of the Jo(k) = 0, and 


Avo ZPD = fr f(r) Jo(knr/a) r dr. (12.13.8) 
Similarly, the expansion for the delta function is 


= 2b QA Jo(knb/a)Jo(knr/a) 
a? J? (kn) 


(12.13.9) 


because A 
| ó(r — b) Jo(knr/a) r dr = b Jo(Kknb/a). (12.13.10) 
0 


Why we chose this particular expansion will become apparent shortly. 
'Thus, Equation 12.13.6 may be rewritten as 


Jo(k,r/a), — (12.13.11) 


1d[| dU(r,s)| | " 2N(s M T 
23 dr E TS 2 kin) 


where aj = So Fl r) Jo(knr/a) r dr. 
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The form of the right side of Equation 12.13.11 suggests that we seek solutions of the 
form 


— M > BrJo(knr/a), O<r<a. (12.13.12) 


We now understand why we rewrote the right side of Equation 12.13.6 as a Fourier-Bessel 
series; the solution U(r, s) automatically satisfies the boundary condition U (a, s) = 0. Sub- 
stituting Equation 12.13.12 into Equation 12.13.11, we find that 


U(r s) = 5 a2 6 N(s) D T S Jo(k,r/a) O<r<a. (12.13.13) 


We have not yet determined aj. Note, however, that in order for the inverse of Equation 
12.13.13 not to grow as e¥»?/4, the numerator must vanish when s = k,/a and s = kn /a is 
a removable pole. Thus, 


di ION eaa, (12.13.14) 
and 
U(r,s) = E T UE Jo(knr/a), O<r<a. (1213.15) 


The inverse of U(r, s) then follows directly from simple inversions, the convolution theorem, 
and the definition of the Laplace transform. The complete solution is 


_ 2 o^. Jo(knb/a) Jo(knr/a) 
= PD ; E. JE) 


u(r,z 
n=1 


ff n(r)ekn (2-7/4 a - f n(r)e-*» 6-7) dr 
0 


0 


x 


- «metet drs f n(rje enfe tese dr] (12.13.16) 
0 0 


: 3 Ma (12.13.17) 


n=1 


x / n(r)e-"»G*7)/a dr — i n(r)e- Bn G-7)/a dr — f, n(r)e-F»(r-2)/a ar| ] 
0 


0 z 


Problems 
1. Use Laplace transforms to solve 
Ou du 
ou Oye 0«zr«oo, 0<y<a, 


subject to the boundary conditions 


u(0,y)=1, lim]|u(my) «oo  O0<y<a, 
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and 
u(x,0) = u(x,a) = 0, 0 « z « oo. 


2. Use Laplace transforms to solve 


198 / Qu 0?u 
= < 
240 JEPE 0, O<r<a, 0«z«oo, 


subject to the boundary conditions 


w(n0)—1, lim |u(r,2)|<o0,  0sr«a 
z= 00 
and 
lim |u(r, 2)| «oo, u(a,z) =0, 0 « z « oo. 
r= 


Further Readings 


Churchill, R. V., 1972: Operational Mathematics. McGraw-Hill Book Co., 481 pp. A classic 
textbook on Laplace transforms. 


Doetsch, G., 1950: Handbuch der Laplace-Transformation. Band 1. Theorie der Laplace- 
Transformation. Birkhäuser Verlag, 581 pp.; Doetsch, G., 1955: Handbuch der Laplace- 
Transformation. Band 2. Anwendungen der Laplace-Transformation. 1. Abteilung. Birk- 
häuser Verlag, 433 pp.; Doetsch, G., 1956: Handbuch der Laplace-Transformation. Band 3. 
Anwendungen der Laplace- Transformation. 2. Abteilung. Birkhauser Verlag, 298 pp. One 
of the standard reference books on Laplace transforms. 


LePage, W. R., 1980: Complex Variables and the Laplace Transform for Engineers. Dover, 
483 pp. Laplace transforms approached from complex variable theory and a few applica- 
tions. 


McLachlan, N. W., 1944: Complex Variables & Operational Calculus with Technical Appli- 
cations. Cambridge Press, 355 pp. An early book on Laplace transforms from the view of 
complex variables. 


Thomson, W. T., 1960: Laplace Transformation. Prentice-Hall, 255 pp. Presents Laplace 
transforms in the engineering applications that gave them birth. 


Watson, E. J., 1981: Laplace Transforms and Applications. Van Nostrand Reinhold, 205 
pp. A brief and very complete guide to Laplace transforms. 


Taylor & Francis 


Taylor & Francis Group 


http://taylorandfrancis.com 


Simpson's Simpson's 
3/8—rule 1/3-rule 


Ideal Rule 


Ratio of quadrature amplitudes to ideal integration 


Chapter 13 
The Z-Transform 


Since the Second World War, the rise of digital technology has resulted in a corre- 
sponding demand for designing and understanding discrete-time (data sampled) systems. 
These systems are governed by difference equations in which members of the sequence yn 
are coupled to each other. 

One source of difference equations is the numerical evaluation of integrals on a digital 
computer. Because we can only have values at discrete time points th = kT for k = 
0,1,2,..., the value of the integral y(t) = F f(r) dr is 


kT (k—1)T kT 
y(kT) = A f(r) dr =| f(r)dr+ a f(r) dr (13.0.1) 
kT 
= y[(k — 1)T] «f f(r) dr = y[(k — 1)T] + T f(KT), (13.0.2) 
(k—1)T 


because dicas f(r)dr = Tf(kT). The right side of Equation 13.0.2 is an example of 
a first-order difference equation because the numerical scheme couples the sequence value 
y(kT) directly to the previous sequence value y[(k — 1)T]. If Equation 13.0.2 had contained 
y[(k — 2)T], then it would have been a second-order difference equation, and so forth. 

Although we could use the conventional Laplace transform to solve these difference 
equations, the use of z-transforms can greatly facilitate the analysis, especially when we 
only desire responses at the sampling instants. Often the entire analysis can be done using 
only the transforms and the analyst does not actually find the sequence y(kT). 

In this chapter we will first define the z-transform and discuss its properties. Then we 
will show how to find its inverse. Finally we shall use them to solve difference equations. 
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Figure 13.1.1: Schematic of how a continuous function f(t) is sampled by a narrow-width pulse sampler 
f§(t) and an ideal sampler fs(t). 


13.1 THE RELATIONSHIP OF THE Z-TRANSFORM TO THE LAPLACE TRANSFORM! 


Let f(t) be a continuous function that an instrument samples every T' units of time. 
We denote this data-sampled function by f2(t). See Figure 13.1.1. Taking e, the duration 
of an individual sampling event, to be small, we may approximate the narrow-width pulse 
in Figure 13.1.1 by flat-topped pulses. Then f3(t) approximately equals 


(t) = : Y. f(nT) [H(t — nT + e/2) - H(t - nT — e/2), (13.1.1) 


ife « T. 

Clearly the presence of e is troublesome in Equation 13.1.1; it adds one more parameter 
to our problem. For this reason we introduce the concept of the ideal sampler, where the 
sampling time becomes infinitesimally small so that 


O = im fom Jk (t — nT +€/2)— 


=X_ f(nT)5(t — nT). 


c0 € 
n=0 


A(t —nT — zi 
(13.1.2) 


Let us now find the Laplace transform of this data-sampled function. From the linearity 
property of Laplace transforms, 


= M f(nT)£jé(t — n). (13.1.3) 


n=0 


Fs(s) = Lif s(t) lc| Sora (t — nT) 


n=0 


1 Gera (Gera, A. E., 1999: The relationship between the z-transform and the discrete-time Fourier 
transform. IEEE Trans. Auto. Control, AC-44, 370-371) explored the general relationship between 
the one-sided discrete-time Fourier transform and the one-sided z-transform. See also Naumovi¢, M. B., 
2001: Interrelationship between the one-sided discrete-time Fourier transform and one-sided delta transform. 
Electr. Engng., 83, 99-101. 
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Because L[6(t — nT’)] = e-"*7, Equation 13.1.3 simplifies to 


oo 


Fs(s) = M. f(nr)e "T. (13.1.4) 


n=0 


If we now make the substitution that z = e?7, then Fs(s) becomes 


Fede duo. (13.1.5) 
n=0 


where F(z) is the one-sided z-transform? of the sequence f(nT), which we shall now denote 
by fn. Here Z denotes the operation of taking the z-transform while Z-! represents the 
inverse z-transformation. We will consider methods for finding the inverse z-transform in 
Section 13.3. 

Just as the Laplace transform was defined by an integration in t, the z-transform is 
defined by a power series (Laurent series) in z. Consequently, every z-transform has a region 
of convergence that must be implicitly understood if not explicitly stated. Furthermore, 
just as the Laplace integral diverged for certain functions, there are sequences where the 
associated power series diverges and its z-transform does not exist. 

Consider now the following examples of how to find the z-transform. 


e Example 13.1.1 


Given the unit sequence f, = 1, n > 0, let us find F(z). Substituting f, into the 
definition of the z-transform leads to 


Fig > 2" = (13.1.6) 


n=0 


because $75 oz" is a complex-valued geometric series with common ratio z~'. This series 
converges if |z~+| < 1 or |z| > 1, which gives the region of convergence of F(z). 

MATLAB's symbolic toolbox provides an alternative to the hand computation of the 
z-transform. In the present case, the command 
>> syms Z; syms n positive 
>> ztrans(1,n,z) 
yields 


ans — 


z/ (z-1) 


e Example 13.1.2 
Let us find the z-transform of the sequence 


fee n 2 0, (13.1.7) 


? The standard reference is Jury, E. L, 1964: Theory and Application of the z- Transform Method. John 
Wiley & Sons, 330 pp. 
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for a real and a imaginary. 
For a real, substitution of the sequence into the definition of the z-transform yields 


F(z) = 5 g (Tu nu NC D (13.1.8) 
n=0 n=0 


If u = e-?7 z-!, then Equation 13.1.8 is a geometric series so that 


oo 
1 
F(z)— E . 13.1.9 
(9e 3^ - LL (13.1.9) 
n=0 
Because |u| = e~*7|z~1|, the condition for convergence is that |z| > e~°?. Thus, 
E —aT 
F(z) = EP [eee (13.1.10) 


For imaginary a, the infinite series in Equation 13.1.8 converges if |z| > 1, because 
|u| = |z-!| when a is imaginary. Thus, 
z 
Although the z-transforms in Equation 13.1.10 and Equation 13.1.11 are the same in these 
two cases, the corresponding regions of convergence are different. If a is a complex number, 
then 
z 


F(z) = e AD |z| > deck (13.1.12) 


Checking our work using MATLAB, we type the commands: 
>> syms a Zz; syms n T positive 
>> ztrans(exp(-a*n*T),n,z); 
>> simplify (ans) 
which yields 
ans = 
z*exp(a*T) / (z*exp(a*T) -1) 


e Example 13.1.3 
Let us find the z-transform of the sinusoidal sequence 
fn = cos(nwT), n> 0. (13.1.13) 
Substituting Equation 13.1.13 into the definition of the z-transform results in 


F(z)= Y cos(nwT)z ". (13.1.14) 
n=0 


From Euler’s formula, 
cos(nuT) = i(ein9T + e ineT). (13.1.15) 
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so that Equation 13.1.14 becomes 


Dex 
F(z) = s 3 (eme + p) (13.1.16) 
or 
F(z) = $[Z(e™7) + z(e-ne)]. (13.1.17) 
From Equation 13.1.11, 
g(eT)-.—*— >L (13.1.18) 
z— e? 


Substituting Equation 13.1.18 into Equation 13.1.17 and simplifying yields 


F(z) - 3 —— p der (13.1.19) 


e Example 13.1.4 


Let us find the z-transform for the sequence 
1, O0<n<5, 
Q) > s 


From the definition of the z-transform, 


5 oo n 
eu 1 
Z()-F()- 2°+ > (z) ! (13.1.21) 
n=0 n=6 
Ph ee oe ee RUM. 
i z 2 zi ^ zi gP529g—1 
1 1 1 1 1 
1 (13.1.22) 


2z 422 823 1624 3225 
2z 1 3 7 15 31 


= Oyo Os 48 a 16 395 


(13.1.23) 


We could also have obtained Equation 13.1.23 via MATLAB by typing the commands: 
>> syms Z; syms n positive 
>> ztrans(’?1+((1/2) ^n-1) *Heaviside (n-6) ,n,z) 
which yields 


ans = 
2*z/ (2*z-1) *1/2/z*3/4/z^2*1/8/z^3*15/16/z^4*431/32/z^5 


We summarize some of the more commonly encountered sequences and their transforms 
in Table 13.1.1 along with their regions of convergence. 
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Table 13.1.1: Z-Transforms of Some Commonly Used Sequences 


fh, n>0 


fo = k = const. 
fn=90, n>1 


fm = k = const. 
fn = 0, for all values 
of n Am 


k = constant 
kn 
kn? 
ke~*T , a complex 
kne~*"" , a complex 
sin(wonT) 
cos(wonT) 
e-?"7 sin(wonT) 


e-*"T cos(wonT) 


a” , œa constant 


sinh(wonT’) 
cosh(wonT’) 


a^ [n! 
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F(z) 


ke 


z sin(woT) 
z? —2z cos(wo T )--1 


z|z—cos(woT)] 
z? —2z cos(woT)+1 


ze~ 7? sin(woT) 


z?—2ze-9T cos(woT)+e7 247 


ze-?T [ze*? —cos(woT)] 


z?—2ze-9T cos(woT)+e7 247 


2/(z—a) 


az/(z — a)? 


az(z + a)/(z — o)? 


zsinh(woT) 
z? —2z cosh(woT)+1 


z[z—cosh(woT)] 
z? —2z cosh(woT)+1 


e9/ 7 


Region of 
convergence 


\z| 2.0 


|z| 2.0 


|z| > 1 

\z| >1 

\z| >1 
lz] > e=] 
lz] > e=] 
z| 1 
z| 1 


Zier 


|z| > et 
|z| > lal 
|z| > lal 


|z| > lal 


|z| > cosh(woT) 


|z| > sinh(woT) 


|z| 2.0 


|z| 20 
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Figure 13.1.2: The amplitude of the Laplace or Fourier transform (solid line) for the function ae-?* H(t) 
and the z-transform (dashed line) for the sequence fn = ae-^^T 
positive values of a and T' = 1. 


as a function of frequency w for various 


e Example 13.1.5 


In many engineering studies, the analysis is done entirely using transforms without 
actually finding any inverses. Consequently, it is useful to compare and contrast how various 
transforms behave in very simple test problems. 

Consider the time function f(t) = ae~* H(t), a > 0. Its Laplace and Fourier transform 
are identical, namely a/(a + iw), if we set s = iw. In Figure 13.1.2 we illustrate its behavior 
as a function of positive w. 

Let us now generate the sequence of observations that we would measure if we sampled 
f(t) every T units of time apart: f, = ae-?"T, Taking the z-transform of this sequence, it 
equals az/ (z — gc Recalling that z = eT = e/"T. we can also plot this transform as 
a function of positive w. For small w, the transforms agree, but as w becomes larger they 
diverge markedly. Why does this occur? 

Recall that the z-transform is computed from a sequence comprised of samples from a 
continuous signal. One very important flaw in sampled data is the possible misrepresenta- 
tion of high-frequency effects as lower-frequency phenomena. It is this aliasing or folding 
effect that we are observing here. Consequently, the z-transform of a sampled record can 
differ markedly from the corresponding Laplace or Fourier transforms of the continuous 
record at frequencies above one half of the sampling frequency. This also suggests that care 
should be exercised in interpolating between sampling instants. Indeed, in those applica- 
tions where the output between sampling instants is very important, such as in a hybrid 
mixture of digital and analog systems, we must apply the so-called “modified z-transform.” 


Problems 


From the fundamental definition of the z-transform, find the transform of the following 
sequences, where n > 0. Then check your answer using MATLAB. 


l. fn = (3)" 2. f, = ein? 
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1, O<n<5d Ly? —0.1 1 
aasde "ics dps eno 
(i), n>11 
0, n=0 
TEE n= 
a” n22 


13.2 SOME USEFUL PROPERTIES 
In principle we could construct any desired transform from the definition of the z- 


transform. However, there are several general theorems that are much more effective in 
finding new transforms. 


| Linearity 


From the definition of the z-transform, it immediately follows that 


if hn =cıfn +c2g9n, then H(z) 2 e&F(z) - e3G(z), (13.2.1) 


where F(z) = Z(f4), G(z) = Z(gn), H(z) = Z(h,), and c1, c2 are arbitrary constants. 


Multiplication by an expo- 
nential sequence 


If g,—e "Tf, n>O0, then G(z) = F(ze**). (13.2.2) 
This follows from 
Gi = Z(g9n) = X ae =X eT eS Oe) er. (13.2.3) 
n=0 n=0 n=0 


This is the z-transform analog to the first shifting theorem in Laplace transforms. 


| Shifting 


The effect of shifting depends upon whether it is to the right or to the left, as Table 
13.2.1 illustrates. For the sequence f,.»5, no values from the sequence f, are lost; thus, 
we anticipate that the z-transform of f, » only involves F(z). However, in forming the 
sequence fn+2, the first two values of f, are lost, and we anticipate that the z-transform of 
fn+2 cannot be expressed solely in terms of F(z) but must include those two lost pieces of 
information. 
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Table 13.2.1: Examples of Shifting Involving Sequences 


n fn fn—2 fn+2 
0 1 0 4 
1 2 0 8 
2 4 I 16 
3 8 2 64 
4 16 4 


Let us now confirm these conjectures by finding the z-transform of fn+1, which is a 
sequence that has been shifted one step to the left. From the definition of the z-transform, 
it follows that 


Zu)» ha ea hae (13.2.4) 
n=0 n=0 
or is 
Z(fe41) =2>_ fuz" + zfo — zfo, (13.2.5) 
k=1 


where we added zero in Equation 13.2.5. This algebraic trick allows us to collapse the first 
two terms on the right side of Equation 13.2.5 into one and 


Z(fn4i) = zF(z) — z fo. (13.2.6) 

In a similar manner, repeated applications of Equation 13.2.6 yield 
Zl faim) = 2" FO) 2"fy—2" f... —2fi (13.2.7) 
where m > 0. This shifting operation transforms fn+m into an algebraic expression involv- 
ing m. Furthermore, we introduced initial sequence values, just as we introduced initial 
conditions when we took the Laplace transform of the nth derivative of f (t). We will make 


frequent use of this property in solving difference equations in Section 13.4. 
Consider now shifting to the right by the positive integer k, 


dn = fn-kHn-k, n2 0, (13.2.8) 
where Hn- = 0 for n < k and 1 for n > k. Then the z-transform of Equation 13.2.8 is 
G(z) = z "F(z), (13.2.9) 


where G(z) = Z(g,), and F(z) = Z(f,). This follows from 


G(z) E» nz" E» fIn-kHn-kz ^ = >> fee P es E» dus Es b. 
n=0 n=0 n=k m=0 


(13.2.10) 
This result is the z-transform analog to the second shifting theorem in Laplace transforms. 
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In symbolic calculations involving MATLAB, the operator H,—, can be expressed by 
Heaviside(n-k). 


| Initial-value theorem 


The initial value of the sequence fn, fo, can be computed from F(z) using the initial- 
value theorem: 


fo = lim F(z). (13.2.11) 


From the definition of the z-transform, 
oo 
=o faz” = fo + fiz + faz? +... (13.2.12) 


In the limit of z — oo, we obtain the desired result. 


| Final-value theorem 


The value of fn, as n — oo, is given by the final-value theorem: 


dens lim (z — 1)F(z), (13.2.13) 
where F(z) is the z-transform of fn. 


We begin by noting that 


TL 


Z(fazi— fa) = lim. $ (fesi fi)z (13.2.14) 


k=0 
Using the shifting theorem on the left side of Equation 13.2.14, 


n 


F(z) — 2fo — F(z) = lim 3 (feta — fi) *. (13.2.15) 


k=0 
Applying the limit as z approaches 1 to both sides of Equation 13.2.15: 


n 


lim (z — D)F(z) - fo = lim 9 "(fia — fe) (13.2.16) 
k=0 
= lim a [5 — fo) Fa fi) +--+ Us fn-1) + Usa ses] 
13.2.17) 
= lim (fo + fats) 13.2.18) 
== fot foo: 13.2.19) 
Consequently, 


foo = lim (z — 1)F(2). 13.2.20) 
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Note that this limit has meaning only if fə exists. This occurs if F(z) has no second-order 
or higher poles on the unit circle and no poles outside the unit circle. 


| Multiplication by n 


Given 
In = fn; n 2 0, (13.2.21) 
this theorem states that a 
G(z) =—z a (13.2.22) 
z 


where G(z) = Z (gn), and F(z) = Z(fn). 


This follows from 


G(z) = a s iS ne tS 13.2.23 
(z) X nz NC 23 hos ET ( ) 
n=0 n=0 n=0 
| Periodic sequence theorem 
Consider the N-periodic sequence: 
fn = {fofife---fiv-1 fofi...) (13.2.24) 
———<——$ —____ 
first period 
and the related sequence: 
m fn; 0 < n < N e) 1, 
Za — { 0, N<n. (13.2.25) 


This theorem allows us to find the z-transform of f, if we can find the z-transform of £n 
via the relationship 


X 
F(z) = e) po ale at (13.2.26) 
1—z-N 
where X(z) = Z (xz). 
This follows from 
oo N-1 2N-1 3N-1 
F(z)2 M fac" = M sa M nonz” XO macawz "bee. (13.2.27) 
n=0 n=0 n=N n=2N 


Application of the shifting theorem in Equation 13.2.27 leads to 


F(z) = X(z) -z "X(z) -z ?N X(z)4-:- (13.2.28) 
— X(z) [13-27 +H]. (13.2.29) 
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Equation 13.2.29 contains an infinite geometric series with common ratio 27^ , which con- 
verges if |; "| < 1. Thus, 


je" eset. (13.2.30) 


| Convolution 


Given the sequences f, and gn, the convolution product of these two sequences is 
n n 
Wn = fn * 9n = TG = Rode (13.2.31) 
k=0 k=0 


Given F(z) and G(z), we then have that W (z2) = F(z)G(z). 


This follows from 


w()=) » fai GE Sa ae (13.2.32) 
n=0 L k=0 n=0 k=0 


because gay = 0 for k > n. Reversing the order of summation and letting m = n — k, 


W(z) =o M fug "9 = È fam b 
k=0 m=0 


k=0 m=—k 


= F(z)G(z). (13.2.33) 


We can use MATLAB’s command conv( ), which multiplies two polynomials to perform 
discrete convolution as follows: 


>>x = [111111 1]; 


>>y = [1 2 4 8 16 32 64]; 
>>z = conv(x,y) 

produces 

Z = 


1 3 7 15 31 63 127 126 124 120 112 96 64 


The first seven values of z contain the convolution of the sequence x with the sequence y. 
Consider now the following examples of the properties discussed in this section. 


e Example 13.2.1 


From 1 
Z (a^) = ; (13.2.34) 
1—az-! 
for n > 0 and |z| > |a| , we have that 
; 1 
Z (em) (13.2.35) 


The Z-Transform 679 


and i 
Z (e mm) = —— 13.2.36 
(e ) 1 — e-izz-1? ( ) 
if n > 0 and |z| > 1. Therefore, the sequence fn = cos(nx) has the z-transform 
F(z) = Z[cos(nz)] = 42 (e"*) + $Z (e7?) (13.2.37) 
1 1 1 1 1- = 
< €— SEGUE ; (13.2.38) 
21—eivzg-1 21—e-5z-1  1—2cos(x)g-! + 27? 
e Example 13.2.2 
Using the z-transform, 
AC) eet -0 (13.2.39) 
OS Tae)’ ur EX 


we find that 


^"(-a)-1)»7? (13.2.40) 


E rci (13.2.41) 


e Example 13.2.3 
Consider F(z) = 2az 1/(1— az-!)?, where |a| < |z| and |a| < 1. Here we have that 


i . 2az- 


from the initial-value theorem. This agrees with the inverse of F(z): 
fn — n(n 4 1)a^, n 2 0. (13.2.43) 


If the z-transform consists of the ratio of two polynomials, we can use MATLAB to find 
fo. For example, if F(z) = 22?/(z — 1)?, we can find fo as follows: 
»»num - [2 O 0]; 
>>den = conv([1i -1],[1 -1]); 
>>den = conv(den, [1 -1]); 
>>initialvalue = polyval(num,1e20) / polyval(den, 1e20) 
initialvalue = 
2.0000e-20 


Therefore, fo = 0. 
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e Example 13.2.4 


Given the z-transform F(z) = (1— a)z/[(z — 1)(z — a)], where |z| > 1 > a > 0, then 
from the final-value theorem we have that 
: : , 1—-a 
lim fn = lim (z — 1)F(z) = lim ——— = 1. (13.2.44) 
oo z— 


n zoll-—az-i 


This is consistent with the inverse transform f, = 1 — a” with n > 0. 


e Example 13.2.5 


Using the sequences f, — 1 and g, — a", where a is real, verify the convolution 
theorem. 
We first compute the convolution of f, with gn, namely 


1 qti 


I-a =a 


Wn = fa* gn = qe (13.2.45) 
k=0 


Taking the z-transform of wn, 


zZ az z? 


Mee izae aaa a aea A a 


and the convolution theorem holds true for this special case. 


Problems 


Use the properties of z-transforms and Table 13.1.1 to find the z-transform of the following 
sequences. Then check your answer using MATLAB. 


0 n=0 

P —anT - , 

L. fy =nTe T R34 
0, n=0 e 

3. fn = em r> 4. fn = a" cos(n) 


[Hint : Use cos(n) = i(e'" + e-*")] 


5. fn = cos(n — 2) H, 6. fa — 34e ?"T 
0, n=0 

7. fa — si Tod Spee: de 

. fn = sin(nwoT + 0), dam 2. n=2 Joia = fn 
1, n = 3, 


9. fa = (-1)” 


(Hint: fn is a periodic sequence.) 


10. Using the property stated in Equation 13.2.21 and Equation 13.2.22 twice, find the 
z-transform of n? = n[n(1)"]. Then verify your result using MATLAB. 
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11. Verify the convolution theorem using the sequences fn = gn = 1. Then check your 
results using MATLAB. 


12. Verify the convolution theorem using the sequences f, = 1 and g, = n. Then check 
your results using MATLAB. 


13. Verify the convolution theorem using the sequences f, = gn = l/(n!) Then check 
your results using MATLAB. Hint: Use the binomial theorem with x — 1 to evaluate the 
summation. 


14. If a is a real number, show that Z(a” fn) = F(z/a), where Z(f,) = F(z). 
13.3 INVERSE Z-TRANSFORMS 


In the previous two sections we dealt with finding the z-transform. In this section we 
find fn by inverting the z-transform F(z). There are four methods for finding the inverse: 
(1) power series, (2) recursion, (3) partial fractions, and (4) the residue method. We will 
discuss each technique individually. The first three apply only to those functions F(z) that 
are rational functions while the residue method is more general. For symbolic computations 
with MATLAB, you can use iztrans. 


| Power series 


By means of the long-division process, we can always rewrite F(z) as the Laurent 
expansion: 


F(z) = ao +012! + a927? e. (13.3.1) 


From the definition of the z-transform, 
F(z) - M faz" = fot fiz + faz? s, (13.3.2) 
n=0 


the desired sequence fn is given by an. 
e Example 13.3.1 


Let i Pe 
F(z) = — - DE (13.3.3) 


Using long division, N(z) is divided by D(z) and we obtain 


F(z) =i pal r H H (13.3.4) 


Therefore, 


ao = 1, a1 = 1, a3 = 1, a3 = 1, a4 = 1, etc., (13.3.5) 


which suggests that fo = 5 and fn = 1 for n > 1 is the inverse of F(z). 


1 
2 
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e Example 13.3.2 


Let us find the inverse of the z-transform: 


222 — 1.5z 


| T s 
Kc i505 


(13.3.6) 


By the long-division process, we have that 


2 + L5z + 1.25272 + 1.125273 + 
2158405 | 2:2 — 1.52 


22° — 3z + 1 
1.5z — 1 
1.5z — 2.25 + 0.75027 
1.25 — 0.75027 
1.25 — 1.8702 ^ + 


1.1252^^ + 


Thus, fo = 2, fı = 1.5, f2 = 1.25, fz = 1.125, and so forth, or fa = 1 + (5)”. In general, 
this technique only produces numerical values for some of the elements of the sequence. 
Note also that our long division must always yield the power series Equation 13.3.1 in order 
for this method to be of any use. 

To check our answer using MATLAB, we type the commands: 
syms z; syms n positive 
iztrans((2*z^2 - 1.5*z)/(z^2 - 1.5*z + 0.5),z,n) 


which yields 


ans - 
1 + (1/2^n 


| Recursive method 


An alternative to long division was suggested® several years ago. It obtains the inverse 
recursively. 
We begin by assuming that the z-transform is of the form 


aoz™ + a1z™Tt + agz7^7? +--+ + am_12 + Om 


boz™ + b,zm-1 E b52m—? see bm-12 H bm , 


F(z) = (13.3.7) 


where some of the coefficients a; and b; may be zero and bo 4 0. Applying the initial-value 
theorem, 


z— 00 


3 Jury, E. I., 1964: Theory and Application of the z- Transform Method. John Wiley & Sons, p. 41; 
Pierre, D. A., 1963: A tabular algorithm for z-transform inversion. Control Engng., 10(9), 110-111; 
Jenkins, L. B., 1967: A useful recursive form for obtaining inverse z-transforms. Proc. IEEE, 55, 574-575. 
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Next, we apply the initial-value theorem to z[F(z) — fo] and find that 


saan (13.3.9) 
(ao — bo fo)z™ + (a1 — bi fo)? +--+ + (am — bm fo) 

= 13.3.10 

poate boz™ + by zm! + bozm—2 +--+ + bm-12 + bm ae 

Bd (13.3.11) 


Note that the coefficient ag — bo fo = 0 from Equation 13.3.8. Similarly, 


fo = lim z[zF(z) — zfo — fil (13.3.12) 
ZOO 
= lim z (ao — bo fo)z™** + (a1 — bi fo — bo fa)z™ + (a2 — be fo — bi f1)2 1 +++: bm fa 
200 boz™ + bizm-1 + bozm-2 E... + bm—12 + bm 
(13.3.13) 
= (a2 — be fo — bi fi) /bo (13.3.14) 


because ag — bo fo = a1 — bı fo — fibo = 0. Continuing this process, we finally have that 
fn = (an — bn fo — bn-1 fi — +++ — bi fni) /bo, (13.3.15) 
where an = bn = 0 for n > m. 
e Example 13.3.3 
Let us redo Example 13.3.2 using the recursive method. Comparing Equation 13.3.7 to 


Equation 13.3.6, ao = 2, a1 = —1.5, ag = 0, bg = 1, by = —1.5, bo = 0.5, and a, = bn = 0 if 
n > 3. From Equation 13.3.15, 


fo = ao/bo = 2/1 = 2, 13.3.16) 
ie hes ASSIA Oy Laas. 13.3.17) 
fo = (a2 — be fo — bi fi) /bo 13.3.18) 
= [0 — (0.5)(2) — (-1.5)(1.5)]/1 = 1.25, 13.3.19) 
and 
fa = (a3 — bs fo — b2 fı — bı fa) /bo (13.3.20) 
= [0 — (0)(2) — (0.5) (1.5) — (—1.5)(1.25)]/1 = 1.125. (13.3.21) 


| Partial fraction expansion 


One of the popular methods for inverting Laplace transforms is partial fractions. A 
similar, but slightly different, scheme works here. 
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e Example 13.3.4 


Given F(z) = z/ (2? — 1), let us find fn. The first step is to obtain the partial fraction 
expansion of F(z)/z. Why we want F(z)/z rather than F(z) will be made clear in a 
moment. Thus, 


Bu : T e (13.3.22) 
2 (z—1)(z—-1) z-1 z+1 
where " í 
iew- ud (13.3.23) 
z |1 2 
and s i 
gapen — (13.3.24) 
Z |= 2 
Multiplying Equation 13.3.22 by z, 
tos H= Z (13.3.25) 
"cael Epl n 


Next, we find the inverse z-transform of z/(z — 1) and z/(z+ 1) in Table 13.1.1. This 
yields 


z( EET and z a )- c (13.3.26) 


Z— zl 


'Thus, the inverse is 
dm 5 [1 — (-1)*], n> 0. (13.3.27) 


From this example it is clear that there are two steps: (1) obtain the partial fraction 
expansion of F(z)/z, and (2) find the inverse z-transform by referring to Table 13.1.1. 


e Example 13.3.5 


Given F(z) = 227/[(z + 2)(z +1)?], let us find fn. We begin by expanding F(z)/z as 


PR o ae E oe re (13.3.28) 
z (z+2)(2+1)? 2+2 241 = (+1)? 
where 
PECES, — (13.3.29) 
Z z=—2 
d i 2 F(z) = 
B= le po | - =4, (13.3.30) 
and a 
C=(z+1) 20) = -2, (13.3.31) 
Z z=—1 
so that 4 3 B 
F@)=— a , (13.3.32) 


«241 z-42 (z41) 
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or 
hi -z3 p 2 zly = . (13.3.33) 
From Table 13.1.1, 
z (x) = (-1)", (13.3.34) 
z- (: = 5) = (-2)", (13.3.35) 
and 
z^ [| s zala zm = -n(-1)* 2 n(-1)**. (13.3.36) 


Applying Equation 13.3.34 through Equation 13.3.36 to Equation 13.3.33, 


fa = 4(-1) — 4(-2)^ + 2n(-1)", n > 0. (13.3.37) 


e Example 13.3.6 


Given F(z) = (z? + z)/(z — 2)?, let us determine fn. Because 


F(z) z+1 1 3 


z (2-2)? z-2 t (z= 2)?’ (13.3.38) 
iara e (13.3.39) 
"o z—2 (z—2)0|' 3. 
Referring to Table 13.1.1, 
3z 
Z1 z = 2” A i E | = 3B gn 13.3.40 
(4) , and c 3n ( 3.3 ) 


Substituting Equation 13.3.40 into Equation 13.3.39 yields 


fn = (n - 1) 27, n2: 0. (13.3.41) 


| Residue method 


The power series, recursive, and partial fraction expansion methods are rather limited. 
We now prove that fn may be computed from the following inverse integral formula: 


1 
201 


fn = j z"-lF(z)dz, n2 Q0, (13.3.42) 
C 


where C is any simple curve, taken in the positive sense, that encloses all of the singularities 
of F(z). It is readily shown that the power series and partial fraction methods are special 
cases of the residue method. 
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Proof: Starting with the definition of the z-transform 
FQ=S faz", gS Ry (13.3.43) 
n=0 


we multiply Equation 13.3.43 by z"-! and integrating both sides around any contour C 
that includes all of the singularities: 


1 


z 1 d 
Qd = X` fep jr (13.3.44) 
m=0 


Let C be a circle of radius R, where R > R,. Then, changing variables to z = Ret’, and 
dz = iz d0, 


1 d n—m 2m — = 
T) prom dz OR | eiln—m)Oq9 = { 1, red (13.3.45) 
C 0 


2mi A 2m 0, otherwise. 
Substituting Equation 13.3.45 into Equation 13.3.44 yields the desired result that 


1 


pow n—1l LL 
am. z"  F(z)dz = fn. (13.3.46) 


We can easily evaluate the inversion integral, Equation 13.3.42, using Cauchy's residue 
theorem. 


e Example 13.3.7 


Let us find the inverse z-transform of 


1 
F(z)-— ; 13.3.4 
From the inversion integral, 
1 zn-l 
Ac dz. 13.3.48 
f Ar á ( ) 


Clearly the integral has simple poles at z = 1 and z = 2. However, when n = 0 we also have 
a simple pole at z = 0. Thus the cases n = 0 and n > 0 must be considered separately. 


Case 1: n = 0. The residue theorem yields 


1 1 i 
fo= Res] 553) | Res| soos] | Lemna 
Evaluating these residues, i 
1 1 1 
Res| ses Sens (13.3.50) 
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1 1 
nes yest] = aD) bint (13.3.51) 
and 
Res | == =. (13.3.52) 
z(z — 1)(z — 2)’ z(z—1)3 2 


Substituting Equation 13.3.50 through Equation 13.3.52 into Equation 13.3.49 yields fo = 0. 


Case 2: n > 0. Here we only have contributions from z = 1 and z = 2. 


zgn-l n—1 
a= iR TIE 0, 13.3.53 
= Mee] +(e]: tt oe 
where 
Eua 1 13.3.54 
Res| 3! pe | ee ei aed) 
and 
tue Š l 0 13.3.55 
9] = = 2” $ .9. 
Res| Dy | 3c Lus p ) 
Thus, 
[par e. 13.3.56) 
Combining our results, 
0, n = 0, 
e Example 13.3.8 
Let us use the inversion integral to find the inverse of 
F(z) = eur (13.3.58) 
z)-— e- 3. 
The inversion theorem gives 
1 z^ i 22” gntl 4 gen 
n = ; = Res| —— ———5—;1l|, 13.3.59 
af uon =e ug] Heo 


where the pole at z — 1 is second order. Consequently, the corresponding residue is 


zoti 4 27^ d 
R EE "Hp 27^ 
algo ae e) 


Thus, the inverse z-transform of Equation 13.3.58 is 


=3n+1. (13.3.60) 


z=1 


fn =38n+1, n>0. (13.3.61) 
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e Example 13.3.9 


Let F(z) be a z-transform whose poles lie within the unit circle |z| 2 1. Then 
Bos faz, ded, (13.3.62) 
n=0 


and 


HOF T =X AY Y fmf. (13.3.63) 
n=0 n=0 m=0 


ném 


We now multiply both sides of Equation 13.3.63 by z7! and integrate around the unit circle 
C. Therefore, 


j F(OF(s D. dz = Se Pz dz. 5 ffs d gzm—n-l dz, 
|z|=1 n=0 |z|=1 


n=0 m=0 |z|=1 
nzm 
(13.3.64) 


after interchanging the order of integration and summation. Performing the integration, 


= : F(z)F(z !)2^ dz, (13.3.65) 


n7. - 
n= 2ri |z|21 


which is Parseval’s theorem for one-sided z-transforms. Recall that there are similar theo- 
rems for Fourier series and transforms. 


e Example 13.3.10: Evaluation of partial summations* 


Consider the partial summation Sy — x fn. We shall now show that z-transforms 
can be employed to compute Sy. 
We begin by noting that 


N N 
m EM ES n-1 
Sy = 2, fa SE 9, Our dz. (13.3.66) 


Here we employed the inversion integral to replace fn and reversed the order of integration 
and summation. This interchange is permissible since we only have a partial summation. 
Because the summation in Equation 13.3.66 is a geometric series, we have the final result 


that " 
Bises T PE cs (13.3.67) 
C z—1l 


Therefore, we can use the residue theorem and z-transforms to evaluate partial summations. 


^ See Bunch, K. J., W. N. Cain, and R. W. Grow, 1990: The z-transform method of evaluating partial 
summations in closed form. J. Phys. A, 23, L1213-L1215. 
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Let us find Sy = Y^ ,n3. Because f, = n?, F(z) = z(z? + 4z + 1)/(z — 1)4. 
Consequently 


z(z? z zN — 4 
i UL 2E | = i a [z(z? + 42+ 1) (z"" — 1)] (13.3.68) 


=l 


= Ta (z 3 p 45NT? 4 NH z?— Az? z) 


—i(N-1?N?. (13.3.69) 


e Example 13.3.11 


An additional benefit of understanding inversion by the residue method is the ability 
to qualitatively anticipate the inverse by knowing the location of the poles of F(z). This 
intuition is important because many engineering analyses discuss stability and performance 
entirely in terms of the properties of the system's z-transform. In Figure 13.3.1 we graphed 
the location of the poles of F(z) and the corresponding fn. The student should go through 
the mental exercise of connecting the two pictures. 


Problems 


Use the power series or recursive method to compute the first few values of f, of the 
following z-transforms. Then check your answers with MATLAB. 


2 
! : l 1 
L poa = + 0.9z + 0.09 2. F(z) = z+ 
12.622 — 242 +11.4 224 — 227 22—2 
1. 1. 
3. F(z) 92^ + 1.52 4. F(z) 6z^ + 6z 


~ 15.252? — 36.75z + 30.75 ~ 1923 3322 + 21z—7 


Use partial fractions to find the inverse of the following z-transforms. Then verify your 
answers with MATLAB. 


7 z(z 4-1) Ao eh 
5. F(z) = Ge Dp - 21/4 8. F(z)— T DZ rd 


9. Using the property that the z-transform of gn = f, H4 y if n > 0 is G(z) = z-^F(z), 


find the inverse of 
z+l1 


TO) 286-12 


'Then check your answer with MATLAB. 


Use the residue method to find the inverse z-transform of the following z-transforms. Then 
verify your answer with MATLAB. 


274 3z z 


10. F(z) = G 1/3 
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Figure 13.3.1: The correspondence between the location of the simple poles of the z-transform F(z) and 
the behavior of fn. 
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zZ 


oo z) =e? 
(z 4- 1?(z — 1)? TERES) 


12. F(z) = 


13.4 SOLUTION OF DIFFERENCE EQUATIONS 

Having reached the point where we can take a z-transform and then find its inverse, 
we are ready to use it to solve difference equations. The procedure parallels that of solving 
ordinary differential equations by Laplace transforms. Essentially we reduce the difference 
equation to an algebraic problem. We then find the solution by inverting Y (z). 
e Example 13.4.1 

Let us solve the second-order difference equation 


29542 — 3Ynt1 + Yn = 5 9", n > 0, (13.4.1) 


where yo = 0 and yi = 1. 
Taking the z-transform of both sides of Equation 13.4.1, we obtain 


22 (Yn42) — 32 (Yn41) + Z(yn) = 5 Z(3”). (13.4.2) 


From the shifting theorem and Table 13.1.1, 


22°Y (z) — 227 yo — 2zy1 — 3[zY (2) — zyo] + Y (2) = n (13.4.3) 
Substituting yo = 0 and yı = 1 into Equation 13.4.3 and simplifying yields 
(2z — 1)(z — 1)Y (z) = mE (13.4.4) 
Or 
Y()- cne T (13.4.5) 


To obtain yn from Y (z) we can employ partial fractions or the residue method. Applying 
partial fractions gives 


A PCM: (13.4.6) 
ze z—1 2-3’ um 
where 
Y if 
sape xut (13.4.7) 
z laa 2 
and (2) 
Y(z 1 
(z — 3) na 5 (13.4.8) 
Thus, 
1 z 1 z 
Y = } 13.4.9 
CE mu. QM 
Or 


1 z 1 Z 
n= uz z! i 13.4.10 
" 2 (A) (5) ( ) 
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From Equation 13.4.10 and Table 13.1.1, 


y,-i(9-1, n20. (13.4.11) 


An alternative to this hand calculation is to use MATLAB's ztrans and iztrans to 
solve difference equations. In the present case, the MATLAB script would read 


clear 
^ define symbolic variables 
syms z Y; syms n positive 
^4 take z-transform of left side of difference equation 
LHS = ztrans(2*sym(C y (n*2) ) -3*sym C y n*1)?)*syn ay 2?) ,n,2); 
^ take z-transform of right side of difference equation 
RHS = 5 * ztrans(3^n,n,z); 
^4 set Y for z-transform of y and introduce initial conditions 
newLHS = subs(LHS,’ztrans(y(n),n,z)’,’y(0)’,’y(1)’,Y,0,1); 
^ solve for Y 
Y = solve(newLHS-RHS,Y) ; 
% invert z-transform and find y(n) 
y = iztrans(Y,z,n) 
This script produced 
y= 
-1/2+1/2*3^n 
Two checks confirm that we have the correct solution. First, our solution must satisfy 
the initial values of the sequence. Computing yo and yi, 


y = 3(3°— 1) = (1-1) =0, (13.4.12) 


and 
y = 4(31-1) = 4 (8-1) =1. (13.4.13) 


'Thus, our solution gives the correct initial values. 
Our sequence y, must also satisfy the difference equation. Now 


Yn+2 = i(3**? — 1) = 1(9 3^ — 1), (13.4.14) 
and 
Vn4i = i(37*1 — 1) = 1(3 3" — 1). (13.4.15) 
Therefore, 
29442 —39n41 tym = (9-345) 38"-14+3-5=53" (13.4.16) 


and our solution is correct. 

Finally, we note that the term 3” /2 is necessary to give the right side of Equation 13.4.1; 
it is the particular solution. The —1/2 term is necessary so that the sequence satisfies the 
initial values; it is the complementary solution. 
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e Example 13.4.2 
Let us find the y, in the difference equation 
Yn+2 — 2Yn+ı tyn — l, n20 (13.4.17) 


with the initial conditions yọ = 0 and yı = 3/2. 
From Equation 13.4.17, 


Z(yn+2) — 2Z(Yn41) + Z(Yn) = Z(1). (13.4.18) 


The z-transform of the left side of Equation 13.4.18 is obtained from the shifting theorem, 
and Table 13.1.1 yields Z(1). Thus, 


zZ 


2°Y (z) — 2°yo — zy. — 22Y (2) + 2zyo + Y (z) = D (13.4.19) 
z— 
Substituting yo = 0 and yı = 3/2 in Equation 13.4.19 and simplifying gives 
322—z 
Y(z) = ———— 13.4.20 
D= (13.4.20) 
or " 
32° — Zz 
n = Zt |_| . 13.4.21 
n= 2" nes a 
We find the inverse z-transform of Equation 13.4.21 by the residue method, or 
1 3a tl — qn 1 d?[3z2*1 z^ 
4 zi f. 2-4] ^ iel 2 2 ( ) 
= in? +n. (13.4.23) 
Thus, 
Yn =n? +n, n>0. (13.4.24) 


Note that n?/2 gives the particular solution to Equation 13.4.17, while n is there so that 
Yn satisfies the initial conditions. This problem is particularly interesting because our 
constant forcing produces a response that grows as n?, just as in the case of resonance in 
a time-continuous system when a finite forcing such as sin(wot) results in a response whose 
amplitude grows as t”. 


e Example 13.4.3 
Let us solve the difference equation 
b’Yn + Yn+2 = 0, (13.4.25) 


where the initial conditions are yọ = b? and yı = 0. 
We begin by taking the z-transform of each term in Equation 13.4.25. This yields 


U Z (ys) + Z(Yn+2) = 0. (13.4.26) 
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From the shifting theorem, it follows that 
bY (z) + z2Y(z) — 2yo — 21 = 0. (13.4.27) 


Substituting yo = b? and y, = 0 into Equation 13.4.27, 


UY (z) -2^Y(z)- 2 =0, (13.4.28) 
Or "T 
b*z 
Y(z) = ———. 13.4.29 
0-5 (13.4.29) 
To find y, we employ the residue method or 
1 D22^*1 
"E dz. 13.4.30 
ss xi f. (z— ib)(z + ib) ( ) 
Thus, 
2,n4+1 2,n+1 p Tp prr2 04) 
pees PE CU: = (13.4.31) 
z +ib | gy  z—ib| 4, 2 2 
p? inm/2 p? —inmn/2 
- 5 $ E = pnt? cos ( 7.) l (13.4.32) 


because cos(x) = 4 (e'* + e~*”). Consequently, we obtain the desired result that 


yn = Ort? cos( 7.) for n > 0. (13.4.33) 


e Example 13.4.4: Compound interest 


Difference equations arise in finance because the increase or decrease in an account 
occurs in discrete steps. For example, the amount of money in a compound interest savings 
account after n + 1 conversion periods (the time period between interest payments) is 


Yn+1 = Un t Tn; (13.4.34) 


where r is the interest rate per conversion period. The second term on the right side of 
Equation 13.4.34 is the amount of interest paid at the end of each period. 

Let us ask a somewhat more difficult question of how much money we will have if we 
withdraw the amount A at the end of every period starting after the period £. Now the 
difference equation reads 

Yn+1 = Un t TVs — AHn-e-1. (13.4.35) 


Taking the z-transform of Equation 13.4.35, 


Az?-* 
z—1 


zY(z) — zyo = (1 + r)Y (z) — (13.4.36) 
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Figure 13.4.1: The amount in a savings account as a function of an annual conversion period when interest 
is compounded at the annual rate of 12% and $1000 is taken from the account every period starting with 
period 10. 


after using Equation 13.2.9 or 


Yor Az? 
Y = . 13.4.37 
eS CU de e-DA ( ) 
Taking the inverse of Equation 13.4.37, 
A 
Yn = yo(1 r)" — — [0 + r)"7**! — 1] Hne. (13.4.38) 


'The first term in Equation 13.4.38 represents the growth of money by compound interest 
while the second term gives the depletion of the account by withdrawals. 

Figure 13.4.1 gives the values of y,, for various starting amounts assuming an annual 
conversion period with r = 0.12, £ = 10 years, and A = $1000. These computations were 
done two ways using MATLAB as follows: 


^ load in parameters 
clear; r = 0.12; A= 1; k = 0:30; 
y = zeros(length(k) ,3); yanswer = zeros(length(k) ,3) ; 
% set initial condition 
for m=1:3 
y(1,m) = m; 
^4 compute other y values 
for n = 1:30 
y(nti,m) = y(n,m)*r*y (n,m); 
y(nti,m) = y(nt+1,m)-A*stepfun(n, 11); 


end 
^ now use Equation 13.4.38 
for n = 1:31 


yanswer(n,m) = y(1,m)*(1*r)^ n-1); 
yanswer(n,m) = yanswer(n,m)-A*((1*r)^(n-10)-1) 
*stepfun(n,11)/r; 
end; end; 
plot(k,y,’o’); hold; plot(k,yanswer,’s’); 
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axis([O 30 0 10]) 
xlabel(?number of conversion periods’, ’Fontsize’ ,20) 
ylabel(?^amount left in account (K$)’,’Fontsize’ ,20) 


Figure 13.4.1 shows that if an investor places an initial amount of $3000 in an account 
bearing 12% annually, after 10 years he can withdraw $1000 annually, essentially forever. 
This is because the amount that he removes every year is replaced by the interest on the 


funds that remain in the account. 


e Example 13.4.5 
Let us solve the following system of difference equations: 
En+1 = AL y, + 2yn, 


and 


Yn+1 = 3x =F 3Yn; 


with the initial values of zo = 0 and yo = 5. 
Taking the z-transform of Equation 13.4.39 and Equation 13.4.40, 


zX(z) — zoz = AX(z) + 2Y (z), 


zY (z) — yor = 3X (z) + 3Y (z), 


or 
(z — 4)X(z) — 2Y (z) 2 0, 


3X(z) — (z — Y (z) = —5z. 
Solving for X (z) and Y (z), 


10z az 2z 

Ee) eat) 6 z-1 
s 5 4 2 3 
ra= z(z — 4) EE E. 


(z—6)(2—1) 2-6 z-1l 


Taking the inverse of Equation 13.4.45 and Equation 13.4.46 term by term, 
Ln = —24+2 6", and Yn = 3+ 2 6”. 


We can also check our work using the MATLAB script 


clear 

^ define symbolic variables 

syms X Y z; syms n positive 

^4 take z-transform of left side of differential equations 
LHS1 = ztrans(symC x(n*1)?)-4*sym Co x(n) ) -2*sym C8 y Q2) ,n,z); 
LHS2 = ztrans(symC y (n*1)?) -3*sym C x 02?) -3*sym Co y (0) 2) ,n,2) ; 
^4 set X and Y for the z-transform of x and y 

^ and introduce initial conditions 


(13.4.39) 


(13.4.40) 


13.4.41) 


13.4.42) 


13.4.43) 
13.4.44) 


(13.4.45) 


(13.4.46) 


(13.4.47) 


The Z-Transform 


newLHS1 
newLHS2 


^ solve 
[X,Y] = 
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= subs(LHS1, ’ztrans(x(n) ,n,z)’,’ztrans(y(n),n,z)’,... 


>x(0)? , ^y CO) i ,X,Y,0,5); 


= subs(LHS2, ’ztrans(x(n) ,n,z)’,’ztrans(y(n),n,z)’,... 


>x(0)’,’y(O)’,X,Y,0,5); 
for X and Y 
solve (newLHS1,newLHS2,X,Y); 


% invert z-transform and find x(n) and y(n) 


x = 


iztrans(X,z,n) 


y = iztrans(Y,z,n) 


'This script yields 


x = 


2*6^n-2 


y= 


2*67n+3 


Problems 


Solve the following difference equations using z-transforms, where n > 0. Check your answer 
using MATLAB. 


1. Yn+1 — Yn = n?, 
3. Un42 — 2Yn41 t+ Yn = 1, 
5. Yn+1 — 5Yn = cos(n7), 


T. Yn+2 — 


9. Un--2 — 3Yn+1 + 2Yn ced 1, 


Baa 


NE 


«auc 


14. 


Tua. 


yo — 1. 

yo — qi = 0. 
yo = 0. 
IYn = (5)", Yo = 91 = 0. 
yo = yi = 0. 
= 3£n — Aya, Un4i = 224 — 3Yn, 
= 2x4 —10Yn, Unji = — Tn — Yn, 


= Tn — 2yn, Ynti = —Óyn; 


= AY, cA 9Un; Yn+1 = Tn — 2Yn, 


Xo = 


2. Yn+2 — 2Un41 t Jn =0, y=y=l. 
4. Uyn41 T 9y4 =EN, yo — O0. 

6. Yn+2 —494 —l, yo ly =0. 

8. Yn+2 — 5Ynt1 + 6ys = 0, yo — 91 = 1. 


10. Yn+2 — 2yn41 + Yn = 2, Yo = 0, Y= 2. 
To = 3, Yo = 2. 
Xo = 3, yo = =2 


—1, Yo = —7. 


ro = 6, Yo = 2. 


13.5 STABILITY OF DISCRETE-TIME SYSTEMS 


When we discussed the solution of ordinary differential equations by Laplace trans- 
forms, we introduced the concept of transfer function and impulse response. In the case of 
discrete-time systems, similar considerations come into play. 

Consider the recursive system 


Yn = Q1Un—1Hg-1 + a2Yn—2Hn-2 + Tn, 


n>0 (13.5.1) 


, 


where H, , is the unit step function. It equals 0 for n < k and 1 for n > k. Equation 
13.5.1 is called a recursive system because future values of the sequence depend upon all of 
the previous values. At present, a; and a» are free parameters that we shall vary. 
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Using Equation 13.2.7, 
z?Y (z) — a12Y (z) — a2Y (z) = 2? X (2), (13.5.2) 


Or 


Ge Su cci (13.5.3) 


As in the case of Laplace transforms, the ratio Y (z)/ X(z) is the transfer function. The 
inverse of the transfer function gives the impulse response for our discrete-time system. 
This particular transfer function has two poles, namely 


2 

ay En 
= —+4/— ; 13.5.4 
21,2 2 4 + ag ( ) 


At this point, we consider three cases. 


Case 1: a2/4 d ag < 0. In this case z; and z2 are complex conjugates. Let us write them 
as 21,9 = re*'^T , Then 


oF a 
G(z) = — GM 13.5.5 
(2) (z—reoT)(z—re-mol) — 7? — Ir cos(woT)z +r? ( ) 
where r? = —az, and woT' = cos™! (a1/2r). From the inversion integral, 
goti zn*l 
n=R TANE: :z]|, | (18.5.6 
I re E — 2r cos(woT)z + r? a] P E — 2r cos(woT )z + r? 2 ( ) 


where gn denotes the impulse response. Now 


(z— z)z" 


gurl 
R x = in ——— ——— — 13.5.7 
i E — 2r cos(woT )z + r?’ a] aci (z — z)(z — z2) ( ) 
—— aexp[i(n + LoT] 
T LiT (13.5.8) 
ER exp[i(n + 1)woT] (13.5.9) 
2i sin(woT) f > 
Similarly, 
EM r” exp[—i(n + 1)woT] 
; = 13.5.10 
ne z? — 2r cos(woT)z + r?’ = 2i sin(woT) ? ( ) 
and . 
| r"sin[(n + 1)woT] l (13.5.11) 


2 sin(woT) 


A graph of sin[(n + 1)woT]/sin(woT) with respect to n gives a sinusoidal envelope. 
More importantly, if |r| « 1 these oscillations vanish as n — oo and the system is stable. 
On the other hand, if |r| > 1 the oscillations grow without bound as n — oo and the system 
is unstable. 

Recall that |r| » 1 corresponds to poles that lie outside the unit circle while |r| « 1 is 
exactly the opposite. Our example suggests that for discrete-time systems to be stable, all 
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of the poles of the transfer function must lie within the unit circle while an unstable system 
has at least one pole that lies outside of this circle. 


Case 2: a2/A +a > 0. This case leads to two real roots, 2; and z2. From the inversion 
integral, the sum of the residues gives the impulse response 


Jn = 1—2. (13.5.12) 
41 — 22 


Once again, if the poles lie within the unit circle, |z;| < 1 and |z2| < 1, the system is stable. 


Case 3: a2/4 + a3 = 0. This case yields z1 = 22, 


c= (z um (13.5.13) 
and 
1 gntl "EN 
In = Oni f, (z— a fap > (2) (n 4 1). (13.5.14) 


This system is obviously stable if |a,/2| < 1 and the pole of the transfer function lies within 
the unit circle. 

In summary, finding the transfer function of a discrete-time system is important in 
determining its stability. Because the location of the poles of G(z) determines the response 
of the system, a stable system has all of its poles within the unit circle. Conversely, if 
any of the poles of G(z) lie outside of the unit circle, the system is unstable. Finally, if 
limp—+oo Jn = c, the system is marginally stable. For example, if G(z) has simple poles, 
some of the poles must lie on the unit circle. 


e Example 13.5.1 


Numerical methods of integration provide some of the simplest, yet most important, 
difference equations in the literature. In this example,? we show how z-transforms can be 
used to highlight the strengths and weaknesses of such schemes. 

Consider the trapezoidal integration rule in numerical analysis. The integral y, is 
updated by adding the latest trapezoidal approximation of the continuous curve. Thus, the 
integral is computed by 


Un — iT(z, + En 1Hn 1) + Yn 1H, 1; (13.5.15) 


where T is the interval between evaluations of the integrand. 
We first determine the stability of this rule because it is of little value if it is not stable. 
Using Equation 13.2.7, the transfer function is 


G(z) = us - 3 (5 7 r) . (13.5.16) 


To find the impulse response, we use the inversion integral and find that 


T wig eed 
z Som 


In = 4— 


= . 13.5.17 
4ni Jc sp Ven 


5 See Salzer, J. M., 1954: Frequency analysis of digital computers operating in real time. Proc. IRE, 
42, 457—466. 
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Figure 13.5.1: Comparison of various quadrature formulas by ratios of their amplitudes to that of an ideal 
integrator. (From Salzer, J. M., 1954: Frequency analysis of digital computers operating in real time. Proc. 
IRE, 42, p. 463.) 


At this point, we must consider two cases: n = 0 and n > 0. For n = 0, 


T z+1 T zl T 
— —Res ;0 R jl|——. 13.5.18 
limit: ent EE eL | 2 ( ) 
For n > 0, 
T n—1 1 
go = 5 Res| = et 2 -T. (13.5.19) 
z- 


Therefore, the impulse response for this numerical scheme is go = T and gn = T for n > 0. 
Note that this is a marginally stable system (the solution neither grows nor decays with n) 
because the pole associated with the transfer function lies on the unit circle. 

Having discovered that the system is not unstable, let us continue and explore some 


of its properties. Recall now that z = e?T = eT if s = iw. Then the transfer function 
becomes -— 
Tite™” iT wT 
G = l = t . 13.5.20 
=> ur 3 9 ( 2 ) ( ) 


On the other hand, the transfer function of an ideal integrator is 1/s or —i/w. Thus, the 
trapezoidal rule has ideal phase but its shortcoming lies in its amplitude characteristic; it 
lies below the ideal integrator for 0 < wT < m. We show this behavior, along with that for 
Simpson's one-third rule and Simpson's three-eighths rule, in Figure 13.5.1. 

Figure 13.5.1 confirms the superiority of Simpson's one-third rule over his three-eighths 
rule. The figure also shows that certain schemes are better at suppressing noise at higher 
frequencies, an effect not generally emphasized in numerical calculus but often important in 
system design. For example, the trapezoidal rule is inferior to all others at low frequencies 
but only to Simpson's one-third rule at higher frequencies. Furthermore, the trapezoidal 
rule might actually be preferred, not only because of its simplicity but also because it 
attenuates at higher frequencies, thereby counteracting the effect of noise. 


e Example 13.5.2 
Given the transfer function 


(13.5.21) 


The Z- Transform 701 


2.5 


m 


Q0000020202002.-2. 020 0 0 4 
o 
o 


impulse response 
in 
o 


o 


05 5 10 15 20 
n+1 


Figure 13.5.2: The impulse response for a discrete system with a transform function given by Equation 
13.5.21. 


is this discrete-time system stable or marginally stable? 

This transfer function has two simple poles. The pole at z = 1/2 gives rise to a term 
that varies as (3)* in the impulse response while the z — 1 pole gives a constant. Because 
this constant neither grows nor decays with n, the system is marginally stable. 


e Example 13.5.3 


In most cases the transfer function consists of a ratio of two polynomials. In this 
case we can use the MATLAB function filter to compute the impulse response as follows: 
Consider the Kronecker delta sequence, zo = 1, and £n = 0 for n > 0. From the definition 
of the z-transform, X(z) — 1. Therefore, if our input into filter is the Kronecker delta 
sequence, the output yn will be the impulse response since Y(z) = G(z). If the impulse 
response grows without bound as n increases, the system is unstable. If it goes to zero as 
n increases, the system is stable. If it remains constant, it is marginally stable. 

To illustrate this concept, the following MATLAB script finds the impulse response 
corresponding to the transfer function, Equation 13.5.21: 


^4 enter the coefficients of the numerator 

^ of the transfer function, Equation 13.5.21 
num = [1 0 0]; 

% enter the coefficients of the denominator 

h of the transfer function, Equation 13.5.21 
den = [1 -1.5 0.5]; 

% create the Kronecker delta sequence 

x = [1 zeros(1,20)]; 

^ find the impulse response 

y = filter(nun,den,x); 

^4 plot impulse response 

plot(y,’o’), axis([O 20 0.5 2.5]) 
xlabel(’n+1’,’Fontsize’ ,20) 

ylabel(’impulse response’, ’Fontsize’ ,20) 


Figure 13.5.2 shows the computed impulse response. The asymptotic limit is two, so the 
system is marginally stable as we found before. 
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We note in closing that the same procedure can be used to find the inverse of any 
z-transform that consists of a ratio of two polynomials. Here we simply set G(z) equal to 
the given z-transform and perform the same analysis. 


Problems 
For the following time-discrete systems, find the transfer function and determine whether 
the systems are unstable, marginally stable, or stable. Check your answer by graphing the 


impulse response using MATLAB. 


1. Yn — Un—1Hg-1 TX 2: Un — 2n iff, 1— Un 2H; 2 T X 


3. Un — 3yn—1Hga-41 TX 4. Un — iyn-2 H2 T Xn 
Further Readings 


Jury, E. L, 1964: Theory and Application of the z-Transform Method. John Wiley & Sons, 
330 pp. The classic text on z-transforms. 


LePage, W. R., 1980: Complex Variables and the Laplace Transform for Engineers. Dover, 
483 pp. Chapter 16 is on z-transforms. 
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Chapter 14 
The Hilbert Transform 


In addition to the Fourier, Laplace, and z-transforms, there are many other linear 
transforms that have their own special niche in engineering. Examples include Hankel, 
Walsh, Radon, and Hartley transforms. In this chapter we consider the Hilbert transform, 
which is a commonly used technique for relating the real and imaginary parts of a spectral 
response, particularly in communication theory. 

We begin our study of Hilbert transforms by first defining them and then exploring 
their properties. Next, we develop the concept of the analytic signal. Finally, we explore 
a property of Hilbert transforms that is frequently applied to data analysis: the Kramers- 
Kronig relationship. 


14.1 DEFINITION 


In Chapter 13 we motivated the development of z-transforms by exploring the concept 
of the ideal sampler. In the case of Hilbert transforms, we introduce another fundamental 
operation, namely quadrature phase shifting or the ideal Hilbert transformer. This proce- 
dure does nothing more than shift the phase of all input frequency components by —7/2. 
Hilbert transformers are frequently used in communication systems and signal processing; 
examples include the generation of single-sideband modulated signals and radar and speech 
signal processing. 

Because a —7/2 phase shift is equivalent to multiplying the Fourier transform of a 
signal by e-/7/? = —i, and because phase shifting must be an odd function of frequency, ! 


1 For a real function the phase of its Fourier transform must be an odd function of w. 
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the transfer function of the phase shifter is G(w) = —i sgn(w), where sgn(-) is defined by 
Equation 11.2.11. In other words, if X(w) denotes the input spectrum to the phase shifter, 
the output spectrum must be —i sgn(w)X(w). If the process is repeated, the total phase 
shift is —7, a complete phase reversal of all frequency components. The output spectrum 
then equals [~i sgn(w)]?X(w) = —X(w). This agrees with the notion of phase reversal 
because the output function is —z(t). 

Consider now the impulse response of the quadrature phase shifter, g(t) = JF !(G(w)]. 
From the definition of Fourier transforms, 


a a] tg(t)e "^ dt, (14.1.1) 
dw ake 
and 
Jp 
g(t) = iF (55) (14.1.2) 
Since G’(w) = —2i6(w), the corresponding impulse response is 
ciet shape (14.1.3) 
g(t) == m. 1. 


Consequently, if z(t) is the input to a quadrature phase shifter, the superposition integral 
gives the output time function as 


aia s F 20) de (14.1.4) 


We shall define Z(t) as the Hilbert transform of x(t), although some authors use the negative 
of Equation 14.1.4 corresponding to a +7/2 phase shift. The transform Z(t) is also called 
the harmonic conjugate of x(t). 

In similar fashion, a(t) is the Hilbert transform of the Hilbert transform of x(t) and 
corresponds to the output of two cascaded phase shifters. However, this output is known 


to be —z(t), so Z(t) = —qg(t), and we arrive at the inverse Hilbert transform relationship 
that 
1 1 [?? Zr) 
t) = —x(t = dr. 14.1.5 
9) BN F T ( ) 


Taken together, z(t) and Z(t) are called a Hilbert pair. Hilbert pairs enjoy the unique 
property that x(t) + iz(t) is an analytic function.? 


? For the proof, see Titchmarsh, E. C., 1948: Introduction to the Theory of Fourier Integrals. Oxford 
University Press, p. 125. 
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Descended from a Prussian middle-class family, David Hilbert (1862-1943) would make significant 
contributions in the fields of algebraic form, algebraic number theory, foundations of geometry, 
analysis, mathematical physics, and the foundations of mathematics. Hilbert transforms arose 
during his study of integral equations (Hilbert, D., 1912: Grundzüge einer allgemeinen Theorie der 
linearen Integralgleichungen. Teubner, p. 75). (Portrait courtesy of Photo AKG, London, with 
permission.) 


Because of the singularity at 7 = t, the integrals in Equation 14.1.4 and Equation 
14.1.5 must be taken in the Cauchy principal value sense by approaching the singularity 
point from both sides, namely 


L. (ea na f(r)ar + " f(r) ar| j (14.1.6) 


so that the infinities to the right and left of r = t cancel each other. See Section 10.10. 
We also note that the Hilbert transform is basically a convolution and does not produce a 
change of domain; if x is a function of time, then Ẹ is also a function of time. This is quite 
different from what we encountered with Laplace or Fourier transforms. 

From its origin in phase shifting, Hilbert transforms of sinusoidal functions are trivial. 
Some examples are 


ae 


cos(wt + p) = cos (wt + v — $) = sgn(w) sin(wt + v). (14.1.7) 


Similarly, 


p 


sin(wt + y) = —sgn(w) cos(wt + v), (14.1.8) 
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and ae f 
eite = —i sgn(u)e'" tte, (14.1.9) 


Thus, Hilbert transformation does not change the amplitude of sine or cosine but does 
change their phase by 37/2. 


e Example 14.1.1 


Let us apply the integral definition of the Hilbert transform, Equation 14.1.4, to find 
the Hilbert transform of sin(wt), w 4 0. 
From the definition, 


; = 1 f® mim) 
If x =t — 7, then 
H [sin(wt)] = a " inter) dz = — cos(wt) sgu(w). (14.1.11) 


e Example 14.1.2 


Let us compute the Hilbert transform of x(t) = sin(t)/(t? + 1) from the definition of 
the Hilbert transform, Equation 14.1.4. 
From the definition, 


ve sin(7) us 


A 1 T ?9 e 
T(t) = vf (= 7)(r? +1) dt = zs [ev [ WEE . . (141.12) 


Because of the singularity on the real axis at 7 = t, we treat the integrals in Equation 
14.1.12 in the sense of Cauchy principal value. 

To evaluate Equation 14.1.12, we convert it into a closed contour integration by in- 
troducing a semicircle Cg of infinite radius in the upper half-plane. This yields a closed 
contour C, which consists of the real line plus this semicircle. Therefore, Equation 14.1.12 
can be rewritten 


oo eT [d [d 
Pv j wenn Lege? Luxe» 
(14.1.13) 
The second integral on the right side of Equation 14.1.13 vanishes by Equation 10.9.7. 
The evaluation of the closed integral in Equation 14.1.13 follows from the residue 


theorem. We have that 


iz i - (z — t) ei? " eit 
and 
1Z ah erus (z = i) ei? E 5-1 
Re paar 7 pu (t—z)(z2+1) 2it-i) (14.1.15) 


We do not have a contribution from z = —7 because it lies outside of the closed contour. 
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15. 


16. 


'The Hilbert Transform of Some Common Functions 


function, x(t) 


1, a«t«b 
0, otherwise 


sin(wt + v) 


cos(wt + y) 


gie ttvei 


At + ua 
#2 + q?2? 


sin(at)Ji(at), O<a 
sin(at)J,(bt), O<b<a 
cos(at) Ji (at), O<a 


cos(at) J, (bt), O<b<a 


Dem —a«t«a 


0, otherwise 


sin (avt) H(t), 0«a 


Hilbert transform, x(t) 


1 l t—a 

—]n 

T —b 
—sgn(w) cos(wt + v) 


sgn(w) sin(wt + v) 


—i sen(w)ettr 
—nó(t) 


t 
a(t? + a?) 


pt — àa 
t? + a? 


t(1 +t?) 
V2 (1 4- t4) 


1 — cos(at) 
t 


e 1 — cos(t) 
Ts? 


— cos(at) Ji (at) 
— cos(at) Jy, (bt) 
sin(at) J; (at) 


sin(at) J, (at) 


t4 yt? —a?, —oo «t« —a 


t —a«t«a 


t—vVt?^—a?^, a«t«oo 


zav ltl, —oo«t«0 
— cos (avt), 0<t<oo 
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Therefore, 


e eit mie — me l(t-i) 
PV dr — | ; 14.1.16 
EE 15 BFI 241 ( ) 


Only one half of the value of the residue at z = t was included; this reflects the semicircular 
indentation around the singularity there. Substituting Equation 14.1.16 into Equation 
14.1.12, we obtain the final result that 


sin(t)] | e^! — cos(t) 
x = (14.1.17) 


e Example 14.1.3 


Let us employ the relationship that the Fourier transform of (t) equals —isgn(w) times 


the Fourier transform of z(t) to find the Hilbert transform of z(t) — et 
Because J(e-*) = yse- 9" /A, 
(w) = —ivnsgn(w)e- 9/4. (14.1.18) 
'Therefore, 
2( ) _ t ° itw—w?/4 3 a itw—w lid (14 1 19) 
EE 7S NW 345 Jo ' 
p 0 oo 
m 1 2itn—n 1 Qitn—1n? 
= — e dn — — e dn (14.1.20) 
vm —oo vm 0 
—t? t —1? pico —t? pt 
e 2 e 2 2e 
= e? ds— —— e? ds = —— | e? ds, (14.1.21) 
vm —ioo Vm t vm 0 


where s = t + ni. The integral in Equation 14.1.21 is the well-known Dawson’s integral.” 
See Gautschi and Waldvogel^ for an alternative derivation. 


e Example 14.1.4: Numerical computation of the Hilbert transform 


Recently André Weideman? devised a particularly efficient method for numerically 
computing the Hilbert transform when z(t) is known exactly for any real t and enjoys the 
property that 

oo 
1 lc (t)? dt < oo. (14.1.22) 


— oo 


3 Press, W. H., S. A. Teukolsky, W. T. Vetterling, and B. P. Flannery, 1992: Numerical Recipes in 
Fortran: The Art of Scientific Computing. Cambridge University Press, Section 6.10. 


^ Gautschi, W., and J. Waldvogel, 2000: Computing the Hilbert transform of the generalized Laguerre 
and Hermite weight functions. BIT, 41, 490—503. 


5 Weideman, J. A. C., 1995: Computing the Hilbert transform on the real line. Math. Comput., 64, 
145—162. 
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Given Equation 14.1.22, the function x(t) can be represented by the rational expansion 


apes X anprlt), (14.1.23) 
where p, (t) is the set of rational functions 
(ra 
a pm AT E = 0,+1,+2,---, 14.1.24 
Pa) = E” (14.1.24) 
and iem 
An = Ji x(t) (t) dt (14.1.25) 
or E 
Gn =5- | [L itan(30)] z[tan(56)] e n? do, (14.1.26) 
T Jax 


if we introduce the substitution t — tan(0/2). 
Why is Equation 14.1.23 useful? Taking the Hilbert transform of both sides of this 
equation, 


PE) = XO nln): (14.1.27) 
Using contour integration, we find that 
A 1 oe (1 +ir)” f 
»(t) = —P = n(t), 14.1.2 
po =PV fT y ci sms (14.1.28) 


where sgn(t) is the signum function with sgn(0) = 1. Therefore, 


oo 


a(t) =i M sgn(n)a, ps (t). (14.1.29) 


n=—CoO 


We must now approximate Equation 14.1.29 so that we can evaluate it numerically. 
We do this by introducing the following truncated version: 


t) 2 —i X` sgn(n) An p. (t). (14.1.30) 


This particular truncation was chosen because pn (t) and p_p—1(t) are a conjugate pair. The 
coefficient an has become An, which equals 


N-1 
I ; 1 1 inb; 
any $5 [l-itan(30;)] x[tan(39;)] e "^^, (14.1.31) 
j=-N+1 
where 0; = mj/N. The terms corresponding to j = +N have been set to zero because it 


is assumed that x(t) vanishes rapidly with t — -too. Finally, we substitute 0 for t and 
transform Equation 14.1.30 into 


r N-1 
Ey (ty) = — eae) ne» sgn(n) Aner, (14.1.32) 
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— exact Hilbert transform 
0.8- o computed Hilbert transform 


-15 -10 -5 0 5 10 15 


+ 


Figure 14.1.1: The Hilbert transform for x(t) = 1/(1 + t4) computed from Weideman’s algorithm. 


The advantage of Equation 14.1.31 and Equation 14.1.32 is that they can be evalu- 
ated using fast Fourier transforms. For example, the following MATLAB script devised by 
Weideman illustrates his methods for x(t) = 1/(1 + t*): 


% initialize parameters used in computation 

b = 1; N = 8; n = [-N:N-1]’; 

^ set up collocation points and evaluate function there 
t = b * tan(pi*(nt+1/2)/(2*N)); F = 1./(1+t.74); 

^ evaluate Equation 14.1.31 

an - fftshift(fft(F.*(b-i*t))); 

^ compute Hilbert transform via Equation 14.1.32 
hilbert = ifft(fftshift(i*(sign(n-1/2).*an)))./(b-i*t); 
hilbert -real(hilbert); 

% find points at which we will compute exact answer 

tt = [-12:0.02:12]; 

% compute exact answer 

answer = tt.*(1*tt.^2)./(1*tt.^4)./sqrt (2); 

fzero = zeros(size(tt)); 


% plot both computed Hilbert transform and exact answer 
plot(tt,answer,’-’,t,hilbert,’o’,tt,fzero,’--’) 
xlabel(’t’,’Fontsize’ ,20) 

legend(’exact Hilbert transform’ ,’ computed Hilbert transform’) 
legend boxoff 


Figure 14.1.1 illustrates Weideman’s algorithm for numerically computing the Hilbert 
transform of 1/(1 + tî). 

There are two important points concerning Weideman's implementation of his algo- 
rithm. First, the collocation points originally given by t; = tan|rj/(2N)], j = —N,...,N—1 
have changed to t; = tan[(j + 3)n/(2N)], j = —N,...,N — 1. This change replaces the 
trapezoidal rule discretization for the Fourier coefficients with a midpoint rule. The advan- 
tages are twofold: First, it avoids the nuisance of dealing with a collocation point at infinity. 
Second, it actually yields more accurate results in many cases. 
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The discerning student will also notice that Weideman introduced a free parameter b, 
which we set to one. This rescaling parameter can have a major influence on the accuracy. 
The interested student is referred to the bottom of page 756 in Weideman’s paper for further 
details. 


e Example 14.1.5: Discrete Hilbert transform 


Quite often the function is given as discrete data points. How do we find the Hilbert 
transform in this case? We will now prove® that the equivalent discrete Hilbert transform 
is 


= 5 CEN k even, 
T a n odd k-n 
H(fn) = fk = 4 9 f (14.1.33) 
25y oy k odd, 
T k-n 


n even 


where f, denotes a set of discrete data values that are sampled at t = nT and both k and 
n run from —oo to oo. The corresponding inverse is 


2 fr 
m »» Ban n even, 
n= li > 14.1.34 
on E g (14.1.34) 
— ; n odd. 
W k even k-n 


We begin our proof by inserting Equation 14.1.33 into Equation 14.1.34. For n even, 


2 1 2 à 4 E 
h= 25 ex »» Z )-4 b» 5 =e (14.1.35) 


k odd p even E peven k odd 


= - p» mE H E >, n Df =}: (14.1.36) 


k odd peven,pzn k odd 


The term within the curly brackets equals zero as k runs through all of its values. Therefore, 
Equation 14.1.36 reduces to 


fn = 54 (14 Ea e). (14.1.37) 


T2 


However, the term in the brackets of Equation 14.1.37 equals 4?/8. Therefore, Equation 
14.1.33 and Equation 14.1.34 is proved for n even. An identical proof follows for n odd. 

A popular alternative’ to Equation 14.1.33 involves the (fast) Fourier transform and 
the relationship that X (w) = —i sgn(w) X(w), where X(w) and X (w) denote the Fourier 
transform of x(t) and Z(t), respectively. In this technique, a fast Fourier transform is taken 
of the data. This transformed dataset is then multiplied by —i sgn(w) and then back 
transformed to give the Hilbert transform. 


6 See Kak, S. C., 1970: The discrete Hilbert transform. Proc. IEEE, 58, 585-586. For an alternative 
derivation, see Kress, R., and E. Martensen, 1970: Anwendung der Rechteckregel auf die reelle Hilbert- 
transformation mit unendlichem Intervall. Z. Angew. Math. Mech., 50, T61—T64. 


7 Čížek, V., 1970: Discrete Hilbert transform. IEEE Trans. Audio Electroacoust., AU-18, 340-343. 
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Let x(t) be a real, even function. Then X(w), the Fourier transform of x(t), is also an 
even function. Consequently, 


a(t) = = n aa do = B L uae dt dur crga) 


-——— sgn(w)X (w) cos(wt) dw + = | sgn(w).X (w) sin(wt) dw (14.1.39) 


2r J 
1 f? ; 
= f X (w) sin(wt) dw. (14.1.40) 


Note that the Hilbert transform in this case is an odd function. Similarly, if z(t) is a real, 
odd function, 


su + f “OD Gattis (14.1.41) 
and the Hilbert transform is an even function. 
Problems 
1. Show that the Hilbert transform of a constant function is zero. 
2. Use Equation 14.1.4 to compute the Hilbert transform of cos(wt), w Æ 0. 


3. Use Equation 14.1.4 to show that the Hilbert transform of the Dirac delta function ó(t) 
is 1/ (nt). 


4. Use Equation 14.1.4 to show that the Hilbert transform of 1/(t? + 1) is t/(t? + 1). 
5. The output y(t) from an ideal lowpass filter can be expressed by the convolution integral 


sin(27wt) 


y(t) = x(t) * 


Tt i 


where x(t) is the input signal. Show that this expression can also be expressed in terms of 
Hilbert transforms as 


y(t) = H[x(t) cos(2rwt)] sin(2rwt) — H[a(t) sin(Qrwt)| cos(2nwt). 


Following Example 14.1.3, find the Hilbert transforms of 


1, —a«t«a 
otherwise 


8. Using the commutative and associate properties of convolution, f(t) * g(t) = g(t) » f (t) 
and [f (t) * g(t)] « v(t) = f(t) x [g(t) x v(t)], respectively, and the definition of the Hilbert 
transform, Equation 14.1.4, show? that 


HISE) * g(t)] = Ft)  g(t) = FE * g(t). 


8 For an application, see Sakai, H., and G. A. Vanasse, 1966: Hilbert transform in Fourier spectroscopy. 
J. Opt. Soc. Am., 56, 131-132. 
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Using MATLAB, test Weideman’s algorithm for the following cases. Why does the algorithm 
do well or not? 


1, -l<t<l å 1 sin(t) 
9 t terius 10. sin(¢) 11. BRE 12. eme 


For Problem 12, you will need 


{sn} - e-V/ Y? [cos(1//2 ) + sin(1//2 )#?] — cos(t). 


t4+1 15-1 


14.2 SOME USEFUL PROPERTIES 


In principle we could construct any desired transform from the definition of the Hilbert 
transform. However, there are several general theorems that are much more effective in 
finding new transforms. 


| Linearity 


From the definition of the Hilbert transform, it immediately follows that if z(t) = 
cıx(t) + coy(t), where cı and cz are arbitrary constants, then Z(t) = c, Z(t) + c29(t). 


The energy in a signal and its Hilbert 
transform are the same. 


Consider the energy spectral densities at input and output of a quadrature phase shifter. 
The output equals 


IE GP? = FEO? = |- i sew) IX GP = Xw). (14.2.1) 


Because the energy spectral density at input and output are the same, so are the total 
energies. 


A signal and its Hilbert transform 
are orthogonal. 


From Parseval's theorem, 


s x(t)@(t) dt = i X (w) X* (w) dw, (14.2.2) 


where X(w) = [2(t)]. Then, 
[a X (wv) X* (w) dw = y i sgn(w)| X (w)|? dw = 0, (14.2.3) 


because the integrand in the middle expression of Equation 14.2.3 is odd. Thus, 


J " e(t) dt = 0. (14.2.4) 


— oo 
The reason why a function and its Hilbert transform are orthogonal to each other follows 
from the fact that a Hilbert transformation of a function shifts the phase of each Fourier 
component of the function forward by 7/2 for positive frequencies and backward for negative 
frequencies. 
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e Example 14.2.1 


Let us verify the orthogonality condition for Hilbert transforms using x(t) = 1/(1--t?). 
Because Z(t) = t/(1 + t°), 


T 2(t)a(t) dt = J- zz. ib esii (14.2.5) 


since the integrand is an odd function. 


| Shifting 


Let us find the Hilbert transform of x(t + a) if we know Z(t). From the definition of 
Hilbert transforms, 


H{x(t + a)] = i UEA a= : Go dr att +0) (14.2.6) 


or H[x(t + a)]| = Z(t + a). 


| Time scaling 


Let a > 0. Then, 


BOR 25 2 f® a(r) E 
?t|x(at)] = z i qum dn = T cum dr = (at). (14.2.7) 
On the other hand, if a < 0, 
db ae) ee 12 x(T) m 
H[z(at)] = a I Pon dn SE dr = —Z(at). (14.2.8) 


Thus, we have that H[z(at)] = sgn(a) Z(at). 


| Derivatives 


Let us find the relationship between the nth derivative of x(t) and its Hilbert transform. 
Using the derivative rule as it applies to Fourier transforms, 


d” c d^$ 
H U ES = —isgn(w)(iw)" X (w) = (iw)"[—i sgn(w) X (w)] = (tw)"X(w) = F E 
(14.2.9) 
Taking the inverse Fourier transforms, we have that 


d'sN dẹ 
=, 14.2.10 
" ( dt" ) dt" ( ) 
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Some General Properties of Hilbert Transforms 


function, x(t) Hilbert transform, x(t) 

1 x(t) —a(t) 
2 a(t) + y(t) a(t) + y(t) 
3 x(t+a), areal a(t +a) 

d” x(t) d^z(t) 
* dt” dt” 
5 x(at) sgn(a) x(at) 
6 ta(t) t@(t) + 1 f^ a(r) dr 
7 (t+ a)a(t) (t + a)g(t) + 4 f^ z(r)dr 


| Convolution 


Hilbert transforms enjoy a similar, but not identical, property with Fourier transforms 
with respect to convolution. If 


w(t) = u(t) * v(t) = D u(r)v(t — 7) dr = is u(t — r)v(7) dr, (14.2.11) 


then 


Proof: From the convolution theorem for Fourier transforms, W (w) = V(w)U(w). Multi- 
plying both sides of the equation by —isgn(w), 


W (w) = —isgn(w)W(w) = V(w)[-isgn(w)U(w)] = V(w)T(w). (14.2.13) 


Again, using the convolution theorem as it applies to Fourier transforms, we arrive at the 
final result. 


e Example 14.2.2 


Given the functions u(t) = cos(t) and v(t) = 1/(1 + t*), let us verify the convolution 
theorem as it applies to Hilbert transforms. 
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With u(t) = cos(t) and v(t) = 1/(1 + t*), 


w(t) = u(t) « v(t) = D eO E ay (14.2.14) 
= n i oe dx + n i ee dz (14.2.15) 
= Te Vv? c = in EN 
V | (75) +s (=)| cos(t) (14.2.16) 
UM w(t) = Je Vv2 ies ( 5) + sin( 7) sin(t) (14.2.17) 
Because D(t) = t(1 + t?)/[V2 (1+ t*)], 
u(t)  à( n= cos(t — x) Her) Lok (14.2.18) 
cos(f)cos(r)r(l-- z?) , , 1 (7? sin(t) sin()a(1 + z?) 
-5f 14 z^ LEE f l4 z* m 
(14.2.19) 
- 5 sin(t) |. nse )sin(e) dz (14.2.20) 
= T -1/72 A. in A in DE 
A ies (5) +s (=)| sin(t), (14.2.21) 


and the convolution theorem for Hilbert transforms holds true in this case. 


| Product theorem 


Let f(t) and g(t) denote complex functions with Fourier transforms F(w) and G(w), 
respectively. If 


1) F(w) vanishes for |w| > a, and G(w) vanishes for |w| < a, where a > 0, 
Or 
2) f(t) and g(t) are analytic functions (their real and imaginary parts are Hilbert pairs), 


then the Hilbert transform of the product of f(t) and g(t) is 
AL (0g(t)) = FAI). (14.2.22) 


Proof:? The product f(t)g(t) can be expressed as 


fa) = s | / F(u)G(v) ett" qu qu. (14.2.23) 


9 See Bedrosian, E., 1963: A product theorem for Hilbert transforms. Proc. IEEE, 51, 868-869. This 
theorem has been extended to functions of n-dimensional real vectors by Stark, H., 1971: An extension of 
the Hilbert transform product theorem. Proc. IEEE, 59, 1359-1360. 
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V 
> > 
u u 
utv =0 u+tv =0 
(a) F(u)=0,lul>a (b) F(u) = 0, u«0 
G(u) = 0, lvl < a G(u) = 0, v«0 


Figure 14.2.1: Region of integration in the proof of the product theorem. 


Because ?1(e/*) = i sgn(b)e**, 
i oo oo R 
HIFOJ) = Ia I J F(u)G(v) sgn(u + vet" qu du. (14.2.24) 


The shaded regions of Figure 14.2.1 are those in which the product F(u)G(v) is nonvanishing 
for the conditions of the theorem. In Figure 14.2.1(a) the nonoverlapping Fourier transforms 
yield two semi-infinite strips in which the product is nonvanishing. In Figure 14.2.1(b), 
for analytic functions, the Fourier transforms vanish for negative arguments!? so that the 
product is nonvanishing only in the first quadrant. In both cases sgn(u + v) = sgn(v) over 
the regions of integration in which the integrand is nonvanishing. Thus, 


HIFO] = na I B n ^ F(u)G(v) sgn(v)e t+ du du (14.2.25) 


= fox f G(v) sgn(v)e du = f(t)g(t). (14.2.26) 


e Example 14.2.3: Hilbert transforms of band-pass functions 


In communications, we have the double-sideband, amplitude-modulated signal given 
by a(t) cos(wt + p), where y is constant. From the product theorem its Hilbert transform 
equals a(t) sin(wt + y), w > 0, provided that the highest frequency component in a(t) is 
less than w. Paradoxically, the Hilbert transform of more general a(t) cos[wt + y(t)], which 
equals a(t) sin[wt + y(t)], has no such restriction. 


Problems 


Verify the orthogonality property of Hilbert transforms using 


1. z(t) 2 1/(14 t?) 2. x(t) = sin(t)/(1 + £) 
1, 0«ct«a 
a) is otherwise 


10 Titchmarsh, E. C., 1948: Introduction to the Theory of Fourier Integrals. Oxford University Press, 
p. 128. 
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Verify the convolution theorem for Hilbert transforms using 


1, 0<t<a, ] 1 
4. u(t) — { 0, otherwise, v(t) = sin(t) 5. u(t) = cos(t), v(t) 


6. Use the product theorem to show that 
H(sin(at) J, (bt)] = — cos(at) J, (bt), 0<b<a, 


ifn =0,1,2,3,.... 
Hint: 


7.60] = A2 m (P) nro — p. 


where T, (-) is a Chebyshev polynomial of the first kind and m = n/2 or (n—1)/2, depending 
upon which definition gives an integer. 


7. Given cosine and sine integrals: 


Ci(z) = — / E ee) dt,  Siz)--— / TREU 


and 
^[Ci(a|t])) = —sgn(t)Si(a|t), 0< a, 


use the product rule to show that 


H{sin(bt)Ci(alt|)] = —sgn(t)sin(bt)Si(alt), ^ 0 b «a. 


Hint: 
; 0, 0 « |w] <a, 
re a MIR ce 
8. Prove that 1 fe 
Hitet) = t2) - — J 2(r) dr 


Hint: 
Tx(T)  ta(7) 


t-r t-r 


14.3 ANALYTIC SIGNALS 
The monochromatic signal A cos(wot + y) appears in many physical and engineering 


applications. It is common to represent this signal by the complex representation Ae!(»ot-9), 
These two representations are related to each other by 


Acos(wot + p) = R [acit] = } [Meilt + dem Hutto] (14.3.1) 


Furthermore, the Fourier transform of A cos(wot + y) is 


JF[A cos(wot + y)] = 3 [Ae'*o(w — wo) + Ae **9(w + wo)] , (14.3.2) 
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Figure 14.3.1: The spectrum of the analytic signal when x(t) is the rectangular pulse given by Equation 
11.1.9. 


while the Fourier transform of Ae/(2ot*9) is 
F [Acito] = Aet? (w — wo). (14.3.3) 


As Equation 14.3.2 and Equation 14.3.3 clearly show, in passing from the real signal to 
its complex representation, we double the strength of the positive frequencies and remove 
entirely the negative frequencies. 

Let us generalize these concepts to nonmonochromatic signals. For the real signal z(t) 
with Fourier transform X (w) and the complex signal z(t) with Fourier transform Z(w), the 
previous paragraph shows that our generalization must have the property: 


Z(w) = X(w) + sgn(w) X (w) (14.3.4) 
or 
2X (w), w > 0, 
Z(w) =< X(w), w=0, (14.3.5) 
0, w <0 


Taking the inverse of Equation 14.3.4, we have the definition of an analytic signal as 
z(t) = x(t) + id (t), (14.3.6) 
where x(t) is a real signal and Z(t) is its Hilbert transform. 
e Example 14.3.1 
In Figure 14.3.1 the amplitude spectrum of the analytic signal is graphed when x(t) is 


the rectangular pulse, Equation 11.1.9. Note that the amplitude spectrum equals zero for 
w <0 and twice the amplitude spectrum for w > 0. 
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Figure 14.3.2: Given a function x(t) with an amplitude spectrum shown in (a), frame (b) shows the 
amplitude spectrum of the amplitude-modulated signal x(t) cos(wot) while frames (c) and (d) give the 
amplitude spectrum of the analytic signal z(t) and «(t) cos(wot) — T(t) sin(wot), respectively. 


e Example 14.3.2 


Let us find the energy of an analytic signal. 
The energy of an analytic signal is 


T popas [^ 2(t) at f P) a=2 f 2?(t) dt = T |X (w) 2 dw 


—oo —oo —oo —Pe9 

(14.3.7) 
by Parseval's theorem. Thus, the analytic signal has twice the energy of the corresponding 
real signal. 


Consider the function z(t) whose amplitude spectrum is shown in Figure 14.3.2(a). If 
we were to amplitude modulate x(t) with cos(wot), then the amplitude spectrum of this 
modulated signal would appear as pictured in Figure 14.3.2(b). 

Consider now the signal 


y(t) = x(t) cos(wot) — (t) sin(wot) = R {[x(t) + iz(£)]e^"»t V (14.3.8) 
—Hisne = $ lee" +2 Qe "|; (14.3.9) 


where z(t) is the analytic signal of x(t). We have plotted the amplitude spectrum |Z(w)| in 
Figure 14.3.2(c). If we computed the amplitude spectrum of y(t), we would find that 


Y (w) = $Z(w — wo) + 52 (—w — wo) (14.3.10) 
X (w — wo), Wo <w < Wo + Wmax; 
= X*(—w m wo), —UQ0 — Wmax € W < wo, (14.3.11) 
0, otherwise. 


We have sketched this amplitude spectrum |Y (w)| in Figure 14.3.2(d). Each triangular part 
is called the single sideband signal because it contains the upper frequencies (|w| > wo) of 
the modulated signal x(t) cos(wot). Similarly, if we had used x(t) cos(wot) + X(t) sin(wot), 
we would only have obtained the lower sidebands. Consequently, a communication system 
using x(t) cos(wot) — Z(t) sin(wot) or x(t) cos(wot) + Z(t) sin(wot) would realize a 50% savings 
in its frequency bandwidth over one transmitting x(t) cos(wot). 
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Problems 
1. Find the analytic signal corresponding to x(t) = cos(wt), w > 0. 


2. Show that the polar form of an analytic signal can be written 


where 


3. Analytic signals are often used with narrow-band waveforms with carrier frequency wo. 
If y(t) = wot + y'(t), show that the analytic signal can be written z(t) = r(t)e'*»*, where 
r(t) = |z(t)|e!* (0. The function r(t) is called the complex envelope or the phasor amplitude; 
this is a generalization of the phasor idea beyond pure alternating currents. 


14.4 CAUSALITY: THE KRAMERS-KRONIG RELATIONSHIP 


Causality is the physical principle which states that an event cannot proceed its cause. 
In this section we explore what effect this principle has on Hilbert transforms. 

We begin by introducing the concept of causal functions. A causal function is a function 
that equals zero for all t < 0. As with all functions we can write it in terms of an even x(t) 
and an odd x,(t) part as z(t) = x-(t) + zo(t). Because x(t) is causal, zo(t) = sgn(t)xe(t) 
and 

x(t) = x(t) + sgn(t)z-(t). (14.4.1) 


Taking the Fourier transform of Equation 14.4.1, we find that the Fourier transform of all 
causal functions are of the form 


X(w) = X-(w) —iX.(w), (14.4.2) 
where 7 
Xu) = - | Zar dr, (14.4.3) 
and n 
X.(w) = JI Xela) dr, (14.4.4) 
because " — 
2nF [x (t)sgn(t)] = D Xe(w) = Ji AAO) (14.4.5) 


Equations 14.4.3 and 14.4.4 first arose in dielectric theory and, taken together, are called 
the Kramers'! and Kronig'* relation after their discoverers, who derived these relationships 
during their work on the dispersion of light by gaseous atoms or molecules. 


11 Kramers, H. A., 1929: Die Dispersion und Absorption von Róntgenstrahlen. Phys. Z., 30, 522-523. 


12 Kronig, R. de L., 1926: On the theory of dispersion of x-rays. J. Opt. Soc. Am., 12, 547-551. 
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e Example 14.4.1 
Let us verify the Kramers-Kronig relation using the causal time function z(t) = H(t). 


Because x(t) = 3 and X,(w) = 7d(w) by Equation 11.2.3, 


X.(w) = Ji sie i (14.4.6) 


T w= T 


— oo 


Consequently, by the Kramers-Kronig relation, 


F(H(t) = Xe(w) — iew) = mó(w) + E: (14.4.7) 


This agrees with the result given in Example 11.2.2. 
e Example 14.4.2 


A simple example of a causal function is the impulse response or Green's function 
introduced in earlier chapters. From Equation 14.4.2 we have the result that the trans- 
fer function G(w), the Fourier transform of the impulse response, must yield the Hilbert 
transform pair Ge(w) — iGe(w). 

For example, if g(t) = e~ H(t), then G(w) = 1/(1 + iw). Because 


: : [e (14.4.8) 
= $ "$. 
l+iw w+] wu? +1’ 

we have the Hilbert transform pair of 

1 P t 
e Example 14.4.3 

Let us verify the Kramers-Kronig relation for the Hilbert transform pair 
1 (P +1 
yl undo e tU. (14.4.10) 
441 V2(t4 + 1) 


by direct integration. 
From Equation 14.4.3, we have that 


BUDE. ADS: dr 
V2(w4 +1) TJ 2 (r^-D(w--) (14.4.11) 


Applying the residue theorem to the right side of Equation 14.4.11, we obtain 


wlw? + 1) 


V2(w4 +1) 


i 1 TUE 
(z* + 1)(w — z) 


-ime| 2iRes| 


E 


T : 
2i Res | .——— — ——.; e|. 14.4.12 
+ ikel oie | ( ) 
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We only include one half of the value of the residue at 7 = w because the singularity lies 
on the path of integration and we must treat this integration along the lines of a Cauchy 
principal value. Evaluating the residues, we find 


1 1 
Res| ie ay curs (14.4.13) 
1 ; V2 — (1-4 i)o 
Res | ———_——; eI = : (14.4.14) 
fees | 4v2 w- 25) +3 
and 
d lee ea =i IE (14.4.15) 


majera) i] 


Substituting Equation 14.4.13 through Equation 14.4.15 into the right side of Equation 
14.4.12, we obtain the left side. 


Problems 
1. For a causal function x(t), prove that x(t) = sgn(t)xe(t) and x(t) = sgn(t)xe(t). 


2. Redo our analysis if x(t) is a negative time function, i.e., x(t) = 0 if t > 0. Verify your 
result using x(t) = e! H(—t). 


3. Using g(t) = te * H(t), find the corresponding Hilbert transform pairs. 
4. Using g(t) = e * cos(wt) H(t), find the corresponding Hilbert transform pairs. 


5. Verify the Kramers-Kronig relation for the Hilbert transform pair: 


1 t 
mee d z(t) = ——— 
t? +1 sp my) t? 1 


x(t) 
by direct integration. 


Further Reading 


Hahn, S. L., 1996: Hilbert Transforms in Signal Processing. Artech House, 442 pp. Covers 
the basic theory and gives some practical applications. 


Taylor & Francis 


Taylor & Francis Group 


http://taylorandfrancis.com 


Chapter 15 


Green's Functions 


We have devoted a major portion of this book to solving linear ordinary and partial 
differential solutions. For example, in the case of partial differential equations we introduced 
the method of separation of variables, which leads to a solution in terms of an eigenfunction 
expansion. However, this method is not the only one; in Section 12.12 we showed how a 
solution can be constructed using the superposition integral. Here we expand upon this idea 
and illustrate how a solution, called a Green's function, to a differential equation forced by 
the Dirac delta function can be used in an integral representation of a solution when the 
forcing is arbitrary. 


15.1 WHAT IS A GREEN'S FUNCTION? 


'The following examples taken from engineering show how Green's functions naturally 
appear during the solution of initial-value and boundary-value problems. We also show that 
the solution u(x) can be expressed as an integral involving the Green's function and f(z). 


Circuit theory 


In electrical engineering, one of the simplest electrical devices consists of a voltage 
source v(t) connected to a resistor with resistance R and an inductor with inductance L. 
See Figure 15.1.1. Denoting the current by i(t), the equation that governs this circuit is 


LČ + Ri — v(t). (15.1.1) 
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Figure 15.1.1: The RZL electrical circuit driven by the voltage v(t). 


Consider now the following experiment: With the circuit initially dead, we allow the 
voltage to suddenly become Vo/Ar during a very short duration Ar starting at t = T. 
Then, at t = T + AT, we again turn off the voltage supply. Mathematically, for t > T + Ar, 
the circuit’s performance obeys the homogeneous differential equation 


P 
Lo +Ri=0,  t>7+Ar, (15.1.2) 
whose solution is 
ilt) = Ioe PV ^,  t>T+Ar, (15.1.3) 


where Jp is a constant and L/R is the time constant of the circuit. Because the voltage v(t) 
during T < t< T + Ar is Vo/Ar, then 


THAT 
/ v(t) dt = Vo. (15.1.4) 


Therefore, over the interval 7 < t < T + Ar, Equation 15.1.1 can be integrated to yield 


THAT THAT THAT 
rf ai+R f i(t) a= f v(t) dt, (15.1.5) 


or 


THAT 
A E S +R f desde. (15.1.6) 


If i(t) remains continuous as Ar becomes small, then 


THAT 
n [ i(t) dt x 0. (15.1.7) 
Finally, because 
i(r) — 0, (15.1.8) 
and 
i(r + AT) = Ipe ROTATE y Tye B/E, (15.1.9) 


for small Az, Equation 15.1.6 reduces to 


Llge 7/4 = Wy, (15.1.10) 
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T THAT t 


Figure 15.1.2: The current i(t) within an RL circuit when the voltage Vo/A is introduced between the 
times T < t< T +AT. 


or 


» 
Io = ao (15.1.11) 


Therefore, Equation 15.1.3 can be written as 


0, bm 
i(t) = | (15.1.12) 
Voe- Re-V/L Jp, T<t 


after using Equation 15.1.11. Equation 15.1.12 is plotted in Figure 15.1.2. 

Consider now a new experiment with the same circuit where we subject the circuit 
to N voltage impulses, each of duration Ar and amplitude V;/Ar with i = 0,1,...,N, 
occurring at t = 7;. See Figure 15.1.3. The current response is then 


0, t< To; 
Voe- Fet-70)/L JT, To «t«m, 
Voe- Ret-7/L JT, + Vjg-R—7/L Jp, Ti «i«T. 
i(t) — (15.1.13) 
N 
» Vg OT, TN Xt € TN4A- 
i=0 


Finally, consider our circuit subjected to a continuous voltage source u(t). Over each 
successive interval dr, the step change in voltage is u(r) dr. Consequently, from Equation 
15.1.13 the response i(t) is now given by the superposition integral 


t 
i(t) =| ae dr, (15.1.14) 


or 


=f v(r)g(t|v) dr, (15.1.15) 
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i(t) 


Figure 15.1.3: The current i(t) within an RL circuit when the voltage is changed at t = To, t = 71, and 
so forth. 


where 
e` R(t-7)/L 


g(tir) = ————. 


Here we have assumed that i(t) = v(t) = 0 for t < T. In Equation 15.1.15, g(t|7) is called 
the Green's function. As this equation shows, given the Green's function to Equation 15.1.1, 
the response i(t) to any voltage source u(t) can be obtained by convolving the voltage source 
with the Green's function. 

We now show that we could have found the Green's function, Equation 15.1.16, by 
solving Equation 15.1.1 subject to an impulse- or delta-forcing function. Mathematically, 
this corresponds to solving the following initial-value problem: 

dg 


L7 -Rg-é(-7) g(r) =0. (15.1.17) 


Taking the Laplace transform of Equation 15.1.17, we find that 


T<t. (15.1.16) 


€ 
G(slr) = zm (15.1.18) 
Or 
e- Re-7)/L 
g(t|r) = H(t — 7), (15.1.19) 


L 


where H(-) is the Heaviside step function. As our short derivation showed, the most direct 
route to finding a Green's function is solving the differential equation when its forcing 
equals the impulse or delta function. This is the technique that we will use throughout this 
chapter. 


| Statics 


Consider a string of length L that is connected at both ends to supports and is subjected 
to a load (external force per unit length) of f(x). We wish to find the displacement u(x) of 
the string. If the load f(x) acts downward (negative direction), the displacement u(x) of 
the string is given by the differential equation 

du 


aa = f(a); (15.1.20) 
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Figure 15.1.4: The response, commonly called a Green’s function, of a string fixed at both ends to a point 
load at x = £. 


where T denotes the uniform tensile force of the string. Because the string is stationary at 
both ends, the displacement u(x) satisfies the boundary conditions 


u(0) = u(L) = 0. (15.1.21) 


Instead of directly solving for the displacement u(x) of the string subject to the load 
f(x), let us find the displacement that results from a load ó(x — £) concentrated at the 
point x = £. See Figure 15.1.4. For this load, the differential equation, Equation 15.1.20, 
becomes 


d?g 
oo B 15.1.22 
<5 =e- 6) (15.1.22) 
subject to the boundary conditions 
gl) = g(L.]£) = 0. (15.1.23) 


In Equation 15.1.22, g(x|£) denotes the displacement of the string when it is subjected to 
an impulse load at x = £. In line with our circuit theory example, it gives the Green's 
function for our statics problem. Once found, the displacement u(x) of the string subject to 
any arbitrary load f(x) can be found by convolving the load f(a) with the Green's function 
g(x|£) as we did earlier. 

Let us now find this Green's function. At any point x Æ £, Equation 15.1.22 reduces 
to the homogeneous differential equation 


—2 = 15.1.24 
T =0, (15.1.24) 
which has the solution 
| j az 4- b, 0x z«£, 
g(z|£) = ta Pepe. (15.1.25) 


Applying the boundary conditions, Equation 15.1.23, we find that 
gO) =a:0+b=b=0, (15.1.26) 


and 
GLE) =cL+d=0, or d= -cL. (15.1.27) 
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Therefore, we can rewrite Equation 15.1.25 as 


ax, 0<a<6, 
g(z|£) = ae L), beget, (15.1.28) 
where a and c are undetermined constants. 
At x = €, the displacement u(x) of the string must be continuous; otherwise, the string 
would be broken. Therefore, the Green’s function given by Equation 15.1.28 must also be 
continuous there. Thus, 


a£ = c(€ — L), or c= (15.1.29) 


From Equation 15.1.22 the second derivative of g(z|£) must equal the impulse function. 
Therefore, the first derivative of g(r|£), obtained by integrating this equation, must be 
discontinuous by the amount 1/T' or 


. [de(£--e£)  dg(€—e«l€)] 1 
B dx dx n gore 
in which case 
dg(£*|£)  dg(€|€) 1 
= an oR (15.1.31) 


where £* and £^ denote points lying just above or below £, respectively. Using Equation 
15.1.28, we find that 


EOE NEM 5, (15.1.32) 
dx 
and 
dg(&*|€) ag 
Se = SS 15.1.33 
dx " E-L ( ) 
Thus, Equation 15.1.31 leads to 
a£ 1 aL 1 ¿E-L 
—-—- = >= = 15.1.34 
Baap euge UMS eg POOE ( ) 
and the Green’s function is 
1 
g(x|£) = pr — Ds, (15.1.35) 


where z< = min(z,£) and zx. = max(z,£). To find the displacement u(x) subject to the 
load f(x), we proceed as we did in the previous example. The result of this analysis is 


L M x T L 
wn-[ fosa tr | re@eac+ FOE- (18.136) 


since é < x in the first integral and x < € in the second integral of Equation 15.1.36. 
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| Integral Equations 


Consider the Sturm-Liouville problem 


y" +y =0, y(0) = y(L) =0. (15.1.37) 


From Section 6.1, nontrivial solutions exist only if 


nr? 


Wap 


y. (x) = sin( 777) (15.1.38) 


where n = 1,2,3,.... 
Consider now a new boundary-value problem: 
dy 
qa f(@), s0) = y(Z) = 0. (15.1.39) 


In the next section (Equation 15.2.76), we will show that we can write its solution by 


=f f(&)g(x|£) d (15.1.40) 
where the Green's function g(x|£) is given by 
d2 
T = —ô(x — £), g(0l) = g(L|£) = 0, (15.1.41) 
Or 
g(z|£) = (b — z>)x</L, (15.1.42) 


where zy = max(z,£) and zz = min(z, ). 
We can now use Equation 15.1.39 to rewrite Equation 15.1.37 as 


Ay(&) = f(&). (15.1.43) 


Multiplying Equation 15.1.43 by g(x|£) and integrating from 0 to L, we find that 


[re f(g(alé) dé = af ve g(x|£) d£, (15.1.44) 
Or 
a-a f ve g(x|£) dé = 0. (15.1.45) 


Because of the equivalence of Equation 15.1.37 and Equation 15.1.45, the solutions to the 
integral equation Equation 15.1.45 are A, = n?sz?/L? with y,(x) = sin(nmz/L). Direct 
substitution verifies this result. Thus, we can use Green's functions to construct integral 
equations that have known solutions. Indeed, it was the use of Green's functions to solve 
Fredholm integral equations that drew the attention of mathematicians at the turn of the 
twentieth century.! 


1 See Section 36 in Kneser, A., 1911: Integralgleichungen und ihre Anwendungen in der mathematischen 
Physik. Braunschweig, 293 pp. 
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15.2 ORDINARY DIFFERENTIAL EQUATIONS 


Second-order differential equations are ubiquitous in engineering. In electrical engi- 
neering, many electrical circuits are governed by second-order, linear ordinary differential 
equations. In mechanical engineering they arise during the application of Newton's second 
law. 

One of the drawbacks of solving ordinary differential equations with a forcing term is 
its lack of generality. Each new forcing function requires a repetition of the entire process. 
In this section we give some methods for finding the solution in a somewhat more gen- 
eral manner for stationary systems where the forcing, not any initially stored energy (i.e., 
nonzero initial conditions), produces the total output. Unfortunately, the solution must be 
written as an integral. 

In Example 12.8.3 we solved the linear differential equation 


y" +2y' +y = f(t), (15.2.1) 


subject to the initial conditions y(0) = y'(0) = 0. At that time we wrote the Laplace 
transform of y(t), Y (s), as the product of two Laplace transforms: 


1 


Y() = cri (15.2.2) 


One drawback in using Equation 15.2.2 is its dependence upon an unspecified Laplace 
transform F(s). Is there a way to eliminate this dependence and yet retain the essence of 
the solution? 
One way of obtaining a quantity that is independent of the forcing is to consider the 
ratio: Y (s) i 
E 
= G(s) = Coane (15.2.3) 


This ratio is called the transfer function because we can transfer the input F(s) into the 
output Y(s) by multiplying F(s) by G(s). It depends only upon the properties of the 
system. 

Let us now consider a related problem to Equation 15.2.1, namely 


g” +29 +g = (t), t>0, (15.2.4) 


with g(0) = g'(0) = 0. Because the forcing equals the Dirac delta function, g(t) is called 
the impulse response or Green's function? Computing G(s), 


1 


G(s) = Gal 


(15.2.5) 


From Equation 15.2.3 we see that G(s) is also the transfer function. Thus, an alternative 
method for computing the transfer function is to subject the system to impulse forcing and 
the Laplace transform of the response is the transfer function. 
From Equation 15.2.3, 
Y(s) = G(s)F(s), (15.2.6) 


? For the origin of the Green's function, see Farina, J. E. G., 1976: The work and significance of George 
Green, the miller mathematician, 1793-1841. Bull. Inst. Math. Appl., 12, 98-105. 
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or 
y(t) = g(t) * f(t). (15.2.7) 


That is, the convolution of the impulse response with the particular forcing gives the re- 
sponse of the system. Thus, we may describe a stationary system in one of two ways: (1) 
in the transform domain we have the transfer function, and (2) in the time domain there is 
the impulse response. 

Despite the fundamental importance of the impulse response or Green’s function for a 
given linear system, it is often quite difficult to determine, especially experimentally, and a 
more convenient practice is to deal with the response to the unit step H(t). This response 
is called the indicial admittance or step response, which we shall denote by a(t). Because 
EH (t)] = 1/s, we can determine the transfer function from the indicial admittance because 
£[a(t)] = G(s)L[A(t)] or sA(s) = G(s). Furthermore, because 


Llo] = G(s) = nes (15.2.8) 
then "— 
g(t) = att) (15.2.9) 


from Equation 12.1.18. 
e Example 15.2.1 
Let us find the transfer function, impulse response, and step response for the system 
y" — 3y' + 2y = f(t), (15.2.10) 
with y(0) = y'(0) = 0. To find the impulse response, we solve 
g” — 3g' + 2g = ó(t — 7), (15.2.11) 


with g(0) = g'(0) = 0. We have generalized the problem to an arbitrary forcing at t = T 
and now denote the Green's function by g(t|r). We have done this so that our discussion 
will be consistent with the other sections in the chapter. 

Taking the Laplace transform of Equation 15.2.11, we find that 


—SsT 


€ 


G(slr) = —3 32» 


(15.2.12) 


which is the transfer function for this system when 7 — 0. The impulse response or Green's 
function equals the inverse of G(s|7) or 


g(tir) = [et> —et=7] H(t- 7). (15.2.13) 


To find the step response, we solve 
a" — 3a' + 2a = H(t), (15.2.14) 
with a(0) = a’(0) = 0. Taking the Laplace transform of Equation 15.2.14, 


1 


AGE e DeD 


(15.2.15) 
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Figure 15.2.1: The impulse and step responses corresponding to the transfer function, Equation 15.2.12, 
with 7 = 0. 


and the indicial admittance is given by the inverse of Equation 15.2.15, or 


a(t) — E 4 je? — et. (15.2.16) 


Note that a'(t) = g(t|0). 


e Example 15.2.2 


MATLAB’s control toolbox contains several routines for the numerical computation 
of impulse and step responses if the transfer function can be written as the ratio of two 
polynomials. To illustrate this capacity, let us redo the previous example where the transfer 
function is given by Equation 15.2.12 with 7 = 0. The transfer function is introduced by 
loading in the polynomial in the numerator num and in the denominator den followed by 
calling tf. The MATLAB script 


clear 

^4 load in coefficients of the numerator and denominator 
^ of the transfer function 

num = [0 0 1]; den = [1 -3 2]; 

^4 create the transfer function 

sys = tf(num,den); 

^4 find the step response, a 

[a,t] = step(sys); 

^4 plot the indicial admittance 
subplot(2,1,1), plot(t, a, ’o’) 
ylabel(’indicial response’, ’Fontsize’ ,20) 
% find the impulse response, g 

[g,t] = impulse(sys) ; 

^4 plot the impulse response 
subplot(2,1,2), plot(t, g, ?o?) 
ylabel(’impulse response’, ’Fontsize’ ,20) 
xlabel(’time’,’Fontsize’ ,20) 


shows how the impulse and step responses are found. Both of them are shown in Figure 
15.2.1. 
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e Example 15.2.3 


There is an old joke about a man who took his car into a garage because of a terrible 
knocking sound. Upon his arrival the mechanic took one look at it and gave it a hefty 
kick.? Then, without a moment’s hesitation he opened the hood, bent over, and tightened 
up a loose bolt. Turning to the owner, he said, “Your car is fine. That’ll be $50.” The 
owner felt that the charge was somewhat excessive, and demanded an itemized account. 
The mechanic said, “The kicking of the car and tightening one bolt, cost you a buck. The 
remaining $49 comes from knowing where to kick the car and finding the loose bolt.” 

Although the moral of the story may be about expertise as a marketable commodity, 
it also illustrates the concept of transfer function.^ Let us model the car as a linear system 
where the equation 


d"y d"-ly dy 
1 aba sega = f(t 15.2.17 
E Age ge = JUS ( ) 


governs the response y(t) to a forcing f(t). Assuming that the car has been sitting still, the 
initial conditions are zero and the Laplace transform of Equation 15.2.17 is 


K(s)Y (s) = F(s), (15.2.18) 

where 
K(s) = ans” + a4 15"! ----- +018 + ao. (15.2.19) 

Hence, 
Y (s) = a = G(s) F(s), (15.2.20) 


where the transfer function G(s) clearly depends only on the internal workings of the car. 
So if we know the transfer function, we understand how the car vibrates because 


ut) = f Meade. (15.2.21) 


But what does this have to do with our mechanic? He realized that a short sharp kick 
mimics an impulse forcing with f(t) = ó(t) and y(t) = g(t). Therefore, by observing the 
response of the car to his kick, he diagnosed the loose bolt and fixed the car. 


In the previous examples, we used Laplace transforms to solve for the Green’s functions. 
However, there is a rich tradition of using Fourier transforms rather than Laplace transforms. 
In these particular cases, the Fourier transform of the Green’s function is called frequency 
response or steady-state transfer function of our system when r = 0. Consider the following 
examples. 


3 This is obviously a very old joke. 


^ Originally suggested by Stern, M. D., 1987: Why the mechanic kicked the car - A teaching aid for 
transfer functions. Math. Gaz., T1, 62-64. 
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e Example 15.2.4: Spectrum of a damped harmonic oscillator 


In mechanics the damped oscillations of a mass m attached to a spring with a spring 
constant k and damped with a velocity-dependent resistance are governed by the equation 


my" + cy' + ky = f(t), (15.2.22) 


where y(t) denotes the displacement of the oscillator from its equilibrium position, c denotes 
the damping coefficient, and f(t) denotes the forcing. 

Assuming that both f(t) and y(t) have Fourier transforms, let us analyze this system 
by finding its frequency response. We begin by solving for the Green's function g(t|7), 
which is given by 

mg" +cg' + kg = o(t — 7), (15.2.23) 


because the Green's function is the response of a system to a delta function forcing. Taking 
the Fourier transform of both sides of Equation 15.2.23, the frequency response is 


e ("T ett /m, 
G - = 15.2.24 
wir) k+icw— mw? wg + icw/m — w? ( ) 


where wi = k/m is the natural frequency of the system. The most useful quantity to plot 
is the frequency response or 


(15.2.25) 


eue 


= (15.2.26) 


In Figure 15.2.2 we plotted the frequency response as a function of c?/(km). Note that as the 
damping becomes larger, the sharp peak at w = wo essentially vanishes. As c?/(km) — 0, 
we obtain a very finely tuned response curve. Let us now find the Green’s function. From 
the definition of the inverse Fourier transform, 


65 git 1 oo eiet 
he Mines dw, (15.2.27 
mg(t|r) / ? iwm- we a 2m J. (w — wi) (w — wa) E l 


wia = Hw — 72 + i, (15.2.28) 


and y = c/(2m) > 0. We can evaluate Equation 15.2.27 by residues. Clearly the poles 
always lie in the upper half of the w-plane. Thus, if t < 7 in Equation 15.2.27 we can 
close the line integration along the real axis with a semicircle of infinite radius in the lower 
half of the w-plane by Jordan’s lemma. Because the integrand is analytic within the closed 
contour, g(t|r) = 0 for t < r. This is simply the causality condition,? the impulse forcing 


5 The principle stating that an event cannot precede its cause. 
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Figure 15.2.2: The variation of the frequency response for a damped harmonic oscillator as a function of 
driving frequency w. See the text for the definition of the parameters. 


being the cause of the excitation. Clearly, causality is closely connected with the analyticity 
of the frequency response in the lower half of the w-plane. 

If t > 7, we close the line integration along the real axis with a semicircle of infinite 
radius in the upper half of the w-plane and obtain 


(15.2.29) 
ere) sin| t—T)/wi-— J 
= 7 rade page = ( ) oT H(t T) 
w — w2 we — 7 
(15.2.30) 


Let us now examine the damped harmonic oscillator by describing the migration of the 
poles w1,2 in the complex w-plane as y increases from 0 to oo. See Figure 15.2.3. For 
y « wo (weak damping), the poles w1,2 are very near to the real axis, above the points 
cw, respectively. This corresponds to the narrow resonance band discussed earlier and 
we have an underdamped harmonic oscillator. As y increases from 0 to wo, the poles 
approach the positive imaginary axis, moving along a semicircle of radius wo centered at 
the origin. They coalesce at the point iwo for y = wo, yielding repeated roots, and we have 
a critically damped oscillator. For y > wo, the poles move in opposite directions along 
the positive imaginary axis; one of them approaches the origin, while the other tends to 
ioo as y — oo. The solution then has two purely decaying, overdamped solutions. During 
the early 1950s, a similar diagram was invented by Evans? where the movement of closed- 
loop poles is plotted for all values of a system parameter, usually the gain. This root-locus 
method is very popular in system control theory for two reasons. First, the investigator can 
easily determine the contribution of a particular closed-loop pole to the transient response. 
Second, he can determine the manner in which open-loop poles or zeros should be introduced 
or their location modified so that he will achieve a desired performance characteristic for 
his system. 


6 Evans, W. R., 1948: Graphical analysis of control systems. Trans. AIEE, 67, 547—551; Evans, W. 
R., 1954: Control-System Dynamics. McGraw-Hill, 282 pp. 
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Figure 15.2.3: The migration of the poles of the frequency response of a damped harmonic oscillator as 
a function of y. 


e Example 15.2.5: Low-frequency filter 


Consider the ordinary differential equation 


Ry + 5 = f) (15.2.31) 

where R and C are real, positive constants. If y(t) denotes current, then Equation 15.2.31 

would be the equation that gives the voltage across a capacitor in an RC circuit. Let us find 

the frequency response and Green's function for this system. We begin by writing Equation 
15.2.31 as 

Rg + a = 5(t—7), (15.2.32) 


where g(t|7r) denotes the Green's function. If the Fourier transform of g(t|r) is G(w|r), the 
frequency response G(w|r) is given by 


iwRG(w|r) + eur) ager (15.2.33) 
or ‘ ; 
ener Ce "7T 
SUIT DIEGEIO  LHSED! (152.34) 
and C C 
IG(w|7)) = (15.2.35) 


V1+w?R?C? VETIA 


where wp = 1/(RC) is an intrinsic constant of the system. In Figure 15.2.4 we plotted 
|G(w|r)| as a function of w. From this figure, we see that the response is largest for small 
w and decreases as w increases. 

'This is an example of a low-frequency filter because relatively more signal passes through 
at lower frequencies than at higher frequencies. To understand this, let us drive the system 
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Figure 15.2.4: The variation of the frequency response, Equation 15.2.35, as a function of driving frequency 
w. See the text for the definition of the parameters. 


with a forcing function that has the Fourier transform F(w). The response of the system 
will be G(w,0)F(w). Thus, that portion of the forcing function's spectrum at the lower 
frequencies is relatively unaffected because |G(w,0)| is near unity. However, at higher 
frequencies where |G(w, 0)| is smaller, the magnitude of the output is greatly reduced. 


e Example 15.2.6 


During his study of tumor growth, Adam" found the particular solution to an ordinary 
differential equation which, in its simplest form, is 


no _ flæļ/L-1, |z| < L, 
y -a y= { 0, EA (15.2.36) 


by the method of Green's functions. Let us retrace his steps and see how he did it. 
The first step is finding the Green's function. We do this by solving 


g" — o?g = ó(a), (15.2.37) 


subject to the boundary conditions lim),)-,.. g(x) — 0. Taking the Fourier transform of 
Equation 15.2.37, we obtain 

1 
w+ a2" 


G(w) = (15.2.38) 
The function G(w) is the frequency response for our problem. Straightforward inversion 


yields the Green’s function 
—ol|z| 
e 
=— : 15.2.39 
g(x) = (15.2.39) 


Therefore, by the convolution integral, y(x) = g(x) * f (x), 


L 


L 
y(x) = ds g(x — £) (J£/L — 1) dé = x] (1 —|é|/L) e-9l*- ge. (15.2.40) 


-L 


7T Adam, J. A., 1986: A simplified mathematical model of tumor growth. Math. Biosci., 81, 229-244. 
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To evaluate Equation 15.2.40 we must consider four separate cases: —oo < x < —L, 
—L«mz«0,0«z « L, and L « xz « oo. Turning to the —oo < x < —L case first, we have 


1 L 
y) — Le L a(z—&) q 2. 
ua) = s. [ O- IE eee ag (15.2.41) 
ax 0 ax L 
Tp uM 2t -a£ E 2 -a£ 
=od m &/L)e *5 dE + 2i (1— €/L)e~% d£ (15.2.42) 
E - (ru ES (15.2.43) 


Similarly, for x > L, 


1 L 
ula) = s; f -KI/De*e-9 ag (15.2.44) 
TAR 0 e 9* L 
os ] ameta F (1 — £/L) e** dé (15.2.45) 
H xS (^ ee" -2). (15.2.46) 


On the other hand, for —L < x < 0, we find that 


1 d =O eS 1 A GLO -— 
v= z | a-W/mee-tare = f (el/ea (15.2.47) 
2a ES, 2a x 
enor x ect 0 ear L 
= J (1 +E/L) e*5 d£ + sf (1--£/L)e 95 d£ + al (1— &/L)e~% d£ 
2a Jr 2a Jz 2a Jo 
(15.2.48) 
1 
= ay [e ** cosh(ax) + a(x + L) — e?*]. (15.2.49) 
Q 
Finally, for 0 < x < L, we have that 
if -a(s-£) 1 ft o(z-£) 
y(z)=5- f Q-lI8/LD)e d+ 5— |] Ab |g/ Le dg (15.2.50) 
2a LIT 2a x£ 
-EE f aesnetas t l'a-agnetas E f a-dgnesta 
a 2a =f E 2a 0 B 2a zr < 
(15.2.51) 
1 
ED [e ^" cosh(ax) + a(L — x) - e7% ]. (15.2.52) 
Q 
These results can be collapsed down into 
y(z) = TN e^" cosh(az) + a(L — |x|) — e~ 0"! | (15.2.53) 
o3L 
if |z| < L, and 
eR aL —aL 
uu) sr (ee +e 2) (15.2.54) 


if |2| > L. 
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| Superposition integral 


So far we showed how the response of any system can be expressed in terms of its 
Green’s function and the arbitrary forcing. Can we also determine the response using the 
indicial admittance a(t)? 

Consider first a system that is dormant until a certain time t = 7,. At that instant we 
subject the system to a forcing H(t — 7,). Then the response will be zero if t < 7; and will 
equal the indicial admittance a(t — 7,) when t > 7, because the indicial admittance is the 
response of a system to the step function. Here t—7, is the time measured from the instant 
of change. 

Next, suppose that we now force the system with the value f(0) when t = 0 and hold 
that value until t = rı. We then abruptly change the forcing by an amount f(r1) — f(0) 
to the value f(7,) at the time 7, and hold it at that value until t = 72. Then we again 
abruptly change the forcing by an amount f(72) — f(71) at the time 72, and so forth (see 
Figure 15.2.5). From the linearity of the problem the response after the instant t = Tn 
equals the sum 


y(t) = f(0)a(t) + [f(1) — f(0)]a(t — 71) + [f(72) — Fat- 72) 
t +[f(t) - f(m-1)]a(t — tm). (15.2.55) 


If we write f(t) — f(Tk-1) = Afk and Tk — 74.1 = A7, Equation 15.2.55 becomes 
y(t) = «Y pis) 2t = Arp. (15.2.56) 


Finally, proceeding to the limit as the number n of jumps becomes infinite, in such a manner 
that all jumps and intervals between successive jumps tend to zero, this sum has the limit 


y(t) = of ro a(t — 7) dr. (15.2.57) 


Because the total response of the system equals the weighted sum (the weights being a(t)) 
of the forcing from the initial moment up to the time t, we refer to Equation 15.2.57 as 
the superposition integral, or Duhamel's integral, named after the French mathematical 
physicist Jean-Marie-Constant Duhamel (1797-1872) who first derived it in conjunction 
with heat conduction. 

We can also express Equation 15.2.57 in several different forms. Integration by parts 


yields 
y(t) — O+ f ft» '(t— 7) =F] [rea a(t — 7) dr. (15.2.58) 


8 Duhamel, J.-M.-C., 1833: Mémoire sur la méthode générale relative au mouvement de la chaleur dans 


les corps solides plongés dans des milieux dont la température varie avec le temps. J. École Polytech., 22, 
20-77. 
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Figure 15.2.5: Diagram used in the derivation of Duhamel’s integral. 
e Example 15.2.7 

Suppose that a system has the step response of a(t) = A[1 — e-'"/T], where A and T 
are positive constants. Let us find the response if we force this system by f(t) = kt, where 


k is a constant. 
From the superposition integral, Equation 15.2.57, 


y(t) 204- [ kA[1 — e~-7)/7) dr = kA[t — T(1 — e-*/7)]. (15.2.59) 


| Boundary-value problem 


One of the purposes of this book is the solution of a wide class of nonhomogeneous 
ordinary differential equations of the form 


+ [poy Z| sew- asec, (15.2.60) 
with 
ayy(a) + azy'(a)=0, —— fiy(b) + Boy' (b) = 0. (15.2.61) 


For example, in Chapter 6 we examined the Sturm-Liouville-like equation 


M Lord + [d(z) + Ar(z)p ——f(m, a<z<b, (15.2.62) 


where A is a parameter. Here we wish to develop the Green's function for this class of 
boundary-value problems. 
We begin by determining the Green's function for the equation 


= [tay S| + s(2)g = -&(z - £), (15.2.63) 


subject to yet undetermined boundary conditions. We know that such a function exists for 
the special case p(x) = 1 and s(x) = 0, and we now show that this is almost always true 
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in the general case. Presently we construct Green’s functions by requiring that they satisfy 
the following conditions: 


e g(x|€) satisfies the homogeneous equation f(x) = 0 except at x = €, 


e g(x|€) satisfies certain homogeneous conditions, and 


e g(z|£) is continuous at x = £. 


These homogeneous boundary conditions for a finite interval (a,b) will be 


aigla)  oag'(a|£) — 0, — Big(b|E) + Bag'(b|E) = 0, (15.2.64) 


where g' denotes the x derivative of g(x|£) and neither a nor b equals £. The coefficients 
Qı and o» cannot both be zero; this also holds for 6; and £5. These conditions include the 
commonly encountered Dirichlet, Neumann, and Robin boundary conditions. 

What about the value of g'(z|£) at x = £? Because g(x|£) is a continuous function of 
x, Equation 15.2.63 dictates that there must be a discontinuity in g'(x|£) at x = €. We now 
show that this discontinuity consists of a jump in the value g'(x|£) at x = £. To prove this, 
we begin by integrating Equation 15.2.63 from € — e to € + e, which yields 


+e E+e 
«f s(x)g(x|£) dz = —1. (15.2.65) 


£-e VEe 


Because g(x|£) and s(x) are both continuous at x = £, 


éte 
lim |. s(x)g(x|£) dz = 0. (15.2.66) 


Applying the limit e — 0 to Equation 15.2.65, we have that 


so [4619.19]... T 


where £* and £^ denote points just above and below x = £, respectively. Consequently, 
our last requirement on g(z|£) will be that 


e dg/dx must have a jump discontinuity of magnitude —1/p(£) at x = £. 


Similar conditions hold for higher-order ordinary differential equations.? 


Consider now the region a € x < €. Let yı(x) be a nontrivial solution of the homo- 
geneous differential equation satisfying the boundary condition at x = a; then ayıla) + 
ay, (a) = 0. Because g(x|£) must satisfy the same boundary condition, o1g(a|£) 4- 39 (a|£) 
= 0. Since the set o4, @ is nontrivial, then the Wronskian of y; and g must vanish at x = a 


9 Ince, E. L., 1956: Ordinary Differential Equations. Dover Publications, Inc. See Section 11.1. 
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or y1(a)g'(a|£) — y1(a)g(a|£) = 0. However, for a € x < £, both yi(z) and g(a|€) satisfy 
the same differential equation, the homogeneous one. Therefore, their Wronskian is zero 
at all points and g(z|£) = cıyı(x) for a € x < £, where cı is an arbitrary constant. In 
a similar manner, if the nontrivial function y2(x) satisfies the homogeneous equation and 
the boundary conditions at x = b, then g(x|£) = cz2y2(x) for € < x < b. The continuity 
condition of g and the jump discontinuity of g’ at x = € imply 


cayı (£) — e2ya(£) = 0, ery) (£) = c2y2(£) = 1/p(£). (15.2.68) 


We can solve Equation 15.2.68 for cı and c2 provided the Wronskian of yı and y2 does not 
vanish at x = £, or 


yı (€)yo(€) — y2(€)y1 (£) A 0. (15.2.69) 


In other words, yı (x) must not be a multiple of yo(x). Is this always true? The answer is 
"generally yes." If the homogeneous equation admits no nontrivial solutions satisfying both 
boundary conditions at the same time,!? then yi (x) and y2(x) must be linearly independent. 
On the other hand, if the homogeneous equation possesses a single solution, say yo(x), which 
also satisfies a1 yo(a) + a2yg(a) = 0 and £B1yo(b) + B5yt (b) = 0, then y (x) will be a multiple 
of yo(a) and so is y2(a). Then they are multiples of each other and their Wronskian vanishes. 
This would occur, for example, if the differential equation is a Sturm-Liouville equation, À 
equals the eigenvalue, and yo(x) is the corresponding eigenfunction. No Green's function 
exists in this case. 


e Example 15.2.8 


Consider the problem of finding the Green's function for g” +k?g = —ó(rz—£), 0< x < 
L, subject to the boundary conditions g(0|£) = g(L|£) = 0 with k z 0. The corresponding 
homogeneous equation is y" + k?y = 0. Consequently, g(z|£) = c1yi(x) = cisin(kz) for 
0 € x € €, while g(x|£) = coyo(x) = co sin|k(L — x)] for £ € x < L. 

Let us compute the Wronskian. For our particular problem, 


W(x) = yi (x)yo() — yi (x)y2(x) (15.2.70) 
= —ksin(kz) cos[k(L — x)] — k cos(kz) sin[k(L — x)] (15.2.71) 
= —ksin[k(z + L — z)] = —k sin(k L), (15.2.72) 


and W(£) = —ksin(kL). Therefore, the Green's function will exist as long as kL 4 nz. If 
kL = nz, yi(x) and ys(x) are linearly dependent with yo(x) = ca sin(naz/L), the solution 
to the regular Sturm-Liouville problem y" + Ay = 0, and y(0) = y(L) = 0. 


Let us now proceed to find g(x|£) when it does exist. The system, Equation 15.2.68, 
has the unique solution 


m vle) n = o ue 
where W (£) is the Wronskian of yı(x) and y2(x) at x = €. Therefore, 
gale) = BC (15.2.74) 


»(£)W(£) 


10 In the theory of differential equations, this system would be called incompatible: one that admits no 
solution, save y = 0, which is also continuous for all x in the interval (a,b) and satisfies the homogeneous 
boundary conditions. 
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Figure 15.2.6: The Green's function, Equation 15.2.75, divided by L, as functions of x and € when 
kL = 10. 


Clearly g(a|€) is symmetric in z and £. It is also unique. The proof of the uniqueness is as 
follows: We can always choose a different y1(x), but it will be a multiple of the “old” yi (a), 
and the Wronskian will be multiplied by the same factor, leaving g(r|£) the same. This is 
also true if we modify y2(x) in a similar manner. 


e Example 15.2.9 


Let us find the Green's function for g” + k?g = —ó(z — £), 0 < x < L, subject to 
the boundary conditions g(0|€) = g(L|£) = 0. As we showed in the previous example, 
yi(x) = eisin(kx), yo(x) = cosin[k(L — z)], and W(£) = —ksin(kL). Substituting into 
Equation 15.2.74, we have that 


sin(kzz) sin|k(L — z>)] 


g(z|£) = k sin(kL) , (15.2.75) 


where z< = min(x, £) and x> = max(x,£). Figure 15.2.6 illustrates Equation 15.2.75. O 


So far, we showed that the Green’s function for Equation 15.2.63 exists, is symmetric, 
and enjoys certain properties (see the material in the boxes after Equation 15.2.63 and 
Equation 15.2.67). But how does this help us solve Equation 15.2.63? We now prove that 


b 
y(z) = f g(olé) f (6) de (15.2.76) 


is the solution to the nonhomogeneous differential equation, Equation 15.2.63, and the 
homogeneous boundary conditions, Equation 15.2.64. 

We begin by noting that in Equation 15.2.76 x is a parameter while £ is the dummy 
variable. As we perform the integration, we must switch from the form for g(z|£) for £ € x 
to the second form for £ > x when £ equals x; thus, 


x b 
yz) = T I(EN f(€) de + ri slö FE de. (15.2.77) 
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Differentiation yields 


E f selon ac [ EO po ac + geere (15.2.78) 
and 
d 
2 [ aeiortoac- f PEO rey ac - sero. (15.2.79) 
Because g(x|£) is continuous everywhere, we have that g(x|z*) = g(x|v- ) so that 
dy — f” dg(a\é) ? dg(z|£) 
ea f "Loue T nos. (15.2.80) 


Differentiating once more gives 


P f COC sg ag BEED ja). [P989 pe ag — WEE) gy, 
(15.2.81) 


The second and fourth terms on the right side of Equation 15.2.81 will not cancel in this 
case; on the contrary, 


dglele-) _ dolele*) __ 1 
dx dx p(x) 


(15.2.82) 


To show this, we note that the term dg(v|x-)/dx denotes a differentiation of g(x|£) with 
respect to z using the x > € form and then letting £ — x. Thus, 


di tz POWE pG)WG)' 


dg(r|z ) im EME) — vom) (2) (15.2.83) 


while for dg(x|z*)/dx we use the x < € form or 


dide. o citet. Owe 
dr — OWE) playa) ea 


Upon introducing these results into the differential equation 


pa) + p) + (ay = —f(a), (15.2.85) 

we have 
[ iov ele) eva Glo) + stato fO de (15.2.86) 
+ S DOE + oa (ae) + stroll a - o) 22 = — reo. 

Because 


P(x) g" (|£) + p'(x)g' (v|£) + s(x) g(x|g) = 0, (15.2.87) 
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except for r — £, Equation 15.2.86, and thus Equation 15.2.63, is satisfied. Although 
Equation 15.2.87 does not hold at the point x = £, the results are still valid because that 
one point does not affect the values of the integrals. As for the boundary conditions, 


b b 
ya)= f «erede — ve - [ SO) noa. (15.2.88) 


and a,y(a) + agy’(a) = 0 from Equation 15.2.64. A similar proof holds for x = b. 
Finally, let us consider the solution for the nonhomogeneous boundary conditions 
ayla) + agy’(a) = a, and 8B1y(b) + B3y'(b) = B. The solution in this case is 


x. aya(z) Bux) a d 2 
y(x) = NODE ERES + ENCES BR" 1 g(x|£) f (£) dé. (15.2.89) 


A quick check shows that Equation 15.2.89 satisfies the differential equation and both 
nonhomogeneous boundary conditions. 


| Eigenfunction expansion 


We just showed how Green’s functions can be used to solve the nonhomogeneous linear 
differential equation. The next question is how do you find the Green’s function? Here we 
present the most common method: series expansion. This is not surprising given its success 
in solving the Sturm-Liouville problem. 

Consider the nonhomogeneous problem 


y” =- f(x), with y(0) = y(L) — 0. (15.2.90) 

The Green's function g(r|£) must therefore satisfy 
g'-— —ó(r—£), with g(0|£) = g(L|£) = 0. (15.2.91) 
Because g(r|£) vanishes at the ends of the interval (0, L), this suggests that it can be 


expanded in a series of suitably chosen orthogonal functions such as, for instance, the 
Fourier sine series 


g(z|£) = X G.(&) sin(77) , (15.2.92) 
nci 
where the expansion coefficients G, are dependent on the parameter €. Although we chose 


the orthogonal set of functions sin(nzx/L), we could have used other orthogonal functions 
as long as they vanish at the endpoints. 


Because E 
"(le - Y (- 
n=1 


2:2 
= a) (15.2.93) 


) Gate) sin( 57 
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and 
5(@ —£) = Y A(&) sin( 5), (15.2.94) 
n-l 
where 
L NTL NT 
A, (é) = " 5(a — £) sin(^77) dx = (C ; (15.2.95) 


we have that 
(nen? . (NTT 2. f(nz£N . (nae 
— vb (E) Gn (ê) sin(“**) = "OE p» sin( “5 sin( 2) ; (15.2.96) 
n=1 


after substituting Equation 15.2.93 through Equation 15.2.95 into the differential equation, 
Equation 15.2.91. Since Equation 15.2.96 must hold for any arbitrary zx, 


nen? 2 . (mn£ 
Thus, the Green's function is 
bs 
sel = —; 2 6 sin(777) (15.2.98) 


How might we use Equation 15.2.98? We can use this series to construct the solution 
of the nonhomogeneous equation, Equation 15.2.90, via the formula 


L 
y) = | alle) F(E) d£. (15.2.99) 
This leads to 
oo L 
ylz) = E E sin( pi s(e)sin( 2) d£, (15.2.100) 
n=1 0 
or " 
Pre Qn . (mm 
ue) = a sin( =) (15.2.101) 


where an are the Fourier sine coefficients of f(x). 
e Example 15.2.10 
Consider now the more complicated boundary-value problem 
y'--k*y--f(z) with y(0)=y(L)=0. (15.2.102) 
The Green's function g(z|£) must now satisfy 


g” + k?°g = —ó(z—£), and g(0|£f) = g(L|£) = 0. (15.2.103) 
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Once again, we use the Fourier sine expansion 


oo 


g(zl£) = 6, (£)sim(* 7) (15.2.104) 


Direct substitution of Equation 15.2.104 and Equation 15.2.94 into Equation 15.2.103 and 
grouping by corresponding harmonics yields 


nên? 2 2 , (mn£ 
" 2 sin(nnt/L) 
sin(nz 
n = 15.2.106 
C= T aa (15.2.106) 
Thus, the Green’s function is 
sin(n7&/L) m) 
sel = 7 a "OPERE (15.2.107) 


Examining Equation 15.2.107 more closely, we note that it enjoys the symmetry property 
that g(a|) = g(&|z). 


e Example 15.2.11 


Let us find the series expansion for the Green's function for 


2 
ag’ +g + (rz » )s =-6(e-£), O<ae<L, (15.2.108) 


where m > 0 and is an integer. The boundary conditions are 


lim lg(x|£)| « oo, and g(L|£) — O. (15.2.109) 


To find this series, consider the Fourier-Bessel series 
g(z|£) = XO Gn (€) Jm(knm2), (15.2.110) 


where knm is the nth root of Jm(knm L) = 0. This series enjoys the advantage that it satisfies 
the boundary conditions and we will not have to introduce any homogeneous solutions so 
that g(z|£) satisfies the boundary conditions. 

Substituting Equation 15.2.110 into Equation 15.2.108 after we divide by x and using 
the Fourier-Bessel expansion for the delta function, we have that 


(k? — Kiam) Gn(€) = Poca (ban DE E Fen) (15.2.111) 


so that 


g(x|£) = 2» Jm Ewa t 2m (15.2.112) 
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Figure 15.2.7: The Green's function, Equation 15.2.112, as functions of z/L and £/L when kL = 10 and 
m — 1. 


Equation 15.2.112 is plotted in Figure 15.2.7. oO 


We summarize the expansion technique as follows: Suppose that we want to solve the 
differential equation 
Ly(x) = — f (2), (15.2.113) 


with some condition By(x) — 0 along the boundary, where L now denotes the Sturm- 
Liouville differential operator 


L= 4 Lord + [a(z) + Ar(z)], (15.2.114) 


and B is the boundary condition operator 


d 
Oi ORIS at z =a, 
B= s (15.2.115) 


d 
£1 + Ba—, at x = b. 
dx 
We begin by seeking a Green's function g(z|£), which satisfies 
Lg=—6(x—£), Bg=0. (15.2.116) 


To find the Green's function, we utilize the set of eigenfunctions y,,(x) associated with the 
regular Sturm-Liouville problem 


= po% 


as |\P Ge | + [a(z) + Anr(z)]¥n = 0, (15.2.117) 
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where q, (x) satisfies the same boundary conditions as y(x). If g exists and if the set {pn} 
is complete, then g(x|£) can be represented by the series 


9(2l€) = X Gs (Os (2). (15.2.118) 


Applying L to Equation 15.2.118, 


Lg(v|£) = >) G«(£)Llen(2)] = 9 Ga- Au)r(z)en(z) = —0(z — £), (15.2.119) 


if A does not equal any of the eigenvalues A,,. Multiplying both sides of Equation 15.2.119 
by %m(a) and integrating over x, 


oo b 
> Gnl(QOA — an) f r(z)en(z)em(r) dz = —en(£). (15.2.120) 
n=1 9 
If the eigenfunctions are orthonormal, 
b 
|. fd, n -—m, = Yn(§) 
f r(z)egs(x)oq (x) dz = T lah. and G (£) = je (15.2.121) 
'This leads directly to the bilinear formula: 
Gn (E) n(x 
gaje) = V^ Palslen(@) (15.2.122) 


which permits us to write the Green’s function at once if the eigenvalues and eigenfunctions 
of L are known. 


Problems 
For the following initial-value problems, find the transfer function, impulse response, Green's 


function, and step response. Assume that all of the necessary initial conditions are zero and 
T » 0. If you have MATLAB’s control toolbox, use MATLAB to check your work. 


1. g' +kg = ó(t— 7) 2. g" —2g' — 3g = ô(t — 7) 
3. g! +49’ +39 = ó(t— 7) 4. g” — 2g' + 5g = ó(t— 7) 
5. g! — 3g! +29 = ó(t — 7) 6. g” -- Ag' + Ag = ó(t — T) 
7. g” — 9g = ó(t— T) 8. g" +g —(t— T) 


9. g" — g' = ó(t— T) 
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Find the Green's function and the corresponding bilinear expansion using eigenfunctions 
from the regular Sturm-Liouville problem Y” + k2,, = 0 for 


g” = —ô(x — £), O0<2,€<L, 


which satisfy the following boundary conditions: 


11. g(0|£) — g'(0]8) = 0, g(LI§) — g' (LIE) = 0, 
12. g(0|§) — g'(0]£) = 0, g(L|&) + g'(LI§) = 0. 


Find the Green's function!! and the corresponding bilinear expansion using eigenfunctions 
from the regular Sturm-Liouville problem Y” + k2y, = 0 for 


g” — k?g = —ó(x — £), 0<2,€<L, 


which satisfy the following boundary conditions: 


13. g(0|£) = 0, g(L|£) — 0, 
14. g'(0]&) = 0, g'(L|&) = 0, 
15. g(0|£) = 0, g(L|£) + g' (LIE) = 0, 
16. g(0|£) = 0, g(LI§) — g'(LI£) = 0, 
17. a g(0]8) + g'(0&) = 0, g' (L|&) = 0, 
18. g(0|£) + g'(0]8) = 0, g(L|£) — g' (LIS) = 0. 


15.3 JOINT TRANSFORM METHOD 


In the previous section an important method for finding Green’s function involved either 
Laplace or Fourier transforms. In the following sections we wish to find Green’s functions 
for partial differential equations. Again transform methods play an important role. We will 
always use the Laplace transform to eliminate the temporal dependence. However, for the 
spatial dimension we will use either a Fourier series or Fourier transform. Our choice will 
be dictated by the domain: If it reaches to infinity, then we will employ Fourier transforms. 
On the other hand, a domain of finite length calls for an eigenfunction expansion. The 
following two examples illustrate our solution technique for domains of infinite and finite 
extent. 


11 Problem 18 was used by Chakrabarti, A., and T. Sahoo, 1996: Reflection of water waves by a nearly 
vertical porous wall. J. Austral. Math. Soc., Ser. B, 37, 411—429. 
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e Example 15.3.1: One-dimensional Klein-Gordon equation 
The Klein-Gordon equation is a form of the wave equation that arose in particle physics 


as the relativistic scalar wave equation describing particles with nonzero rest mass. In this 
example, we find its Green’s function when there is only one spatial dimension: 


ur T (52 + ag) m e tbc (15.3.1) 


where —oo < x, < oo, 0 < t, T, c is a real, positive constant (the wave speed), and a is a 
real, nonnegative constant. The corresponding boundary conditions are 


| im g(x, tl, T) > 0, (15.3.2) 
zl CO 
and the initial conditions are 

g(x, O|€, 7) = g(a, O|€,7) = 0. (15.3.3) 


We begin by taking the Laplace transform of Equation 15.3.1 and find that 


2 2 2 
E ( rd Je- S(x — £e“. (15.3.4) 
dx? 2 


C 


Applying Fourier transforms to Equation 15.3.4, we obtain 


e œ phe) c © cos[k(z — £)] 
= —e *7 k= d ————— dk. (15.3. 
G(a, s|£, 7) olt J. KERETE d z? n ggg ke ( 5) 


Inverting the Laplace transform and employing the second shifting theorem, 


g(z,t|£, T) = “ult - T) l ERI A re ZO] ak. (15.3.6) 


Equation 15.3.6 represents a superposition of homogeneous solutions (normal modes) to 
Equation 15.3.1. An intriguing aspect of Equation 15.3.6 is that this solution occurs every- 
where after t > 7. If |z — ¿| > c(t — 7), these wave solutions destructively interfere so that 
we have zero there while they constructively interfere at those times and places where the 
physical waves are present. 

Applying integral tables to Equation 15.3.6, the final result is 


g,tl&,7) = SJolaV (t= 7)? = @— 67/2 | Het- -e-e | (45.3.7) 


This Green’s function is illustrated in Figure 15.3.1. Thus, the Green’s function for the 
Klein-Gordon equation yields waves that propagate to the right and left from z = 0 with 
the wave front located at x = cct. At a given point, after the passage of the wave front, 
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9(x,tl,t)/c 


10 a(x-$)/c 


Figure 15.3.1: The free-space Green's function g(x, t|€,7)/c for the one-dimensional Klein-Gordon equa- 
tion at different distances a(x — £)/c and times a(t — T). 


the solution vibrates with an ever-decreasing amplitude and at a frequency that approaches 
a - the so-called cutoff frequency - at t > oo. 

Why is a called a cutoff frequency? From Equation 15.3.5, we see that, although the 
spectral representation includes all of the wavenumbers k running from —oo to oo, the 
frequency w = y c?k? + a? is restricted to the range w > a from Equation 15.3.6. Thus, a is 
the lowest possible frequency that a wave solution to the Klein-Gordon equation may have 
for a real value of k. 


e Example 15.3.2: One-dimensional wave equation on the interval 0 < x < L 


One of the classic problems of mathematical physics involves finding the displacement 
of a taut string between two supports when an external force is applied. The governing 
equation is 

Ou 0u 
0m Ox? 
where c is the constant phase speed. 

In this example, we find the Green's function for this problem by considering the 
following problem: 


= f(xz,t), Oc«rc«L, 0<t, (15.3.8) 


Og 229 
Ot? Ox? 


=d0(a—€)d(t-—7), O<a,€<L, 0<t,T, (15.3.9) 
with the boundary conditions 
o19(0, tl, 7) CE £193 (0, t|£, T) = 0, 0< t, (15.3.10) 


and 


a39(L,t|&, T) + Bags (L,t, T) 20, 0<t, (15.3.11) 


and the initial conditions 


g(z,0]£,7) = q(2,0|6,7) 20, O<a<L. (15.3.12) 
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We start by taking the Laplace transform of Equation 15.3.9 and find that 


dG s ó(r—£) sr 
di ac = x ng 0<a<L, 15.3.13) 
with 
o3 G(0, s|€,7) + &1G' (0, s|£, T) = 0, 15.3.14) 
and 
a2G(L, s|£, T) + B3G' (L, s|£, T) = 0. 15.3.15) 


Problems similar to Equation 15.3.13 through Equation 15.3.15 were considered in the 
previous section. There, solutions were developed in terms of an eigenfunction expansion. 
Applying the same technique here, 


—sT 3 Pn (£)n (x) 


G(a, sé, 7) =e ee 


(15.3.16) 


n=1 


where o, (x) is the nth orthonormal eigenfunction to the regular Sturm-Liouville problem 
p (x) +k y(z)=0, O<ae<L, (15.3.17) 
subject to the boundary conditions 


o1 (0) + iy’ (0) = 0, (15.3.18) 


and 


azp(L) + Bav! (L) = 0. (15.3.19) 


Taking the inverse of Equation 15.3.16, we have that the Green's function is 


kne 


g(x, t|, 7) = D» £n (&)en (x) pinleri = v) | H(t — 7). (15.3.20) 


Let us illustrate our results to find the Green's function for 


O?g — 307g 
a2 Cap om —£)o(t.—), (15.3.21) 
with the boundary conditions 
g(0.t&,7) = g(b,t£,7) 20, — Oct, (15.3.22) 


and the initial conditions 


g(x, OJE, T) = g(x, OE, 7) = 0, 0 « x « L. (15.3.23) 
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c 9(x, t,t) 


1:5 
1 
o(t-t)/L 


Figure 15.3.2: The Green's function cg(z,t|£, T) given by Equation 15.3.26 for the one-dimensional wave 
equation over the interval 0 < x < L as a function of location z/L and time c(t — 7T)/L with £/L = 0.2. 
The boundary conditions are g(0,t|£, 7) = g(L,t|£, 7) = 0. 


For this example, the Sturm-Liouville problem is 
g" (24) +k’y(z)=0, 0«z«L, (15.3.24) 


with the boundary conditions y(0) = y(L) = 0. The nth orthonormal eigenfunction for 
this problem is 


Prlz) = : sin( Z2) (15.3.25) 


Consequently, from Equation 15.3.20, the Green’s function is 


g(a, t|f,7) = Z Dx 1 sin( 5) sin( 2) sin] St 2 | H(t—r). (15.3.26) 


See Figure 15.3.2. 


15.4 WAVE EQUATION 


In Section 15.2, we showed how Green's functions could be used to solve initial- and 
boundary-value problems involving ordinary differential equations. When we approach par- 
tial differential equations, similar considerations hold, although the complexity increases. 
In the next three sections, we work through the classic groupings of the wave, heat, and 
Helmholtz's equations in one spatial dimension. All of these results can be generalized to 
three dimensions. 

Of these three groups, we start with the wave equation 


= —q(z,t), (15.4.1) 
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where t denotes time, x is the position, c is the phase velocity of the wave, and q(x, t) is 
the source density. In addition to Equation 15.4.1 it is necessary to state boundary and 
initial conditions to obtain a unique solution. The condition on the boundary can be either 
Dirichlet or Neumann or a linear combination of both (Robin condition). The conditions in 
time must be Cauchy - that is, we must specify the value of u(z,t) and its time derivative 
at t = to for each point of the region under consideration. 

We begin by proving that we can express the solution to Equation 15.4.1 in terms of 
boundary conditions, initial conditions, and the Green’s function, which is found by solving 


Og 18g 
Ox? c8? 
where € denotes the position of a source that is excited at t = 7. Equation 15.4.2 expresses 
the effect of an impulse as it propagates from x = € as time increases from t = 7. For 
t < T, causality requires that g(z,t|£, T) = g(x, t|, T) = 0 if the impulse is the sole source 
of the disturbance. We also require that g satisfies the homogeneous form of the boundary 
condition satisfied by u. 
Our derivation starts with the equations 


Outer) | T307«(£;7) 
O£? c? Or? 


= —ô(x €)5(t— 7), (15.4.2) 


= —-q(6,7), (15.4.3) 


and 
O?g(z,t|£, T) | 107g(m,tl£, T) _ 
Qé” c Or? i 
where we obtain Equation 15.4.4 from a combination of Equation 15.4.2 plus reciprocity, 
namely g(z,t|£,7) = g(£, —7|x, —t). Next we multiply Equation 15.4.3 by g(z,t|£, 7) and 
Equation 15.4.4 by u(£, T) and subtract. Integrating over € from a to b, where a and b are 
the endpoints of the spatial domain, and over T from 0 to tt, where t^ denotes a time 
slightly later than t so that we avoid ending the integration exactly at the peak of the delta 
function, we obtain 


as aule, 7) agla, tlg, 7) 
| f aote RD -uens 


O?g(a: T oul, T 
+ 5 ie 0*5 ) gete n er d j dé dr 


ó(x — €)d(t — 7), (15.4.4) 


tr pb 
= u(x,t) af Í q(£,7) g(a, t|E, T) d£ dr. (15.4.5) 


Because 


0?u T. ? T z 
g(a.tlé, 7) = ) aye, o) Gs te n) 


a£? 
oO ðulE, T) ð Og(z,t|£, T) 
= 8€ [pst n) 8€ | = 8€ jue AE) j (15.4.6) 
and 
9?u(&, T) O?g(z,t|E, 7) 


glm tE 7) SS — u( 0) 5 
= o [pede nEn] u 2 ute, 7) d A (15.4.7) 


Or Or Or 
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we find that 
i E CON Been. . 
[| pee ED ug] a 
m = | je nE quae EDI ag 
T=0 
+f y, q(£, T) g(z.t|£, 7) d£ dr = u(z,t). (15.4.8) 


The integrand in the first integral is specified by the boundary conditions. In the second 
integral, the integrand vanishes at t = t* from the initial conditions on g(z,t|£, T). The 
limit at t = 0 is determined by the initial conditions. Hence, 


u(x, t) FE. (£, T)g(z, t|£, 7) dé dr 


+f Je: ie KS D ag een ae 


-4 f [u (£, 0) 2g 16.0) D a(x, tle, 0) outs.) ? | ag (15.4.9) 


Equation 15.4.9 gives the complete solution of the nonhomogeneous problem. The first 
two integrals on the right side of this equation represent the effect of the source and the 
boundary conditions, respectively. The last term involves the initial conditions; it can be 
interpreted as asking what sort of source is needed so that the function u(x,t) starts in the 
desired manner. 


e Example 15.4.1 


Let us apply the Green's function technique to solve 


u Ou 


az me flre O«t (15.4.10) 


subject to the boundary conditions u(0,t) = 0 and u(1,t) = t, 0 < t, and the initial 
conditions u(z,0) = z and u,(z,0) 20, 0 « z « 1. 
Because there is no source term and c — 1, Equation 15.4.9 becomes 


u(z, t) -f (g(a, t|1, T)ug(1, T) — u(1, T)ge(x,t|1,T)] dr 
3 (g(x, t|0, T)ug(0, T) — u(0, T)ge(z, t|0, 7)] dr 
- f tué. 0)g- (2,41€, 0) — gle, t6. 0)ue(6,0)] d (15.4.11) 


Therefore we must first compute the Green's function for this problem. However, we have 
already done this in Example 15.3.2 and it is given by Equation 15.3.26 with c — L — 1. 
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Next, we note that g(a, t|1,7) = g(x,t|0,7) = 0 and u(0, 7) = u-(€,0) = 0. Consequently, 
Equation 15.4.11 reduces to only two nonvanishing integrals: 


«(2 =- f u(1,T)ge(x,t|l, T) a- | u(€, 0)gr (x, t|£, 0) d£. (15.4.12) 


If we now substitute for g(x,t|£, 7) and reverse the order of integration and summation, 


if u(1, 7) ge (a, t|1,7) dr = 2X (- 


n=1 


1)” mo 7T sin[nz (t — 7)|] dr (15.4.13) 
— 2t E " sin (nz) f sin[nz(t — 7)|] d(t — 7) 


—2 Yu " sin(nzx) | (t — r) sin[nz(t — 7)] d(t —7) (15.4.14) 


x t Pae t 
= —2t Yo " sin(nzx) ces) (15.4.15) 
nu T 
sin[nz (t — 7)] cos[nz (t — 7)] | |Ý 
—2 ye " sin(nzz) { X (t — 7) S l 
De sin(nrzx) m Dus La sin(nzx) sin(nzt), 
n=1 E n=1 
(15.4.16) 
and 
1 
ji u(£,0)g- (x, t|£,0) dé = Ys nz) cos(nzt) f € sin(n£) dé 15.4.17) 
0 
oo i 
=-2 5 sin(nrx)cos(nrt) ae d ud 15.4.18) 
mei n?g no ò 
2 4 (-1)" 
=- CU sin(nzx) cos(nat). 15.4.19) 
Tl n 
Substituting Equation 15.4.16 and Equation 15.4.19 into Equation 15.4.12, we finally obtain 


2t oo 
77m e 
2 oo 
als 
The first summation in Equation 15.4.20 is the Fourier sine expansion for f(x) = x over the 
interval 0 < x < 1. Indeed, a quick check shows that the particular solution up(x,t) = zt 


satisfies the partial differential equation and boundary conditions. 'The remaining two 
summations are necessary so that u(z,0) = x and u;(x,0) = 0. 


sin( nca) Lv 2. 


sin(nzx) cos(nrt) 


sin(nzzx) sin(nzt). (15.4.20) 


'To apply Equation 15.4.9 to other problems, we must now find the Green's function 
for a specific domain. In the following examples we illustrate how this is done using the 
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joint transform method introduced in the previous section. Note that both examples given 
there were for the wave equation. 


e Example 15.4.2: One-dimensional wave equation in an unlimited domain 


The simplest possible example of Green’s functions for the wave equation is the one- 
dimensional vibrating string problem.!? In this problem the Green's function is given by 
the equation 


-2e EM slx — &)ó(t — T), (15.4.21) 


where —oo < x, < oo, and 0 < t,v. If the initial conditions equal zero, the Laplace 
transform of Equation 15.4.21 is 


Gs? 
dz? œ 


G = —ó(zr—£)e 7", (15.4.22) 


where G(x, s|£, T) is the Laplace transform of g(a, t|£, r). To solve Equation 15.4.22 we take 
its Fourier transform and obtain the algebraic equation 


exp(—ik€ — sr) 


G(k, s|€,T) zm k2 E s2/c 


(15.4.23) 


Having found the joint Laplace-Fourier transform of g(x, t|£, T), we must work our way 
back to the Green's function. From the definition of the Fourier transform, we have that 


e 37 oo gik(x—&) 
G(z,s|£,7) = 25 ri Pisa dk. (15.4.24) 


To evaluate the Fourier-type integral, Equation 15.4.24, we apply the residue theorem. See 
Section 11.4. Performing the calculation, 
c exp(—st — s|x — €|/c) 


G(x, s|€,7) = (15.4.25) 


Finally, taking the inverse Laplace transform of Equation 15.4.25, 


amter) = 2H (t7 7 Is - €/9), (15.4.26) 
Or 
c 
glz, tig, T) = 5H [e(t - r) + (s - B) H [e(t — 7) - (x - 9)I- (15.4.27) 
We can use Equation 15.4.26 and the method of images to obtain the Green's function 
for 
Og 18g 
o: 2 OR = 0(a — €)d(t — 7), 0 < zx, t, E, T, (15.4.28) 


1? See also Graff, K. F., 1991: Wave Motion in Elastic Solids. Dover Publications, Inc., Section 1.1.8. 
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e 
T 


g(x.tl&, c yc 


Figure 15.4.1: The Green's function g(z,t|£,T)/c given by Equation 15.4.30 for the one-dimensional 
wave equation for x > 0 at different distances x/ and times c(t — T) subject to the boundary condition 


g(0, t|, T) = 0. 


subject to the boundary condition g(0, t|£, 7) = 0. 

We begin by noting that the free-space Green's function,!? Equation 15.4.26, is the par- 
ticular solution to Equation 15.4.28. Therefore, we need only find a homogeneous solution 
f(a, t|£, 7) so that 


gleter) = SH (t T — |x — &|/e) + f(x, t£, 7) (15.4.29) 


satisfies the boundary condition at x = 0. 

To find f(x,t|£, 7), let us introduce a source at x = —€ at t = r. The corresponding 
free-space Green's function is H(t — r — |x + £|/c). If, along the boundary x = 0 for any 
time t, this Green's function destructively interferes with the free-space Green's function 
associated with the source at x = £, then we have our solution. This will occur if our new 
source has a negative sign, resulting in the combined Green's function 


a(@, tlé,7) = 5 UH r- |z — &/c) - H(t — 7 — |z + €|/o)]. (15.4.30) 


See Figure 15.4.1. Because Equation 15.4.30 satisfies the boundary condition, we need no 
further sources. 

In a similar manner, we can use Equation 15.4.26 and the method of images to find 
the Green's function for 


g 1 Og 
ag ggg 9-99(-7T) 0< x,t, gT, (15.4.31) 


subject to the boundary condition g,(0,t|€,7) = 0. 


13 Tn electromagnetic theory, a free-space Green's function is the particular solution of the differential 
equation valid over a domain of infinite extent, where the Green's function remains bounded as we approach 
infinity, or satisfies a radiation condition there. 
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We begin by examining the related problem 

Og 18g 

Ox? eot? 

where —oo < x, < oo, and 0 < t, 7. In this particular case, we have chosen an image that 


is the mirror reflection of ó(z — £). This was dictated by the fact that the Green's function 
must be an even function of x along x = 0 for any time t. In line with this argument, 


= ô(x — £)ó(t — T) + ôx + £)ó(t — 7), (15.4.32) 


g(z,tl 7) = $ [F (t -7 1e — 1/0) +H (t— 7 — |e  €/0)]. (15.4.33) 


e Example 15.4.3: Equation of telegraphy 


When the vibrating string problem includes the effect of air resistance, Equation 15.4.21 
becomes 


0?g Og 0?g 
2 2—2 có(x-£)ó(t— 15.4.34 
0t + VH C Ox? C ó(x £) ( T), ( 5 ) 
where —oo < x, < oo, and 0 < t, 7, with the boundary conditions 
| ie g(x,t|£, 7) > 0 (15.4.35) 
z|—oo 
and the initial conditions 
g(x, O|§,7) = g(x, O|€, 7) = 0. (15.4.36) 


Let us find the Green’s function. 
Our analysis begins by introducing an intermediate dependent variable w(z,t|£, T), 
where g(z,t|£, T) = e ?*w(z,t|£, T). Substituting for g(z,t|£, T), we now have 
Pw 2 2 Ow 2 T 
ap r e lx — €)d(t — r)e". (15.4.37) 
Taking the Laplace transform of Equation 15.4.37, we obtain 
ew gs? — 7 
da? 2 


) W = —d(a — €)e™~*". (15.4.38) 


c 
Using Fourier transforms as in Example 15.3.1, the solution to Equation 15.4.38 is 
E E Fa a2 ee - 
W(x, s|€,7) = 2 ins SOM EIS (15.4.39) 
2G - 35/8 


Employing tables to invert the Laplace transform and the second shifting theorem, we have 
that 


wiz.tlé T) = Se Io[aV/ E=- @ —H7/E] Hle(t—7) —|e- El, Q54.40) 


or 


g(a, tlé,7) = 5e? Bo | VE TP (@— P/E | Hiet- 7) - le- 8l. | 05-441) 
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9(x,t6,t)/c 
[9] 
[^] 


0 
0 -10 pese 


Figure 15.4.2: The free-space Green's function g(z, t|£, 7)/c for the one-dimensional equation of telegraphy 
with y = 1 at different distances (x — £)/c and times t — 7T. 


Figure 15.4.2 illustrates Equation 15.4.41 when y = 1. LH 


e Example 15.4.4 


Let us solve!^ the one-dimensional wave equation on an infinite domain: 


u u 
—— aS t -X 15.4.42 
o8 Ow cos(wt)d[a (t)], (15 ) 
subject to the boundary conditions 
| lim u(z,t) => 0, 0<t, (15.4.43) 
z|—o0o 
and initial conditions 
u(x, 0) = uz(x,0) = 0, —oo < T < oo. (15.4.44) 


Here w is a constant and X(t) is some function of time. 

With the given boundary and initial conditions, only the first integral in Equation 
15.4.9 does not vanish. Substituting the source term q(x, t) = cos(wt)d[a — X(t)| and the 
Green's function given by Equation 15.4.26, we have that 


u(x,t) = L. q(£,T)g(z, t£, 7) dé dr (15.4.45) 
-3 f [esteri xenma--ie-epaan (18.4.46) 
= 5 Alt — 7 — |X (T) — x|] cos(wr) dr, (15.4.47) 


14 See Knowles, J. K., 1968: Propagation of one-dimensional waves from a source in random motion. J. 
Acoust. Soc. Am., 43, 948-957. 
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since c — 1. 
Problems 
1. By direct substitution, show! that 
g(x, t|0, 0) = Jo(Vat )H (x) H(t) 


is the free-space Green’s function governed by 


2. Use Equation 15.3.20 to construct the Green’s function for the one-dimensional wave 
equation 
O?g 8g 
0t? | Ox? 
subject to the boundary conditions g(0,t|€,7) = g4(L,t|£,r) = 0, O < t, and the initial 
conditions that g(z,0|£, 7) = g(z,0|£,7) 20, 0 € x « L. 


—ó(r—&)(t—-T7) O<4a2,€E<L, 0<t,T, 


3. Use Equation 15.3.20 to construct the Green's function for the one-dimensional wave 
equation 
O?g 8g 
Ot? | Oz? 
subject to the boundary conditions g4(0,t|£,7) = g4(L,t]£, 7) = 0, 0 < t, and the initial 
conditions that g(z,0|£, 7) = g(z,0|£,7) 20, 0 € x « L. 


—ó(r—&)(t—-7) O<a2,€E<L, 0<t,T, 


4. Use the Green's function given by Equation 15.3.26 to write down the solution to 
the wave equation Ut = u4, on the interval 0 < x < L with the boundary conditions 
u(0,t) = u(L,t) = 0, 0 < t, and the initial conditions u(x,0) = cos(zz/L) and u;(x,0) = 0, 
0 « x « L. 


5. Use the Green's function given by Equation 15.3.26 to write down the solution to the wave 
equation Utt = Ure on the interval 0 < x < L with the boundary conditions u(0,t) = e~* 
and u(L,t) = 0, 0 < t, and the initial conditions u(x,0) = sin(zz/L) and w;(r,0) = 1, 
0 « z « L. 


6. Use the Green's function that you found in Problem 2 to write down the solution to the 
wave equation Utt = wu, on the interval 0 < x < L with the boundary conditions u(0, t) = 0 
and u;(L,t) = 1, 0 < t, and the initial conditions u(r, 0) = z and u(x,0) 21, 0 « x< L. 


7. Use the Green's function that you found in Problem 3 to write down the solution to 
the wave equation Ut = Uz, on the interval 0 < x < L with the boundary conditions 
u5 (0,0) = 1 and u;(L,t) = 0, 0 < t, and the initial conditions u(x,0) = 1 and u;(x,0) = 0, 
0 « z « L. 


15 First proven by Picard, É., 1894: Sur une équation aux dérivées partielles de la théorie de la propa- 
gation de l'électricité. Bull. Soc. Math., 22, 2-8. 
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8. Find the Green’s function!® governed by 


0?g 


ETE + 2 —ó(r—£&)(t—--r) O<a,E<L, 0«tr, 


Ot Ox? 


subject to the boundary conditions 
9s (0, t|£, T) = Gx(L, t]€,7) = 0, 0 < t, 
and the initial conditions 


g(x, O|f, T) = g(x, 0|£, T) = 0, 0 « zx « L. 


Step 1: If the Green's function can be written as the Fourier half-range cosine series 
(0, 1I6,7) = Gor) + 2 Gal (=) 
= — T) += ) cos 
gv, , 0 t7 a L , 


so that it satisfies the boundary conditions, show that G,(t|r) is governed by 


2,2 
G! +2G', + "pr Gn - cos( 27E) ó(t-7), O<n. 


Step 2: Show that 
Go(t|r) = e- 7? sinh(t — 7) H(t — 7), 


and 


nae ) -e-n sinl. (E — 7) 


H(t—T), 1<n, 
B. (t — 7) 


where Bn = J/(nz/L)? — 
Step 3: Combine the results from Steps 1 and 2 and show that 


g(z,t|£ T) = e- 7? sinh(t — 7) H(t — 7)/L 
4-2e-0-? H(t — 7)/L 


oo 


22 sin E E] cos( 575) cos (27) l 


1B Ozisik, M. N., and B. Vick, 1984: Propagation and reflection of thermal waves in a finite medium. 
Int. J. Heat Mass Transfer, 27, 1845-1854; Tang, D.-W., and N. Araki, 1996: Propagation of non- 
Fourier temperature wave in finite medium under laser-pulse heating (in Japanese). Nihon Kikai Gakkai 
Rombumshu (Trans. Japan Soc. Mech. Engrs.), Ser. B, 62, 1136-1141. 
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15.5 HEAT EQUATION 


In this section we present the Green's function!" for the heat equation 


Ou Ou 
nes a 3 = q(z, t), (15.5.1) 
where ¢ denotes time, x is the position, a? is the diffusivity, and q(x, t) is the source den- 
sity. In addition to Equation 15.5.1, boundary conditions must be specified to ensure the 
uniqueness of solution; the most common ones are Dirichlet, Neumann, and Robin (a linear 
combination of the first two). An initial condition u(x,t = to) is also needed. 

'The heat equation differs in many ways from the wave equation and the Green's function 
must, of course, manifest these differences. The most notable one is the asymmetry of the 
heat equation with respect to time. This merely reflects the fact that the heat equation 
differentiates between past and future as entropy continually increases. 

We begin by proving that we can express the solution to Equation 15.5.1 in terms 
of boundary conditions, the initial condition, and the Green's function, which is found by 
solving 

0g | 507g 

ðt | Ox? 
where £ denotes the position of the source. From causality! we know that g(z,t|£, T) = 0 
ift <7. We again require that the Green's function g(z,t|£, 7) satisfies the homogeneous 
form of the boundary condition on u(x,t). For example, if u satisfies a homogeneous or 
nonhomogeneous Dirichlet condition, then the Green's function will satisfy the correspond- 
ing homogeneous Dirichlet condition. Although we will focus on the mathematical aspects 
of the problem, Equation 15.5.2 can be given the physical interpretation of the temperature 
distribution within a medium when a unit of heat is introduced at £ at time 7. 

We now establish that the solution to the nonhomogeneous heat equation can be ex- 
pressed in terms of the Green's function, boundary conditions, and the initial condition. 
We begin with the equations 


29 ET) _ Oul T) 
a£? Or 


= 5(x — £)5(t — 7), (15.5.2) 


= —q(£&,T), (15.5.3) 


and 


O*g(z,tl£, T) | Og(a,t€,7) 

2 , , , , _ 

a Je + ar = —ó(z — )d(t—T). (15.5.4) 
As we did in the previous section, we multiply Equation 15.5.3 by g(a, t\€,7) and Equation 
15.5.4 by u(€,7) and subtract. Integrating over € from a to b, where a and b are the 
endpoints of the spatial domain, and over 7 from 0 to t*, where t* denotes a time slightly 
later than t so that we avoid ending the integration exactly at the peak of the delta function, 


we find 
gi 9?u 7 
ef T fee pa 7) - gs tle.) E ) d£ dr 


17 See also Carslaw, H. S., and J. C. Jaeger, 1959: Conduction of Heat in Solids. Clarendon Press, 
Chapter 14; Beck, J. V., K. D. Cole, A. Haji-Sheikh, and B. Litkouhi, 1992: Heat Conduction Using 
Green's Functions. Hemisphere Publishing Corp., 523 pp.; Ozisik, M. N., 1993: Heat Conduction. John 
Wiley & Sons, Inc., Chapter 6. 


18 The principle stating that an event cannot precede its cause. 
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+f n ute) AED X gt te, ET] agar 


-f eee (15.5.5) 


Applying Equation 15.4.6 and performing the time integration in the second integral, we 


finally obtain 
u(e, t) aie (é, nga, tle, 7) d£ dr 


u z,t£,7)]5- 
+a f Ir (x, t|£, ae a T) u(é,r) ok HE, ) dr 


b 
«f u(€,0)g(2, tE 0) d£, (15.5.6) 


where we used g(z,t|£, t*) = 0. The first two terms in Equation 15.5.6 represent the familiar 
effects of volume sources and boundary conditions, while the third term includes the effects 
of the initial data. 


e Example 15.5.1: One-dimensional heat equation in an unlimited domain 


The Green's function for the one-dimensional heat equation is governed by 


0g 8g 
Ot s Ox? 


= 0(a — €)d(t— T), oo«m,£«oo, 0<t,7, (15.5.7) 


subject to the boundary conditions lim)... g(z,t|£, 7) — 0, and the initial condition 
g(x,0|£, T) = 0. Let us find g(z,t|£, 7). 
We begin by taking the Laplace transform of Equation 15.5.7 and find that 


d^G S ó(x = £) —sT 


Sen ee d (15.5.8) 


Next, we take the Fourier transform of Equation 15.5.8 so that 


e ik6e-5T 


(E +lk sler) = E. (15.5.9) 
where G(k, s|£, 7) is the Fourier transform of G(x, s|£, T) and b? = s/a?. 
To find G(x, s|£, T), we use the inversion integral 
T e 37 oo (r—£&)k a 
(x, sl T) = 5—3 I gg (15.5.10) 


Transforming Equation 15.5.10 into a closed contour via Jordan's lemma, we evaluate it by 
the residue theorem and find that 


ele &lvs/a- sr 


Sa (15.5.11) 


G(x, s|€,7) = 
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From a table of Laplace transforms we finally obtain 


(15.5.12) 


after applying the second shifting theorem. 


The primary use of the fundamental or free-space Green's function!’ is as a particular 
solution to the Green’s function problem. For this reason, it is often called the fundamental 
heat conduction solution. Consequently, we usually must find a homogeneous solution so 
that the sum of the free-space Green’s function plus the homogeneous solution satisfies any 
boundary conditions. The following examples show some commonly employed techniques. 


e Example 15.5.2 


Let us find the Green’s function for the following problem: 


09 2 0?g 
Ot Ox? 


—ó(r—£&)(t—-T) 0c«mx£«oo, 0<t,T, (15.5.13) 


subject to the boundary conditions g(0, t|£, T) = 0, lims 55 g(a, t|£, 7) — 0, and the initial 
condition g(z,0|£,7) = 0. From the boundary condition g(0,t|£, 7) = 0, we deduce that 
g(x,t|£,r) must be an odd function in x over the open interval (—0o0,00). We find this 
Green's function by introducing an image source of —ó(x--£) and resolving Equation 15.5.7 
with the source ó(z — £)ó(t — T) — ó(x + £)ó(t — T). Because Equation 15.5.7 is linear, 
Equation 15.5.12 gives the solution for each delta function and the Green's function for 
Equation 15.5.13 is 


glz, t|, T) = aall H exp EN (15.5.14) 


- E ap] aes) a). (15.5.15) 


In a similar manner, if the boundary condition at zr = 0 changes to g;(0,t|£, 7) = 0, 
then Equation 15.5.14 through Equation 15.5.15 become 


(oof ~ [enl on +en|- 2L (15.5.16) 


4ra?(t — T Aa?(t — T 4a? (t 
a H(t — 7) ve + £? wf 
— Ama?(t — 7) a| Aa? (t — 5 PUER bd i (15.5.17) 


19 In electromagnetic theory, a free-space Green’s function is the particular solution of the differential 
equation valid over a domain of infinite extent, where the Green’s function remains bounded as we approach 
infinity, or satisfies a radiation condition there. 
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Figure 15.5.1: The Green's function, Equation 15.5.17, for the one-dimensional heat equation on the 
semi-infinite domain 0 < x < oo, and 0 € t — T, when the left boundary condition is g4(0,t|£, T) = 0 and 
= 0.5. 


Figure 15.5.1 illustrates Equation 15.5.17 for the special case when £ — 0.5. 
e Example 15.5.3: One-dimensional heat equation on the interval 0 < x < L 


Here we find the Green's function for the one-dimensional heat equation over the in- 
terval 0 « x « L associated with the problem 


2 
oe ce = f), O<2<L, 0<t, (15.5.18) 


where a? is the diffusivity constant. 
To find the Green’s function for this problem, consider the following problem: 


0g 8g — 
a | gzz 7 Ole -Eit — 7), 0< zx, < L, 0 « t,T, (15.5.19) 


with the boundary conditions 


o39(0,t|£. T) + 819,(0,t£, 7) 20, 0«t, (15.5.20) 


and 


a39(L, ti€, T) + Bags (L,t&, 7) 20, 0 «t, (15.5.21) 


and the initial condition 
g(z,0£,7) 20, O<a<L. (15.5.22) 


We begin by taking the Laplace transform of Equation 15.5.19 and find that 


G s d(x — €) 
= a L 15.5.2 
di ac "| E 0<a<L, (15.5.23) 
with 
a1G(0, s|€,7) + 61G"(0, s|£, T) = 0, (15.5.24) 
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and 
a2G(L, s|€,7) + B2G' (L, s£, 7) = 0. (15.5.25) 


Problems similar to Equation 15.5.23 through Equation 15.5.25 were considered in Section 
15.2. Applying this technique of eigenfunction expansions, we have that 


—SsT > Qn(£)en(x) 
s+ ark? 


G(z,s|&, T) =e i (15.5.26) 


n=1 


where n(x) is the nth orthonormal eigenfunction to the regular Sturm-Liouville problem 
p (x) +k?o(x) — 0, 0<a<L, (15.5.27) 

subject to the boundary conditions 
a1p(0) + 81v (0) = 0, (15.5.28) 


and 


agy(L) + Boy" (L) = 0. (15.5.29) 


Taking the inverse of Equation 15.5.26, we have that 


g(x, tlé,7) = V es (£n (s)e- ^" 7? | H(t— 7). (15.5.30) 
n=1 


For example, let us find the Green’s function for the heat equation on a finite domain 


0g 8g 


ap "gi 90-9977, O<@E<L, O<t,7, (15.5.31) 


with the boundary conditions g(0,t|£, 7) = g(L,t|£, 7) = 0, 0 < t, and the initial condition 
g(x, 0|§,7) 20, 0€ x « L. 
'The Sturm-Liouville problem is 


yp" (x) +k’ p(x) = 0, 0<a<L, (15.5.32) 


with the boundary conditions y(0) = (L) = 0. The nth orthonormal eigenfunction to 


Equation 15.5.32 is 
2 
eua) =f 7 sin(777) (15.5.33) 


Substituting Equation 15.5.33 into Equation 15.5.30, we find that 


g(z,t£,7) = 2 P» sin( 5) sin(77) pda H(t-T) (15.5.34) 
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On the other hand, the Green's function for the heat equation on a finite domain 
governed by 


0g 8g 
Ot A Ox? 


with the boundary conditions 


—ó(r—£)(t—-T) 0<2,€<L, 0«tr, (15.5.35) 


9g. (0,t|£, T) 20, g.(L,t]£yr) - hg(L,tl£,o7r) 20, 0<t, (15.5.36) 


and the initial condition g(z,0]£,7) = 0, 0 < x < L, yields the Sturm-Liouville problem 
that we must now solve: 


yp (x) + Ay(x) =0, y'(0)=0, we'(L)-he(L) — 0. (15.5.37) 


The nth orthonormal eigenfunction for Equation 15.5.37 is 


2(k2 + h?) 


where kn is the nth root of k tan(kL) = h. We also used the identity that (k2 +h?) sin? (knh) 
— h?. Substituting Equation 15.5.38 into Equation 15.5.30, we finally obtain 


TR i> (hut + (hL)?] cos(kn€) cos(kn z) eem) NUES 


kn L)? + (hL)? + hL 


e Example 15.5.4 


Let us use Green’s functions to solve the heat equation 


Ou — Ou 


Be = agn O0<a<L, 0<t, (15.5.40) 
subject to the boundary conditions 
u(0,t)=0, u(L,t)=t,  0«t, (15.5.41) 
and the initial condition 
u(z,0) = 0, 0 « x « L. (15.5.42) 


Because there is no source term, Equation 15.5.6 simplifies to 
t 
u(z,t) = a f [g(z, t|D, T)ue(L, T) — u(L, T)ge(z, t|L,7)] dr (15.5.43) 
0 
t Ñ 
-a [ lg tfo T)ue (0,7) — u(0; Dans to] dr f wt 0). t6. 0) a. 
0 0 
'The Green's function that should be used here is the one given by Equation 15.5.34. Further 
simplification occurs by noting that g(x,t|0,7) = g(a,t|L,7) = 0 as well as u(0,7) = 


u(£,0) — 0. Therefore we are left with the single integral 


t 
u(x,t) = -e f u(L,T)ge(x,t|L, 7) dr. (15.5.44) 
0 
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a’u(x,t)/L? 


Figure 15.5.2: The temperature distribution within a bar when the temperature is initially at zero and 
then the ends are held at zero at x = 0 and t at x = L. 


Upon substituting for g(a, t|L, 7) and reversing the order of integration and summation, 


Iqa? 2 t 2,272 
u(x,t) = — = (-1)"nsin(™) f rexp| T(r - 8) dr (15.5.45) 
n=1 0 
2L? (ye. a^n?g? a^n?g?r i 
mE. z sin( L )ew| T2 (T ] ( T2 1) : (15.5.46) 
2L? QA (—1)* . rmmz [o?n?s?t a?n?gz?t 
S n sin( L ) | I 14 ew( T3 ). (15.5.47) 


Figure 15.5.2 illustrates Equation 15.5.47. This solution is equivalent to Equation 8.6.16 
that we found by Duhamel's integral. g 


e Example 15.5.5: Heat equation within a cylinder 


In this example, we find the Green’s function for the heat equation in cylindrical 
coordinates 


ag a (-22) _ &(r— pt — 7) 


0 b, Oct 15.5.48 
Ot rOr\ Or 2nr : Sap en eke ( ) 


subject to the boundary conditions lim,_,9 |g(r, t|p, 7)| < oo, g(b, t|p, 7) = 0, and the initial 
condition g(r, 0|p,7) = 0. 
As usual, we begin by taking the Laplace transform of Equation 15.5.48, or 


2103 OE LEM MET (15.5.49) 


rdrN dr a? 2na?r 
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b? g(r,tip.t) 


0.1 
a’ (t-t)/b* 
0 0 r/b 


Figure 15.5.3: The Green's function, Equation 15.5.54, for the axisymmetric heat equation, Equation 
15.5.48, with a Dirichlet boundary condition at r = b. Here p/b = 0.3 and the graph starts at a?(t— 7)/b? = 
0.001 to avoid the delta function at t — 7 = 0. 


Next we re-express ó(r — p)/r as the Fourier-Bessel expansion 


Cl e Y AnJo(knr/b), (15.5.50) 
n=1 
where kn is the nth root of Jo(k) = 0, and 
2 ? &(r — p) Jo(knp/b) 
An = Jolknr/b)r dr = ———— 15.5.51 
LEE E dd 
so that 
ld / dG E Jo( E (k nr/b). 
= 15.5.52 
2103 ao pe kn) MPO) 
'The solution to Equation 15.5.52 is 
e T > Jo(knp/b)Jo(knr/b) 
= ] 15.5.53 


Taking the inverse of Equation 15.5.53 and applying the second shifting theorem, 


HE- 7) ^ Jol [Pol [D tiep 


15.5.54 
nb? JZ(k,) ( ) 


g(r, tlp, T) = 


n=1 
See Figure 15.5.3. 
If we modify the boundary condition at r = b so that it now reads 
gr (b. tlp, T) + hg(b.t|p, T) = 0, (15.5.55) 
where h > 0, our analysis now leads to 


assises a T) < 3 E E ae -a?k? ary 


where kn are the positive roots of k Ji(k) — hb Jo(k) = 0. If h = 0, we must add 1/(7b?) to 
Equation 15.5.56. 


(15.5.56) 
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Problems 


1. Find the free-space Green's function for the linearized Ginzburg-Landau equation?? 


Og 
Ot 


m Og _ 
Hua — ag baa = O(a £)Óó(t —7T), -oo < x,£ «oo, 0 « t, T, 


with b > 0. 


Step 1: Taking the Laplace transform of the partial differential equation, show that it 
reduces to the ordinary differential equation 
£G dG 


di vx tae sG = —ó(x—£)e ?*. 


Step 2: Using Fourier transforms, show that 


€ 


G n QN Lu M" 
(z,s|£, 7) = On | i 


Or 
ea(t-7) 


g(x, tl€,7) = - H(t — 7) ia e tne cos{k|x — € — v(t — r)]} dk. 


Step 3: Evaluate the second integral and show that 


gat—7) H(t — 7) [x - £ — v(t 7)? 
2,/mb(t — T) exp \ 


g(x, t|£, T) zm 


2. Use Green's functions to show that the solution?! to 


Ou u 


— =a, 0 « z,t, 
Ot Ox? 


subject to the boundary conditions 


u(0,t)=0, lim u(z,t) — 0, 0 « t, 


-+00 


and the initial condition 
u(z,0) = f(x), 0«r«oo, 

is " : 
et? /(4a7t) 


ay Ti 


= = : LT —1? /(4a7t) 
u(z, t) ji f(r) sinh( =z) e dr. 


20 See Deissler, R. J., 1985: Noise-sustained structure, intermittency, and the Ginzburg-Landau equation. 
J. Stat. Phys., 40, 371—395. 


21 See Gilev, S. D., and T. Yu. Mikhailova, 1996: Current wave in shock compression of matter in a 
magnetic field. Tech. Phys., 41, 407-411. 
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3. Use Equation 15.5.30 to construct the Green’s function for the one-dimensional heat 
equation gt — Jax = 6(a — €)d(t — r) for 0 < x < L, 0 < t, with the initial condition that 
g(x, 0,7) = 0, 0 <a < L, and the boundary conditions that g(0,t|€,7) = g.(L,t|£, 7) = 0 
for 0 < t. Assume that L Z m. 


4. Use Equation 15.5.30 to construct the Green's function for the one-dimensional heat 
equation gy — gs; = O(a — £€)ó(t — T) for 0 < x < L, 0 < t, with the initial condition that 
g(x,0|£, T) 20,0 < x < L, and the boundary conditions that g;(0,t|£, T) = g«(L,t|£, T) = 0 
for 0 « t. 


5. Use Equation 15.5.43 and the Green's function given by Equation 15.5.34 to find the 
solution to the heat equation uw; = Use for 0 < x < L, 0 < t, with the initial data u(x, 0) = 1, 
0 « x< L, and the boundary conditions u(0,t) = e * and u(L,t) = 0 when 0 < t. 


6. Use Equation 15.5.43 and the Green's function that you found in Problem 3 to find 
the solution to the heat equation uz = Ugg for 0 < x < L, 0 < t, with the initial data 
u(z,0) — 1, 0 < x < L, and the boundary conditions u(0,t) = sin(t) and u;(L,t) = 0 when 
0 « t. 


7. Use Equation 15.5.43 and the Green's function that you found in Problem 4 to find 
the solution to the heat equation uz = uz, for 0 < x < L, 0 < t, with the initial data 
u(r,0) = 1, 0 < x < L, and the boundary conditions u4(0,t) = 1 and u;(L,t) = 0 when 
0 « t. 


8. Find the Green's function for 


Og 207g yy ER 
ar a2 ta Kg = 9(s - (t - 7), 0czx,£«L, 0<t,7, 


subject to the boundary conditions 
g(0,t&, 7) = g«(L,t&, 7) -0, — 0«t, 


and the initial condition 
g(x, O|€,7) — 0, 0<a<L, 


where a and Kk are real constants. 
15.6 HELMHOLTZ'S EQUATION 


In the previous sections, we sought solutions to the heat and wave equations via Green's 
functions. In this section, we turn to the reduced wave equation 


u Ou 
Ba? T Oy? + Au = —f (x,y). (15.6.1) 


Equation 15.6.1, generally known as Helmholtz’s equation, includes the special case of Pois- 
son's equation when A = 0. Poisson's equation has a special place in the theory of Green's 
functions because George Green (1793-1841) invented his technique for its solution. 
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The reduced wave equation arises during the solution of the harmonically forced wave 
equation”? by separation of variables. In one spatial dimension, the problem is 
u 1 Ou 
Ox? e Ot? 
Equation 15.6.2 occurs, for example, in the mathematical analysis of a stretched string over 
some interval subject to an external, harmonic forcing. Assuming that u(x,t) is bounded 
everywhere, we seek solutions of the form u(x,t) = y(x)e !^'. Upon substituting this 
solution into Equation 15.6.2 we obtain the ordinary differential equation 


y" + koy = -f (x), (15.6.3) 
where k2 = w*/c?. This is an example of the one-dimensional Helmholtz equation. 
Let us now use Green’s functions to solve the Helmholtz equation, Equation 15.6.1, 
where the Green’s function is given by the Helmholtz equation 
ag | O?g 
H àg = —ô ô ' 15.6.4 
3r? ^ Oy? g (x — &)ó(y — n) (15.6.4) 
'The most commonly encountered boundary conditions are 


= —f(z)e t, (15.6.2) 


e the Dirichlet boundary condition, where g vanishes on the boundary, 


e the Neumann boundary condition, where the normal gradient of g vanishes on the bound- 
ary, and 


e the Robin boundary condition, which is the linear combination of the Dirichlet and Neu- 
mann conditions. 


We begin by multiplying Equation 15.6.1 by g(a, y|€,7) and Equation 15.6.4 by u(z,y), 
subtract and integrate over the region a « z « b,c « y « d. We find that 


He -f [fo ae) | age 


2 2 
MN É dose) 2? FE |} asy 


d 
kA 9) g(a, yl£, 0) de dy (15.6.5) 


EF 2L bells) um 2| 7 Flute, yy Aen |} oer 


TI Blots nM) - fate MEME] aay 


+f [^ f(x, y)g(, yl€, n) da dy (15.6.6) 
= [pee AED — uoe En] ay 
«f peu E — us artem a 


+f [^ f(a, y)g(z, y|&, n) dx dy. (15.6.7) 


22 See, for example, Graff, K. F., 1991: Wave Motion in Elastic Solids. Dover Publications, Inc., Section 
1.4: 
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Because (€,7) is an arbitrary point inside the rectangle, we denote it in general by (x,y). 
Furthermore, the variable (x, y) is now merely a dummy integration variable that we now 
denote by (6,5). Upon making these substitutions and using the symmetry condition 
g(x, yl, m) = g(&. nlx, y), we have that 


2 u T €=b 
us y) = f [arsit m? (m ule, n? xn dn 


ae aE . 
° u n= 
«f lors. m? Gum) ule, em " 
a On On = 
d pb 
«f I F(E,mg(a, ylé,m) dé dn. (15.6.8) 


Equation 15.6.8 shows that the solution of Helmholtz’s equation depends upon the sources 
inside the rectangle and values of u(z, y) and (Ou/Ox,0u/Oy) along the boundary. On the 
other hand, we must still find the particular Green's function for a given problem; this 
Green's function depends directly upon the boundary conditions. At this point, we work 
out several special cases. 


1. Nonhomogeneous Helmholtz equation 
and homogeneous Dirichlet boundary conditions 


In this case, let us assume that we can find a Green's function that also satisfies the 
same Dirichlet boundary conditions as u(z,y). Once the Green's function is found, then 
Equation 15.6.8 reduces to 


d b 
u(z,y) = f n FE Male ule, n) d£dn. (15.6.9) 


A possible source of difficulty would be the nonexistence of the Green’s function. From 
our experience in Section 15.2, we know that this will occur if A equals one of the eigenvalues 
of the corresponding homogeneous problem. An example of this occurs in acoustics when 
the Green's function for the Helmholtz equation does not exist at resonance. 


2. Homogeneous Helmholtz equation 
and nonhomogeneous Dirichlet boundary conditions 


In this particular case, f(x,y) = 0. For convenience, let us use the Green's function 
from the previous example so that g(x, y|£, ) = 0 along all of the boundaries. Under these 
conditions, Equation 15.6.8 becomes 


b (x £—b 
us) =~ | ue Ae d£ — [ 99m Wem a (15.6.10) 


EE uA keea 


Consequently, the solution is determined once we compute the normal gradient of the 
Green's function along the boundary. 


3. Nonhomogeneous Helmholtz equation 
and homogeneous Neumann boundary conditions 
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If we require that u(x, y) satisfies the nonhomogeneous Helmholtz equation with homo- 
geneous Neumann boundary conditions, then the governing equations are Equation 15.6.1 
and the boundary conditions uz = 0 along x = a and x = b, and u, = 0 along y = c and 
y =d. Integrating Equation 15.6.1, we have that 


[| ena f [e P] dx 


sa [veo dz dy — - f [ rev drdy. (15.6.11) 


Because the first two integrals in Equation 15.6.11 must vanish in the case of homogeneous 
Neumann boundary conditions, this equation cannot be satisfied if A = 0 unless 


f is f(a, y) dz dy = 0. (15.6.12) 


A physical interpretation of Equation 15.6.12 is as follows: Consider the physical pro- 
cess of steady-state heat conduction within a rectangular region. The temperature u(z, y) 
is given by Poisson's equation 


25 2 
- 4 ss = — f (v, y), (15.6.13) 
where f(x,y) is proportional to the density of the heat sources and sinks. The boundary 
conditions u;(a,y) = uz(b, y) = 0 and u,(z,c) = u,(x,d) = 0 imply that there is no heat 
exchange across the boundary. Consequently, no steady-state temperature distribution can 
exist unless the heat sources are balanced by heat sinks. This balance of heat sources and 
sinks is given by Equation 15.6.12. 

Having provided an overview of how Green's functions can be used to solve Poisson 
and Helmholtz equations, let us now determine several of them for commonly encountered 
domains. 


e Example 15.6.1: Free-space Green's function for the one-dimensional Helmholtz equation 
Let us find the Green's function for the one-dimensional Helmholtz equation 
g + kg = —ó(x — E), —oo < z,£ < oo. (15.6.14) 
If we solve Equation 15.6.14 by piecing together homogeneous solutions, then 
g(x|£) = Ae-**e(*-9 4+ Betho(e—§) | (15.6.15) 


for x < €, while 
g(x|£) = Ce~to@-§) 4 peiko(z-8). (15.6.16) 


for £ « zx. 

Let us examine Equation 15.6.15 more closely. The solution represents two propagating 
waves. Because x < £, the first term is a wave propagating out to infinity, while the second 
term gives a wave propagating in from infinity. This is seen most clearly by including the 
e-?** term into Equation 15.6.15, or 


g(z|£)e- t = Ae" i*o(z—&)— iwt + Bei*o(z-8)- iot. (15.6.17) 
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ly 
a ee (b) Contour for x < & 
d xX, 
/ 1 A, 
] ko ' ko 
eoe > et __e,__ > 
x 1 i E 
Ko ~o ckg | 
v ^ 
(a) Contour for x > & eMe. um 


Figure 15.6.1: Contour used to evaluate Equation 15.6.21. 


Because we have a source only at x = £, solutions that represent waves originating at infinity 
are nonphysical and we must discard them. This requirement that there are only outwardly 
propagating wave solutions is commonly called Sommerfeld's radiation condition.?? Similar 
considerations hold for Equation 15.6.16 and we must take C = 0. 

To evaluate A and D, we use the continuity conditions on the Green's function: 


g(£*|£) = gE lE), and g'(£*|£) — g'(£-|£) = —1, (15.6.18) 
Or 
A=D, and = ikoD +ikyA=—-1. (15.6.19) 
Therefore, 
g(z|£) = meee (15.6.20) 


We can also solve Equation 15.6.14 by Fourier transforms. Assuming that the Fourier 
transform of g(z|£) exists and denoting it by G(k|£), we find that 


G(k = d Aa ug a) 15.6.21 
(RIS) = ga ag an qeu] t . (15.6.21) 
Immediately we see that there is a problem with the singularities lying on the path of 
integration at k = +ko. How do we avoid them? 

There are four possible ways that we might circumvent the singularities. One of them 
is shown in Figure 15.6.1. Applying Jordan's lemma to close the line integral along the real 


?3 Sommerfeld, A., 1912: Die Greensche Funktion der Schwingungsgleichung. Jahresber. Deutschen 
Math.- Vereinung, 21, 309-353. 
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Free-Space Green’s Function for the Poisson and Helmholtz Equations 


Dimension Poisson Equation Helmholtz Equation 
One no solution g(x|£) = oes 
T _ _ nt) _ ina 
wo g(z,vl& n) = — g(v,y|&, m) = 7 Ho (Kor) 


2T 4 


r= (£ -EP + (y-n)? 


Note: For the Helmholtz equation, we have taken the temporal forcing to be e-?^* and ko = w/c. 


axis (as shown in Figure 15.6.1), 


1 ei2(@—&) 
g(a|) = x f. Pak dz. (15.6.22) 


For x < €, 
iz(a—€) j 
] E " — 2  —iko(z— 
while 
iz(r—£) j 
; id $^ iko(z— 
a(n) = i Res [T gs So (15.6.24) 
0 


for x > €. A quick check shows that these solutions agree with Equation 15.6.20. If we try 
the three other possible paths around the singularities, we obtain incorrect solutions. 


e Example 15.6.2: Free-space Green’s function for the two-dimensional Helmholtz equation 


At this point, we have found two forms of the free-space Green’s function for the one- 
dimensional Helmholtz equation. The first form is the analytic solution, Equation 15.6.20, 
while the second is the integral representation, Equation 15.6.21, where the line integration 
along the real axis is shown in Figure 15.6.1. 

In the case of two dimensions, the Green's function?^ for the Helmholtz equation sym- 
metric about the point (£, n) is the solution of the equation 

dg ldg v ó(r) 


where r = „/(x — £)? + (y — n)?. The homogeneous form of Equation 15.6.25 is Bessel's 
differential equation of order zero. Consequently, the general solution in terms of Hankel 
functions is 

g(rlro) = A Hj? (kor) + BH? (kor). (15.6.26) 


24 For an alternative derivation, see Graff, K. F., 1991: Wave Motion in Elastic Solids. Dover Publica- 
tions, Inc., pp. 284-285. 
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Why have we chosen to use Hankel functions rather than Jo(-) and Yo(-)? As we argued 
earlier, solutions to the Helmholtz equation must represent outwardly propagating waves 
(the Sommerfeld radiation condition). If we again assume that the temporal behavior 
is e~*“* and use the asymptotic expressions for Hankel functions, we see that H\ (kor) 
represents outwardly propagating waves and B = 0. 

What is the value of A? Integrating Equation 15.6.26 over a small circle around the 
point r = 0 and taking the limit as the radius of the circle vanishes, A = 7/4 and 


a 


g(r|ro) = 4 H6 (hor). (15.6.27) 


If à real function is needed, then the free-space Green's function equals the Neumann 
function Yo(kor) divided by —4. 


e Example 15.6.3: Free-space Green's function for the two-dimensional Laplace equation 


In this subsection, we find the free-space Green's function for Poisson's equation in two 
dimensions. This Green's function is governed by 


1 0-f Og 18g  ó(r— p)d(0 — 0") 
r Or (54) r2 062 — r VEO) 


If we now choose our coordinate system so that the origin is located at the point source, 
r = y(x — €)? + (y — 9? and p = 0. Multiplying both sides of this simplified Equation 
15.6.28 by rdr d and integrating over a circle of radius e, we obtain —1 on the right side 
from the surface integration over the delta functions. On the left side, 


pu „29 
0 Ór 
The Green's function g(r, 0|0, 0’) = —1n(r)/(27) satisfies Equation 15.6.29. 


To find an alternative form of the free-space Green’s function when the point of exci- 
tation and the origin of the coordinate system do not coincide, we first note that 


do = —1. (15.6.29) 


ré 


1 oo . j 
_ aly — in(0—6') 
(6-0) = Do P : (15.6.30) 
This suggests that the Green's function should be of the form 


oo 


g(r.8lp,0') = Y gu(rlg)e t=. (15.6.31) 


n=—Co 


Substituting Equation 15.6.30 and Equation 15.6.31 into Equation 15.6.29, we obtain the 
ordinary differential equation 


ld 73 n? ó(r — p) 


pot 15.6.32 
r dr dr r2 9 2nr ( ) 
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The homogeneous solution to Equation 15.6.32 is 


-Ja Üsc p, 
g(rlo) = I in, pe Peo. (15.6.33) 
and (r/p) 
| felr/p)*, O<r<p, 
gn(rle) = {i (or, — p&r«oo, dope 
if n z 0. 
At r = p, the gn’s must be continuous, in which case, 
a = bln(p), and c — d. (15.6.35) 
On the other hand, 
CE A 
dgn |" ^ 1 
T OA E EE 15.6.36 
dr | 9- D ( ) 
"m info) 1 1 
n(p 
= — = — — — = —., 15.6.3 
a pcd b Ba? and c=d Ga ( 7) 
Therefore, 
la(s) 1 &&1 /re\” 
0|p, 0) = | 0—0 15.6.38 
geo) = RE eu (E) e-o (15.6.38) 
where ry = max(r, p) and rz = min(r, p). 
We can simplify Equation 15.6.38 by noting that 
e^ p" cos[n(8 — 6") 
Wisp odes] un od <2 ee 15.6.39 
n [1+ p* — 2pcos(8 — 6’)] 2. 5 ( ) 
if |p| < 1. Applying this relationship to Equation 15.6.38, we find that 
1 
g(r,0|p, 0") = ae In [r? + p? — 2rpcos(0 — 6’)] . (15.6.40) 
T 
Note that when p = 0 we recover g(r, 0|0, 6’) = — In(r)/(2m). 
e Example 15.6.4: Two-dimensional Poisson equation over a rectangular domain 
Consider the two-dimensional Poisson equation 
u Ou 
Sb m ; 15.6.41 
Oz? ka Oy? fv, y) ( ) 


This equation arises in equilibrium problems, such as the static deflection of a rectangular 
membrane. In that case, f(x, y) represents the external load per unit area, divided by the 
tension in the membrane. The solution u(x,y) must satisfy certain boundary conditions. 
For the present, let us choose u(0, y) = u(a, y) = 0, and u(z,0) = u(x, b) = 0. 
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To find the Green’s function for Equation 15.6.41 we must solve the partial differential 
equation 

3g | O?g u 

Ox? Oy? — 


d(a—E)d(y—n), O<a,E<a, 0<y,n< b, (15.6.42) 


subject to the boundary conditions 


9(0, ylE, n) = g(a, ylE, n) = g(x, O[€, m) = g(a, IE, n) = 0. (15.6.43) 
From Equation 15.6.9, 
a b 
u = £ dn d£. .6. 
(x,y) f f g(x, y|&, n) f(E, n) dn d£ (15.6.44) 


One approach to finding the Green’s function is to expand it in terms of the eigenfunc- 
tions (x,y) of the differential equation 


Cp Py 
+55 =À 15.6.45 
Ag jy? 9, ( ) 
and the boundary conditions, Equation 15.6.43. The eigenvalues are 
n?n? mèr? 
Anm = uw 5g (15.6.46) 
where n = 1,2,3,..., m = 1,2, 3,..., and the corresponding eigenfunctions are 
Qnm(t, y) = sin( 757) sin( 2) i (15.6.47) 
a 


Therefore, we seek g(x, y|, n) in the form 


g(a, ylE,7) 23 S Anm sin(““* Z) sin n( =) : (15.6.48) 


n=1m=1 


Because the delta functions can be written 


d(a — §)d(y— 7) = E Y Ss (226) sin( 277) sin( 27) sin (772) , (15.6.49) 


n=1m=1 
we find that c3 "E £ 
N T MT 4 . (nt . (MTN 
(= n" | Anm = = 6 sin( 772) l (15.6.50) 


after substituting Equations 15.6.48 and 15.6.49 into Equation 15.6.42, and setting the 
corresponding harmonics equal to each other. Therefore, the bilinear formula for the Green’s 
function of Poisson’s equation is 


sa (EFE) sin (27E) sa (EFA) nC) 


4 oo oo 
g(x, y|&, n) go uiu pui . | (15.6.51) 
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Thus, solutions to Poisson’s equation can now be written as 


-X 3 sin(“™*) sin( 777) (15.6.52) 
=e EUER 2762 ° a b J? Us 
where anm are the Fourier coefficients for the function f(x, y): 
4 qq . [NTEN , (mmy 
Qnm = = f(x,y) sin(“**) sin( 772) dy dx. (15.6.53) 
ab 0 0 a b 


Another form of the Green’s function can be obtained by considering each direction 
separately. To satisfy the boundary conditions along the edges y = 0 and y = b, we write 
the Green’s function as the Fourier series 


g(x, yl, n) sy Gs (x|£) sin( 22) ; (15.6.54) 
mæl 


where the coefficients G,,(r|£) are left as undetermined functions of x, £, and m. Substi- 
tuting this series into the partial differential equation for g, multiplying by 2sin(nay/b)/b, 
and integrating over y, we find that 


Gn nêr? 
dx? b2 


Gn = s sin(27) 5(x — £). (15.6.55) 


This differential equation shows that the expansion coefficients G,,(z|£) are one-dimensional 
Green's functions; we can find them, as we did in Section 15.2, by piecing together homo- 
geneous solutions to Equation 15.6.55 that are valid over various intervals. For the region 
0 € z € £, the solution to this equation that vanishes at x = 0 is 


Gn tat) = An sin(^77) (15.6.56) 


where A, is presently arbitrary. The corresponding solution for £ € x < a is 


na(a — 2] | 


G5 (x|£) = Bn sinh] 5 


(15.6.57) 


Note that this solution vanishes at x = a. Because the Green's function must be continuous 


at x = Å, 
An sim (276) = Bn sin [2762] . (15.6.58) 


On the other hand, the appropriate jump discontinuity of G^, (z|£) yields 


nT nr(a— £) nT nr£N 2, (mmm 
ZB, cosh 5 | b An cosh 5 )- ;sin( 5 k (15.6.59) 


Solving for A, and Bn, 


zie 2 sin (271) sinh[nz(a — £)/0] (15.6.60) 


b sinh(nza/b) ' 
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g(xykm) 


Figure 15.6.2: The Green’s function, Equation 15.6.62 or Equation 15.6.63, for the planar Poisson equation 
over a rectangular area with Dirichlet boundary conditions on all sides when a = b and £/b = n/b = 0.3. 


me h(né/b) 
2 nm sinh(nm. 
B, = i : 15.6.61 
” mm sin( b ) sinh(nza/b) HUS 
This yields the Green's function 
_2 c^ sinh[(nz(a — ay)/b] sinh(nmaz/b) . (mmm . (mmy 
g(a, ylen) T 2 n sinh(nra/b) sin( b ) sin( b ) y (5S2 


where zy = max(z,£) and rz = min(z,£). Figure 15.6.2 illustrates Equation 15.6.62 in 
the case of a square domain with £/b = n/b = 0.3. 


If we began with a Fourier expansion in the y-direction, we would have obtained 


oo 


sinh[mz(b — y.)/a| sinh(mmy-/a) . (mn£YN . (mara 
g(x, l6. n) o» CENTS NP sin| — sin( 7 ) , (15.6.63) 
where y> = max(y,7) and yz = min(y, n). m 


e Example 15.6.5: Two-dimensional Helmholtz equation over a rectangular domain 
The problem to be solved is 
o? 2 
cg nr By + kog = —ó(x — €)d(y — n), (15.6.64) 
where 0 < 2,€ < a, and 0 < y, 1] < b, subject to the boundary conditions that 


9(0, yl€,7) = gla, ylé, n) = g(x, O|€, n) = g(a, bE, m) = 0. (15.6.65) 
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We use the same technique to solve Equation 15.6.64 as we did in the previous example 
by assuming that the Green’s function has the form 


g(x, yl€,7) as Gm (x|£) sin( 72) ; (15.6.66) 
m=1 


where the coefficients G,,(z|£) are undetermined functions of x, €, and 7. Substituting this 
series into Equation 15.6.64, multiplying by 2sin(nzy/b)/b, and integrating over y, we find 


that a a 
A n*m 2 | 2 , (NTN 
A ( 7 i) Gn = = sin( ; ) 9 £). (15.6.67) 


The first method for solving Equation 15.6.67 involves writing 


&(r—-£)- : b» sin( "=" sin(“"*) (15.6.68) 


and 
2— . (mna 
m 2. anm sin( 25) ; (15.6.69) 


Upon substituting Equations 15.6.68 and 15.6.69 into Equation 15.6.67, we obtain 
= ( Due n?n? ) MTE 
5 k2— — Grin sin( 22) 

b2 a 


m=1 
Se us Les ?) an (95) sin (E), (15.6.70) 


Matching similar harmonics, 


|... Asin(mné£/a) sin(nan/b) 
"ee ab(m?n? /a? + n?n?/b? — k2)' 


(15.6.71) 


and the bilinear form of the Green’s function is 


bi abi 9 3 sin(mn£/a) sin(nan/b) sin(maa/a) sin(nmy/b) (15.6.72) 


272/72 242 /b2 2 
MT n b? — kg 
n=lm=1 /a? + / 


See Figure 15.6.3. The bilinear form of the Green's function for the two-dimensional Helm- 
holtz equation with Neumann boundary conditions is left as Problem 15. 

As in the previous example, we can construct an alternative to the bilinear form of the 
Green's function, Equation 15.6.72, by writing Equation 15.6.67 as 


Gy 
dx? 


2. (mmy 
— k2 = 
Gy z sin( ; ) (a £), (15.6.73) 
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Figure 15.6.3: The Green’s function, Equation 15.6.72, for Helmholtz’s equation over a rectangular region 


with a Dirichlet boundary condition on the sides when a = b, koa = 10, and £/a = n/a = 0.35. 


where k2 = n?n? /b? — kê. The homogeneous solution to Equation 15.6.73 is now 


B A, sinh(k,2), 0 € x € €, 
Gn(alé) = s smhk.(a—-z), <ra. 


'This solution satisfies the boundary conditions at both endpoints. 
Because G,,(x|£) must be continuous at x = £, 


An sinh(k,£) = By sinh[k, (a — £)]. 


On the other hand, the jump discontinuity involving G7, (x|) yields 


2 
—kn Bn cosh[k, (a — €)] — kn An cosh(kn€) = (b sim) 


Solving for A, and By, 


2 , (mmm sinh[k,(a — €)] 
Aca sin( b ) sinh(k,a) ^ 


and 


= 2. (mmm sinh(k,£) 
E sin( b Jie 


This yields the Green's function 


glz, yl, m) = p» sin[s, (a — z>)] sin(Knt<) sin(72) sin( 772) 


9 N 
be Kn Sin(&a) b 
2 
b 


TL 


X sinh[k,(a — z4.)]sinh(k,zrz) . pnmn\ . (mmy 
2 ‘ kn sinh(k,a) sin( b )s (C) i 


1 
n=N+ 


(15.6.74) 


(15.6.75) 


(15.6.76) 


(15.6.77) 


(15.6.78) 


(15.6.79) 
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where z> = max(z,£) and z< = min(z,£). Here N denotes the largest value of n such that 
k2 < 0, and K2 = kĝ — n?z?/0?. If we began with a Fourier expansion in the y direction, 
we would have obtained 


M 


g(x, yl, n) = ES sin[em (5 — y») sin(emg«) sin( "= si(* 77) 


Kim SÌN(Kmb) 


m=1 


n. x sinh inl — o>) sah Eme) sin( 276) sin( 77), (15.6.80) 


km sinh(kmb) 
m=M+1 


where M denotes the largest value of m such that k2, < 0, k2, = m?n? /a? — k2, k2, = 


kô — m?n?/a?, y< = min(y, n), and y> = max(y, n). 


e Example 15.6.6: Two-dimensional Helmholtz equation over a circular disk 


In this example, we find the Green’s function for the Helmholtz equation when the 
domain consists of the circular region 0 < r < a. The Green’s function is governed by the 
partial differential equation 


18 0g 18g > ó(r — p)ó(0 — 0^) 
m (32) a age + 09 - ; (15.6.81) 


where 0 < r, p < a, and 0 < 0,0' < 27r, with the boundary conditions 


lim |g(r.6]p,0)| « oo, — 9(a.6]p,09) 20, — 0x6,0 < 2r. (15.6.82) 
Tr 


The Green's function must be periodic in 0. 
We begin by noting that 


ó(0 — 0") = + + L 3 cos[n(0 — 0)| = = 3 cos[n(0 — 0")]. (15.6.83) 


n=— oo 


Therefore, the solution has the form 


oo 


g(r.0]p,0) = M7 g«(rlp) cos[n(0 — 6')]. (15.6.84) 


Substituting Equation 15.6.83 and Equation 15.6.84 into Equation 15.6.81 and simplifying, 
we find that roar " " 
1 n B 2 r—p 
ntkogn = — . 15.6.8 
r dr (« dr ) r29 d 2xr (156.85) 


The solution to Equation 15.6.85 is the Fourier-Bessel series 


< k 
gs(rlp) = > Aand mr) ; (15.6.86) 
m=1 


a 


where knm is the mth root of J,(k) = 0. Upon substituting Equation 15.6.86 into Equation 
15.6.85 and solving for Anm, we have that 


1 id kamt 
(HB Ja um ———Ó SGT (==) de (15.6.87) 


a 
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g(r.9/p,9") 


ya x/a 


Figure 15.6.4: The Green's function, Equation 15.6.89, for Helmholtz's equation within a circular disk 
with a Dirichlet boundary condition on the rim when koa = 10, p/a = 0.35 /2, and 0' = 7/4. 


Or 
Jn (knmp/a) 
Anm = . 15.6.88 
Thk? m = kla?) J’? (knm) ( ) 
Thus, the Green's function?” is 
i = In (KknmP/@)In(knmt/a) 
|p, 8") = — cos[n(0 — 6')]. (15.6.89) 
g(r, 0p, 6^) o» (RÀ, Ka?) J? (bs) [n (6 — 8^)] 


See Figure 15.6.4. 
Problems 


1. Using a Fourier sine expansion in the z-direction, construct the Green's function governed 
by the planar Poisson equation 


2 2 
ai ae ü(z—£)ó(y—9, O<a,€<a, —w«1),9«o, 


with the Dirichlet boundary conditions 


g(0,y|£, n) = g(a, ylE,n) 20, —oo«y « oo, 


?5 For an example of its use, see Zhang, D. R., and C. F. Foo, 1999: Fields analysis in a solid magnetic 
toroidal core with circular cross section based on Green's function. [EEE Trans. Magnetics, 35, 3760-3762. 
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and the conditions at infinity that 


| m. g(x, ylE,7) 0, 0 « x « a. 


2. Using a generalized Fourier expansion in the x-direction, construct the Green's function 
governed by the planar Poisson equation 


3g Og . 
Ox? Əy? — 


ó(x E)d(y n), O0<2,€ <a, =~ < yY, < O, 


with the Neumann and Dirichlet boundary conditions 
g« (0, y|&, m) = gla, yl, m) = 0, —00 < y «0o, 
and the conditions at infinity that 


lim g(x, y|l£, n) — 0, 0 «€ x « a. 
|y| oo 


3. Using a Fourier sine expansion in the y-direction, show that the Green's function governed 
by the planar Poisson equation 
Og | Og _ 
0r? Oy) — 


ó(r—&)y—-m), O<2,E<a, 0«yn«b, 


with the boundary conditions 


g(x, 0|£, n) ud g(x, bE, m) = 0, 


and 
9(0, ln) = g«(a, y|&, m) + 8 gla, ylen) =0, B20, 
is 
X sinh(ve<) (v cosh [v(a — z4.)] + Bsinh[v(a—2xs)]} . pnany . (may 
(2, 96,7) ux V? cosh(va) + Bv sinh(va) sin( b ) sin( 5") à 


where v = nz /b, x> = max(z,£), and z< = min(z, £). 


4. Using the Fourier series representation of the delta function in a circular domain: 
ó(0 — 0^) PLE on (6 — 6')] 0x 0,0! < 2r, 


construct the Green's function governed by the planar Poisson equation 


10 29 18g — ó(r — p)ó(0 — 6’) 
rôr\ Or] ` 


r2 062 T i 
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where a < r,p < b, and 0 < 6,0’ < 2r, subject to the boundary conditions g(a, |p, 6’) = 
g(b, 0|p, 6’) = 0 and periodicity in 0. You may want to review how to solve the equidimen- 
sional equation, Section 2.7. 


5. Construct the Green’s function governed by the planar Poisson equation 


18 / 0gN, 10g . Ó(r—p)o(0 —0) 
r Or "Br 00 r , 


where 0 < r, p < oo, and 0 < 6,0’ < B, subject to the boundary conditions that g(r, 0|p, 6’) 
= g(r, B|p, 0’) = 0 for all r. Hint: 


6(0— 6’) = 2 sin( "= ) sia( 23) à 
n=1 


You may want to review how to solve the equidimensional equation, Section 2.7. 


6. Construct the Green’s function governed by the planar Poisson equation 


18 / 0gN, 10g . d(r—p)d(0—- 6) 
r Or "Or "pop r , 


where 0 < r, p < a, and 0 < 6,0’ < B, subject to the boundary conditions g(r, 0|p, 0") = 
g(r, B|p, 0") = g(a, O|p, 6") = 0. Hint: 


6(0— 6’) = 32. sin( “= ) sin( 232) ; 
n=1 


You may want to review how to solve the equidimensional equation, Section 2.7. 


7. Using a Fourier sine series in the z-direction and the results from Problem 31 at the end 
of Section 6.5, find the Green’s function governed by the axisymmetric Poisson equation 


Pg 109 Fg | ó(r— p)ó(z —€) 


or? rOr O2 — 2nT , 


where 0 < r, p <a, and 0 < z,¢ < L, subject to the boundary conditions 


g(r, Olp, ¢) = g(r, L|p, C) = 0, O0<r<a, 


and 
lim |9(r, 2|p,6)| <œ, g(a.2]p, C) 20, 0c z « L. 


8. Following Example 15.6.5 except for using Fourier cosine series, construct the Green's 
function?? governed by the planar Helmholtz equation 


2 
daz + Bye 709—909 — m, 0<2,& <a, 0<y,n <8, 


26 Kulkarni et al. (Kulkarni, S., F. G. Leppington, and E. G. Broadbent, 2001: Vibrations in several 


interconnected regions: A comparison of SEA, ray theory and numerical results. Wave Motion, 33, 79-96) 
solved this problem when the domain has two different, constant k2's. 
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subject to the Neumann boundary conditions 


gx(0, ylE, 7) = g«(o,ul£, m) =0, O«y«b, 


and 
Gy (x, O|E, 7) = gy(z, blé, n) 20, O<aK<a. 


9. Using Fourier sine transforms, 


P fe E 
g(x, y|&, m) = Ji G(k, y|£, n) sin(ka) dk, 
0 
where HP 
Gees n) = f o alén)sin( ke) de, 
0 
find the Green's function governed by 


oF o? 
z +g TIE- 


for the quarter space 0 < z, y, with the boundary conditions 


g(0,ylé, n) = g(x, 0l, n) = 0, 


and 
lim g(a, y|£, n) — 0. 
2,y—00 


Step 1: Taking the Fourier sine transform in the x direction, show that the partial differ- 
ential equation reduces to the ordinary differential equation 


ËG 
dy? - k’ G=- sin(k£)ó(y — n), 


with the boundary conditions 


G(k,0|§,7) — 0, ^ and lim G(K, y|E, n) 0. 
y oo 


Step 2: Show that the particular solution to the ordinary differential equation in Step 1 is 
sin(k£) k 
Gk, lt, n) = FES aoa, 


You may want to review Example 15.2.8. 


Step 3: Find the homogeneous solution to the ordinary differential equation in Step 1 so 
that the general solution satisfies the boundary conditions. Show that the general solution 
is 

sin(k£) 


G(k.yl&; m) = ae e — ek] . 
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Step 4: Taking the inverse, show that 


ae ae ee dk 
glz, ylé, m) = -= n |e hcl — echar sin(k£) sin(kz) +. 
0 


T 


Step 5: Performing the integration,?" show that 


1, file 92+ v= mle +e? + EROS 
seven =z hn e Cro £x (y Oe 


10. Find the free-space Green's function?? governed by 


ag | O?g 
Ox? T Oy? im 


ó(x E)d(y n), —o0 < 2,y,€,T « oo. 


Step 1: Introducing the Fourier transform 


1 S . 
g(x, y|&, m) = x] Gk, y|£, 1) e'^* dk, 


where x 


G(k,y|£, m) = 7 glx, lé. m)e- * da, 


— oo 


show that the governing partial differential equation can be transformed into the ordinary 
differential equation 


Step 2: Introducing the Fourier transform in the y-direction, 
1 i ily 
G(k, y|&, n) = on G(k, ££, ne d£, 


where 


Geten = f G(k, yl£ nen dy, 


solve the ordinary differential equation in Step 1 and show that 


en ike pœ — gium) 
atu = to [^ la 


27 Gradshteyn, I. S., and I. M. Ryzhik, 1965: Table of Integrals, Series, and Products. Academic Press, 
Section 3.947, Formula 1. 


?8 For its use, see Geisler, J. E., 1970: Linear theory of the response of a two layer ocean to a moving 
hurricane. Geophys. Fluid Dyn., 1, 249-272. 
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Step 3: Complete the problem by showing that 


eec Deom 
g(v, yl, m) = aj. f: ea 44 


2n gir cos(0— p) 
ES! L” — 3—— — «dé dk 


?? Jo(rk) ae e ) 
~ 2m o K?-1 2T 


, 


where r = \/(x — £)? + (y — 9?, k = &cos(0), £ = ksin(0), x — £ = rcos(y), andy 5 = 
rsin(y). You may want to review the material around Equation 9.3.73 through Equation 
9.3.75. You will need to use integral tables?? to obtain the final result. 


11. Find the free-space Green's function governed by 


Pg Og Og 
912 + By? An = O(a £)ó(y n), —00 < x,y, n < oo. 


Step 1: By introducing y(x, y|£, n) such that 
g(z, y|£, n) = e" ple l6, n), 
show that the partial differential equation for g(x, y|£,) becomes 


Pp Fy gs 


Step 2: After taking the Fourier transform with respect to x of the partial differential 
equation in Step 1, show that it becomes the ordinary differential equation 


d?d . 
dy? zz (k? + 1) d= =e t ly = n). 


Step 3: Introducing the same transformation as in Step 3 of the previous problem, show 


that 
Bu e &/2-ik& poo — git(y—m) d 
( ,y|£, m = On M ee , 
and 
e &/ ; 
(a, y|5, m) = 7  Ko(sr), 
T 


where r = \/(x — £)? + (y—7)?. 
Step 4: Using the transformation introduced in Step 1, show that 


e(t—§)/2 
20 


g(x, ylE,n) = 


0 ir). 


?9 Gradshteyn and Ryzhik, op. cit., Section 6.532, Formula 6. 
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15.7 GALERKIN METHOD 


In the previous sections we developed various analytic expressions for Green’s functions. 
We close this chapter by showing how to construct a numerical approximation. 

In Sections 6.9 and 9.8 we showed how finite elements can be used to solve differential 
equations by introducing subdomains known as finite elements rather than a grid of nodal 
points. The solution is then represented within each element by an interpolating polynomial. 
Unlike finite difference schemes that are constructed from Taylor expansions, the theory 
behind finite elements introduces concepts from functional analysis and variational methods 
to formulate the algebraic equations. 

There are several paths that lead to the same finite element formulation. The two most 
common techniques are the Galerkin and collocation methods. In this section we focus on 
the Galerkin method. This method employs a rational polynomial, called a basis function, 
that satisfies the boundary conditions. 

We begin by considering the Sturm-Liouville problem governed by 


Pin 


di + Antn = 0, 0crc«L, (15.7.1) 
L 


subject to the boundary conditions Yn(0) = Yn(L) = 0. Although we could solve this 
problem exactly, we will pretend that we cannot. Rather, we will assume that we can 
express it by 


N 
n(x) = X` ons fj (x), (15.7.2) 
j=l 
where f;(x) is our basis function. Clearly, it is desirable that f;(0) = f;(L) = 0. 


How do we compute a,;? We begin by multiplying Equation 15.7.1 by f;(r) and 
integrating the resulting equation from 0 and L. This yields 


L dy L 
J fi(z)—— dx + rn ff fi(z)Un(x) dx = 0, (15.7.3) 
0 da? 0 
where 7 = 1,2,3,..., N. Next, we substitute Equation 15.7.2 and find that 
N L L 
5 | | fila) fi! (x) dz + Xn f fila) f;(a) d anj = 0. (15.7.4) 
j= L70 0 
We can write Equation 15.7.4 as 
(A 4- A4,B)d — 0, (15.7.5) 
where 
L L 
a= RORE de=- f finas. (15.7.6) 
0 0 
L 
hy f| GG) (15.7.7) 
0 


and the vector d contains the elements o;. 
There are several obvious choices for f;(x): 
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e Example 15.7.1 


The simplest choice for f;(x) = sin(jmx/L). If we select N = 2, Equation 15.7.2 
becomes 


Us(x) = Ani sin(77) + An2 sin( 7) i (15.7.8) 


From Equation 15.7.6 and Equation 15.7.7, 


ac im ; : 
aj; =— (=) | s (5) s (452) dz, i=1,2, j=1,2; (15.7.9) 


T. . 2 
by = | sin( =) sin( 4) dz, i=1,2, j21,2. (15.7.10) 
n L L 


Performing the integrations, a12 = a21 = b12 = bo, = 0, ay, = —7°/ (2L), a22 = —27?/L, 
and bii = bo» = L/2. 
Returning to Equation 15.7.5, it becomes 


=r? /2 + rn L?/2 0 am \ (0 
( : expen eta (15.7.11) 


In order for Equation 15.7.11 to have a unique solution, 


and 


—2?/2 + ALL? /2 0 
| / 0 / —282 + An L?/2 | = 0. (15.7.12) 
Equation 15.7.12 yields 4, = A» = 41?/L?. 
In summary, 
V1(x) = sin(zz/L), Aree LL (15.7.13) 
and 
V»(x) = sin(2nz/L), dg = Am? IET, (15.7.14) 


with a1» = a2, = 0. Here we have chosen that a1, = a2» = 1. 


e Example 15.7.2 
Another possible choice for f;(x) involves polynomials of the form (1 — z/L)(x/ L)? 
with 7 = 1,2. Unlike the previous example, we have nonorthogonal basis functions here. 
Note that f;(0) — f;(L) — 0. Therefore, Equation 15.7.2 becomes 
Va(x) = Oni (1 — z/D)(z/L) + o«3(1 — »/ D)(z/ L?. (15.7.15) 
From Equation 15.7.6 and Equation 15.7.7, 
T4 £ qe? q$Nj-2 qgAj-1 
ds 1-3) pe-n(z) =a +9G) [4 
aij DJ. =) (2) |i Jr IG+V 5 T 
(15.7.16) 


Te 39-1 300. jor brat 
L|i+j-1 i+j i+j  i+j+lj’ DS 
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with i = 1,2 and j = 1,2. Similarly, 


w= f usse acm (15.7.18) 


T 2 1 
= L|—— T do : 15.7.19 
Ez t+j+2 ESEE] ( ) 


Performing the computations, a1; = —1/(3L), aig = ag, = —1/(6L), as» = —2/(15L), 
bi = L/30, bio = ba = L/60, and boo = L/105. 
Returning to Equation 15.7.5, it becomes 


eeo ca vraies) (o)-  — guum 


In order for Equation 15.7.20 to have a unique solution, 


—1/3 + A.L2/30  —1/6 + AnL?/60 
—1/6 + A,L2/60 —2/15 + A,12/105 


| — 0. (15.7.21) 


Equation 15.7.21 yields \;L? = 10 and \2L? = 42. Note how close these values of A are to 
those found in the previous example. Returning to Equation 15.7.20, we find that ay; = 1, 
Q12 = 0, Q22 = 1, and Q21 = —1/2. 

In summary, 


p(x) = (1 = =) E à = Di (15.7.22) 
= =o 1 L\ x L\ (cz? _ 42 
pls) = -3 (1 =) E (1 =) (2) (p (15.7.23) 


Because f;(x) are linearly independent, their use in the Galerkin expansion is quite ac- 
ceptable. However, because these functions are not particularly orthogonal, their usefulness 
will become more difficult as N increases. Consequently, the choice of orthogonal functions 
is often best. 


How do we employ the Galerkin technique to approximate Green's functions? We begin 
by considering the inhomogeneous heat conduction problem: 


ðu u 
— — = = F (zx,t L t 15.7.24 
or Bg (x,t) O<2<L, 0<t, (15.7.24) 
with the boundary conditions 
u(0,t) = u(L,t) = 0, 0 « t, (15.7.25) 


and the initial condition u(z, 0) 2 0, 0 < x « L. 
Let us write the solution to this problem as 


N 


u(x,t) = X Cia ae 206 (15.7.26) 


n=1 
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Direct substitution of Equation 15.7.26 into Equation 15.7.24, followed by multiplying the 
resulting equation by f;(a) and integrating from 0 to L, gives 


~ p i X (des 
25 elte An JE IE c- >> ( dt Anen) e^ JE fila dn (x 


n=1 
2 fila) F (a, t) da. (15.7.27) 
0 
Because p 
= + Antn = 0, (15.7.28) 


Equation 15.7.27 simplifies to 


eM» d fila) un (a Jar = f fila) F(a, t) dx = F7 (t), (15.7.29) 


where i = 1,2,..., N. 
We must now find cn. We can write Equation 15.7.29 as 


N de 
5 eine n —* dx = F*(t), (15.7.30) 
n=1 
where 


Using linear algebra, we find that 
s% den 
emant E -Yeen Y (15.7.32) 


where pn; are the elements of an array P = E^! and E = (DB)T. The arrays D and B 
consist of elements o;j and bij, respectively. Solving Equation 15.7.32, we find that 


= Án «Yos f 7i 7 (n)e^"? dn. (15.7.33) 


Because u(z,0) = 0, c,(0) = 0 and A, = 0. 
We are now ready to find our Green's function. Let us set F(x,t) = ó(x — €)d(t — T). 
Then F(t) = f;(£)ó(t — 7) and 


N 
en(t) = H(t- 7) V Pri file. (15.7.34) 
i=1 
From Equation 15.7.2, Equation 15.7.26, and Equation 15.7.34, we obtain the final result 


that 
NNN 


glx, thé, T) = H(t- 7) V, S anpa fi(£) fi(z)e ^77. (15.7.35) 


n=1 j=1 i=1 
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e Example 15.7.3 


In Example 15.5.2, we solved the Green’s function problem: 


Og g » 

DE aga 960-990 —- 7), (15.7.36) 
with the boundary condition 

9(0, t|€, T) = g(L.t|& T) = 0, (15.7.37) 


and the initial condition g(z,0|£, 7) = 0. There we found the solution (Equation 15.5.34): 
g(a, t£ 7) = H(t- v » Vy (E) (z)e ^ 77, (15.7.38) 


where we have the orthonormal eigenfunctions 


Un(a) = V2/L sin(k,z), kn = nn/L. (15.7.39) 


Let us use the basis function f;(z) = (1— x/L)(x/L)! to find the approximate Green's 
function to Equation 15.7.36. Here j = 1,2,3,..., N, 
For N — 2, we showed in Example 15.7.2 that 


= 1/30 1/60 u 1 0 
par (YY). pa( 1. T). asra 
Consequently, 
L (98 14 L (28 0 
BD = — E = — . 15.7.41 
a » E » ( ) 
Using Gaussian elimination, 
1 30 0 
eee A 
P=E'=>7 E d) : (15.7.42) 


Therefore, the two-term approximation to the Green's function, Equation 15.7.38, is 
| 3802 TA E € 10(t — 7) 
glx, t|, T) = LE r T (: : ap] I: H(t — 7) 
2 
| 210 x ee 420 x (1 J £ 1 € 
L i m LL L/NL L 
420 a\ E E\ 840 (m2 zN(£M £ 
23) 0 -96- +) +" (3) (2 AIC : L 


xexp| BOO pm 7). (15.7.43) 


mn 


For N » 2, hand computations are very cumbersome and numerical computations are 
necessary. For a specific N, we first compute the arrays A and B via Equation 15.7.17 and 
Equation 15.7.19. 
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Table 15.7.1: The Value of L?A, for n = 1,2,..., N as a Function of N. 


n Exact N=2 N=3 N=4 N=6 N=8 N=10 
1 9.87 10.00 9.87 9.87 9.87 9.87 9.87 
2 39.48 42.00 42.00 39.50 39.48 39.48 39.48 
3 88.83 102.13 102.13 89.17 88.83 88.83 
4 157.91 200.50 159.99 157.96 157.91 
5 246.74 350.96 254.42 247.04 
6 355.31 570.53 376.47 356.65 
7 483.61 878.88 531.55 
8 631.65 1298.03 725.34 
9 799.44 1850.98 


(= 
Ee 
[9.9] 
e»t 
ke 
a 
bo 
[o1 
A 
d 
N 
e 


for j - 
for i - 
AGi,j) 


1:N 

1:N 

j*(j-1)/(i+j-1) - j*(j-1)/(i+j) 

j*(j+1)/(i+j+1) - j*(j+1)/(i+j) ; 
B(i,j) = 1/(i+j+1) - 2/(i+j+2) + 1/(i+j+3); 

end; end 


+m: 


Next we compute the A,'s and corresponding eigenfunctions: [v,d] = eig(A,-B). 
Table 15.7.1 gives L?A, for several values of N. 

Once we found the eigenvalues and eigenvectors, we now compute the matrices D, E, 
and P. For convenience we have reordered the eigenvalues so that their numerical value 
increases with n. Furthermore, we have set o, equal to one for n = 1,2,..., N. 


[lambda,ix] = sort(temp); 
for i= 1:N 
for j = 1:N 

D(i,j) = v(j,ix(i)); 
end; end 
for i = 1:N 

denom = D(i,i); 

for j = 1:N 

D(i,j) = D(i,j) / denom; 

end; end 
E = transpose(D*B); 
P - inv(E); 


Having computed the matrices D and P, our final task is the computation of the 
Green's function using Equation 15.7.35. The MATLAB code is: 


phi i(1) = (1-xi)*xi; 


for i = 2:N 
phi_i(i) = xi*phi_i(i-1); 
end 


for ii = 1:idim 


x = (ii-1)*dx; 
phi_j(1) = (1-x)*x; 
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L g(x,tlé,7) 


Figure 15.7.1: Comparison of the exact Green's function Lg(x,t|€,7) for a one-dimensional heat equation 
given by Equation 15.7.38 (solid line) and approximate Green's functions found by the Galerkin method for 
different values of N. Here (t — 7)/L? = 0.001 and £ = 0.4. 


for j = 2:N 
phi_j(j) = x*phi_j(j-1); 
end 
for n = 1:N 
for j= 1:N 
for i = 1:N 
g(ii) = g(ii) + D(n,j.*PQ,i).*phi j(j).*phi.i(i) 


.*exp(-lambda(n)*time); 
end; end; end 


end 


In this code the parameter time denotes the quantity (4—7)/L?. Figure 15.7.1 compares 
this approximate Green's function for various N against the exact solution. One of the 
problems with this method is finding the inverse of the array E. As N increases, the 
accuracy of the inverse becomes poorer. 


Further Readings 
Beck, J. V., K. D. Cole, A. Haji-Sheikh, and B. Litkouhi, 1992: Heat Conduction Using 
Green’s Functions. Hemisphere Publishing Corp., 523 pp. Detailed study of solving heat 


conduction problems via Green’s functions. 


Carslaw, H. S., and J. C. Jaeger, 1959: Conduction of Heat in Solids. Oxford University 
Press, Chapter 14. An early classic for finding the Green's function for the heat equation. 
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Duffy, D. G., 2015: Green's Functions with Applications. Chapman & Hall/ CRC, 464 pp. 
A source book. 


Ózisik, M. N., 1993: Heat Conduction. John Wiley & Sons, Chapter 6. A book of how to 
solve partial differential equations of heat conduction. 


Stakgold, I., 1997: Green's Functions and Boundary Value Problems. Wiley-Interscience, 
720 pp. A systematic approach to boundary-value problems. 
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Chapter 16 
Probability 


So far in this book we presented mathematical techniques that are used to solve deter- 
ministic problems - problems in which the underlying physical processes are known exactly. 
In this and the next chapter we turn to problems in which uncertainty is key. 

Although probability theory was first developed to explain the behavior of games of 
chance,! its usefulness in the physical sciences and engineering became apparent by the late 
nineteenth century. Consider, for example, the biological process of birth and death that 
we treated in Example 1.2.9. If b denotes the birth rate and d is the death rate, the size of 
the population P(t) at time t is 


P(t) = P(0)e-2*. (16.0.1) 


Let us examine the situation when P(0) = 1 and b = 2d so that a birth is twice as likely 
to occur as a death. Then, Equation 16.0.1 predicts exponential growth with P(t) = e*. 
But the first event may be a death, a one-in-three chance since d/(b + d) = 1/3, and this 
would result in the population immediately becoming extinct. Consequently we see that 
for small populations, chance fluctuations become important and a deterministic model is 
inadequate. 

'The purpose of this and the next chapter is to introduce mathematical techniques that 
will lead to realistic models where chance plays an important role, and show under what 
conditions deterministic models will work. In this chapter we present those concepts that 
we will need in the next chapter to explain random processes. 


1 Todhunter, I., 1949: A History of the Mathematical Theory of Probability From the Time of Pascal 
to That of Laplace. Chelsea, 624 pp.; Hald, A., 1990: A History of Probability and Statistics and Their 
Applications before 1750. John Wiley & Sons, 586 pp. 
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16.1 REVIEW OF SET THEORY 


Often we must count various objects in order to compute a probability. Sets provide a 
formal method to aid in these computations. Here we review important concepts from set 
theory. 

Sets are collections of objects, such as the number of undergraduate students at a 
college. We define a set A either by naming the objects or describing the objects. For 
example, the set of natural numbers can be either enumerated: 


A= {1,2,3,4,...}, (16.1.1) 


or described: 
A= {I:I is an integer and J > 1}. (16.1.2) 


Each object in set A is called an element and each element is distinct. Furthermore, the 
ordering of the elements within the set is not important. 

Two sets are said to be equal if they contain the same elements and are written A = B. 
An element x of a set A is denoted by x € A. A set with no elements is called a empty 
or null set and denoted by Ø. On the other hand, a universal set is the set of all elements 
under consideration. 

A set B is subset of a set A, written B C A, if every element in B is also an element 
of A. For example, if A = (x :0 € x < œ} and S = {x : -oo < x < ool, then ACS. 
We can also use this concept to define the equality of sets A and B. Equality occurs when 
A C B and B C A. 

The complement of A, written A, is the set of elements in S but not in A. For example, 
if A= {x:0<2 < oo} and S = {z: -c0 < x < oo}, then A= (z: —co < x < 0). 

Two sets can be combined together to form a new set. This union of A and B, written 
AUB, creates a new set that contains elements that belong to A and/or B. This definition 
can be extended to multiple sets A1, A2,..., Ay so that the union is the set of elements for 
which each element belongs to at least one of these sets. It is written 


N 
Ay U Ag U As UU An = [ J As. (16.1.3) 


i=l 


The intersection of sets A and B, written AN B, is defined as the set of elements that 
belong to both A and B. This definition can also be extended to multiple sets A1, Ag,..., An 
so that the intersection is the set of elements for which each element belongs to all of these 


sets. It is written 
N 


Ain Ag Ag n n Ay = (] Ai. (16.1.4) 
i=1 
If two sets A and B have no elements in common, they are said to be disjoint and ANB = (). 

A popular tool for visualizing set operations is the Venn diagram.? For sets A and B 
Figure 16.1.1 pictorially illustrates AU B, An B, A, and An B. 

With these definitions a number of results follow: A = A, AU A = S, An A = Í, 
AU — A, ANO=0, AUS — S, ANS = A, S =O, and Ü = S. Here S denotes the 
universal set. 

Sets obey various rules similar to those encountered in algebra. They include: 


? Venn, J., 2008: Symbolic Logic. Kessinger, 492 pp. 
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DP 


Shaded area: ANB 


SSS 
SSS 
SSS 


Shaded area: A Shaded area: ANB 


Figure 16.1.1: Examples of Venn diagrams for various configurations of sets A and B. Note that in the 
case of the lower right diagram, B C A. 


Commutative properties: AU B = BUA, An B- Bn A. 

Associate properties: AU(BUC)=(AUB)UC, AN(BNC)=(ANB)NC. 
Distributive properties: AN( BUC) = (ANB)U(ANC), AU(BNC) = (AUB)N(AUC). 
De Morgan’s law: AU B = ANB. 


ae ge ea 


Finally we define the size of a set. Discrete sets may have a finite number of el- 
ements or countably infinite number of elements. By countably infinite we mean that 
we could in theory count the number of elements in the sets. Two simple examples are 
A = {1,2,3,4,5,6} and A = {1,4,16,64,...}. Discrete sets lie in opposition to continuous 
sets where the elements are infinite in number and cannot be counted. A simple example 
is A= {x:0< a4 < 2}. 


Problems 


1. If B C A, use Venn diagrams to show that A = BU (BN A) and BN (BNA) =9. Hint: 
Use the Venn diagram in the lower right frame of Figure 16.1.1. 


2. Using Venn diagrams, show that AU B = AU (AN B) and B = (An B)U(An B). Hint: 
For AN B, use the upper right frame from Figure 16.1.1. 


16.2 CLASSIC PROBABILITY 


All questions of probability begin with the concept of an experiment where the governing 
principle is chance. The set of all possible outcomes of a random experiment is called the 
sample space (or universal set); we shall denote it by S. An element of S is called a sample 
point. The number of elements in S can be finite as in the flipping of a coin twice, infinite 
but countable such as repeatedly tossing a coin and counting the number of heads, or infinite 
and uncountable, as measuring the lifetime of a light bulb. 
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Any subset of the sample set S is called an event. If this event contains a single point, 
then the event is elementary or simple. 


e Example 16.2.1 


Consider an experiment that consists of two steps. In the first step, a die is tossed. If 
the number of dots on the top of the die is even, a coin is flipped; if the number of dots on 
the die is odd, a ball is selected from a box containing blue and green balls. The sample 
space is S = {1B,1G, 2H, 2T,3B,3G, AH, AT,5B,5G, 6H, 6T]. The event A of obtaining a 
blue ball is A = {1B,3B, 5B}, of obtaining a green ball is B = (1G,3G,5G], and obtaining 
an even number of dots when the die is tossed is C = (2H, 2T, 4H, AT, 6H,6T ]. The simple 
event of obtaining a 1 on the die and a blue ball is D = {1B}. 


| Equally likely outcomes 


An important class of probability problems consists of those whose outcomes are equally 
likely. This expression “equally likely” is essentially an intuitive one. For example, if a coin 
is tossed it seems reasonable that the coin is just as likely to fall “heads” as to fall “tails.” 
Probability seeks to quantify this common sense. 

Consider a sample space S of an experiment that consists of finitely many outcomes 
that are equally likely. Then the probability of an event A is 


. Number of points in A 


P(A) (16.2.1) 


~ Number of points in S" 


With this simple definition we are ready to do some simple problems. An important aid 
in our counting is whether we can count a particular sample only once (sampling without 
replacement) or repeatedly (sampling with replacement). The following examples illustrate 
both cases. 


e Example 16.2.2: Balls drawn from urns with replacement 


Imagine the situation where we have an urn that has k red balls and N — k black balls. 
A classic problem asks: What is the chance of two balls being drawn, one after another 
with replacement, where the first ball is red and the second one is black? 

We begin by labeling the k red balls with 1,2,3,..., k and black balls are numbered 
k+1,k+2,...,N. The possible outcomes of the experiment can be written as a 2-tuple 
(21, 22), where 2; € 1,2,3,..., N and z2 € 1,2,3,..., N. A successful outcome is a red ball 
followed by a black one; we can express this case by E = ((z1,22) : 21 = 1,2,...,k3 22 = 
k+1,k+2,...,N}. Now the total number of 2-tuples in the sample space is N? while the 
total number of 2-tuples in E is k(N — k). Therefore, the probability is 


P(E) = e 5) = p(1— p), (16.2.2) 


where p — k/N. 
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e Example 16.2.3: Balls drawn from urns without replacement 


Let us redo the previous example where the same ball now cannot be chosen twice. We 
can express this mathematically by the condition z1 Z z2. The sample space has N(N — 1) 
balls and the number of successful 2-tuples is again k(N — k). The probability is therefore 
given by 
k(N — k) kN-k N N 
N(N-1) N N N-1 N-1 


P(E) = (16.2.3) 
The restriction that we cannot replace the original ball has resulted in a higher probability. 
Why? We have reduced the number of red balls and thereby reduced the chance that we 
again selected another red ball while the situation with the black balls remains unchanged. 


e Example 16.2.4: The birthday problem? 


A classic problem in probability is: What is the chance that at least two individuals 
share the same birthday in a crowd of n people? Actually it is easier to solve the comple- 
mentary problem: What is the chance that no one in a crowd of n individuals shares the 
same birthday? 

For simplicity let us assume that there are only 365 days in the year. Each individual 
then has a birthday on one of these 365 days. Therefore, there are a total of (365)" possible 
outcomes in a given crowd. 

Consider now each individual separately. The first person has a birthday on one of 365 
days. The second person, who cannot have the same birthday, has one of the remaining 
364 days. Therefore, if A denotes the event that no two people have the same birthday and 
each outcome is equally likely, then 


P(A) = na) _ any (e =n+1) needed 


To solve the original question, we note that P(A) = 1 — P(A) where P(A) denotes the 
probability that at least two individuals share the same birthday. 

If n = 50, P(A) ~ 0.03 and P(A) ~ 0.97. On the other hand, if n = 23, P(A) ~ 0.493 
and P(A) ~ 0.507. Figure 16.2.1 illustrates P(A) as a function of n. Nymann* computed 
the probability that in a group of n people, at least one pair will have the same birthday 


with at least one such pair among the first k people. 


In the previous examples we counted the objects in sets A and S. Sometimes we can 
define these sets only as areas on a graph. This graphical definition of probability is 


_ Area covered by set A 


P(A) (16.2.5) 


~ Area covered by set S` 


The following example illustrates this definition. 


3 First posed by von Mises, R., 1939: Über Aufteilungs- und Besetzungswahrscheinlichkeiten. Rev. Fac. 
Sci. Istambul, 4, 145-163. 


^ Nymann, J. E., 1975: Another generalization of the birthday problem. Math. Mag., 53, 111-125. 
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Probability of shared birthdays 


0 10 20 30 —— 40 50 60 
Number of people in crowd 


Figure 16.2.1: The probability that a pair of individuals in a crowd of n people share the same birthday. 


f 


Figure 16.2.2: The graphical solution of whether two fellows can chat online between noon and T' minutes 
after noon. The shaded area denotes the cases when the two will both be online whereas the rectangle gives 
the sample space. 


e Example 16.2.5 


Two friends, Joe and Dave, want to chat online but they will log on independently 
between noon and T' minutes after noon. Because of their schedules Joe can only wait 
t minutes after his log-on while Dave can only spare 7 minutes. Neither fellow can stay 
beyond T minutes after noon. What is the chance that they will chat? 

Let us denote Joe's log-on time by t; and Dave's log-on time by t2. Joe and Dave will 
chat if 0 < t2 — tı < t and 0 < t4 — t9 < T. In Figure 16.2.2 we show the situation where 
these inequalities are both satisfied. The area of the sample space is 77. Therefore, from 
the geometrical definition of probability, the probability P(A) that they will chat is 


T? — (T - ?/2 — (T — 73/2 


P(A) = ^ 


(16.2.6) 


oO 
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Shaded area: ANB 


Shaded area: ANB 


Figure 16.2.3: The Venn diagram used in the derivation of Property 5. 


So far there has been a single event that interests us and we have only had to compute 
P(A). Suppose we now have two events that we wish to follow. How are the probabilities 
P(A) and P(B) related? 

Consider the case of flipping a coin. We could define event A as obtaining a head, A = 
{head}. Event B could be obtaining a tail, B = {tail}. Clearly AU B = {head, tail} = S, 
the sample space. Furthermore, AN B = () and A and B are mutually exclusive. We already 
know that P(A) = P(B) = 4. But what happens if AN B is not an empty set? 


From our definition of probability and previous examples, we introduce the following 
three basic axions: 


Axion 1: P(A), P(B) > 0, 
Axion 2: P(S) = 1, 
Axion 3: P(AU B) = P(A) + P(B) if An B — 0. 


'The first two axions are clearly true from the definition of probability and sample space. 
It is the third axion that needs some attention. Here we have two mutually exclusive events 
A and B in the sample space S. Because the number of points in AU B equals the number 
of points in A plus the number of points in B, n(AU B) = n(A) 4- n(B). Dividing both sides 
of this equation by the number of sample points and applying Equation 16.2.1, we obtain 
Axion 3 when AN B =f. 


From these three axioms, the following properties can be written down: 
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All of these properties follow readily from our definition of probability except for Prop- 
erty 5 and this is an important one. To prove this property from Axion 3, consider the 
Venn diagram shown in Figure 16.2.3. From this figure we see that 

AUB=AU(ANB) and B=(ANB)U(ANB). (16.2.7) 
From Axion 3, we have that 


P(AUB) = P(A)+ P(An B), (16.2.8) 


and 


P(B) = P(ANB)+P(ANB). (16.2.9) 


Eliminating P(An B) between Equation 16.2.8 and Equation 16.2.9, we obtain Property 5. 
The following example illustrates a probability problem with two events A and B. 


e Example 16.2.6 


Consider Figure 16.2.4. From this figure, we see that A = (An B) U (An B). Because 
ANB and An B are mutually exclusive, then from Axion 3 we have that 


P(A) = P(An B) 4 P(An B). (16.2.10) 


e Example 16.2.7 


A company has 400 employees. Every quarter, 100 of them are tested for drugs. The 
company's policy is to test everyone at random, whether they have been previously tested 
or not. What is the chance that someone is not tested? 

The chance that someone will be tested is 1/4. Therefore, the chance that someone 
will not be tested is 1 — 1/4 — 3/4. 
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| Permutations and combinations 


By now it should be evident that your success at computing probabilities lies in correctly 
counting the objects in a given set. Here we examine two important concepts for systemic 
counting: permutations and combinations. 

A permutation consists of ordering n objects without any regard to their order. For 
example, the six permutations of the three letters a, b, and c are abc, acb, bac, bca, cab, and 
cba. The number of permutations equals n!. 

In a combination of given objects we select one or more objects without regard to their 
order. There are two types of combinations: (1) n different objects, taken k at a time, 
without repetition, and (2) n different objects, taken k at a time, with repetitions. In the 
first case, the number of sets that can be made up from n objects, each set containing k 
different objects and no two sets containing exactly the same k things, equals 


! 
number of different combinations — b = Sino (16.2.11) 


Using the three letters a, b, and c, there are three combinations, taken two letters at a time, 
without repetition: ab, ac, and bc. 

In the second case, the number of sets, consisting of k objects chosen from the n objects 
and each being used as often as desired, is 


nup p (16.2.12) 


number of different combinations — ( k 


Returning to our example using three letters, there are six combinations with repetitions: 
ab, ac, bc, aa, bb, and cc. 


e Example 16.2.8 
An urn contains r red balls and b blue balls. If a random sample of size m is chosen, 
what is the probability that it contains exactly k red balls? 


If we choose a random sample of size m, we obtain s Ü possible outcomes. The 


b 
m-—k 
fore, the probability that a sample of size m contains exactly k red balls is 


r b 
k m-—k 
r+b 
m 
A dog kennel has 50 dogs, including 5 German shepherds. (a) What is the probability of 


choosing 3 German shepherds if 10 dogs are randomly selected? (b) What is the probability 
of choosing all of the German shepherds in a group of 10 dogs that is chosen at random? 


number of samples that includes k red balls and m — k blue balls is (4) ( ) . There- 


e Example 16.2.9 
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Let S denote the sample space of groups of 10 dogs. The number of those groups 
is n(S) = 50!/(10!40!). Let A; denote the set of 10 dogs that contain à German shep- 
herds. Then the number of groups of 10 dogs that contain 4 German shepherds is n(A;) = 
10!/[i!(10 — 2)!]. Therefore, the probability that out of 50 dogs, we can select at random 10 
dogs that include ¿ German shepherds is 


n(A;) 10110140! 


P(A) = 8) = W0— i50 un) 


Thus, P(A3) = 1.1682 x 1078 and P(As) = 2.453 x 1078. 


e Example 16.2.10 


Consider an urn with n red balls and n blue balls inside. Let R = (r1,r2,...,r,] and 
B = {b1,b2,..., bn}. Then the number of subsets of RU B with n elements is a . On 


the other hand, any subset of RU B with n elements can be written as the union of a subset 
of R with i elements and a subset of B with n — i elements for some 0 < i < n. Because, 
for each i, there are E 2v i) such subsets, the total number of subsets of red and 


blue balls with n elements equals 37 s 


2 n n 2 
( ex ( ý J) (16.2.14) 
n i n—i i 
i=0 i=0 
because ( i ) = e 
n-—i i 
| Conditional probability 


Often we are interested in the probability of an event A provided event B occurs. 
Denoting this conditional probability by P(A|B), its probability is given by 


) na 2 Since both approaches must be 


equivalent, 


P(AnB) 


P(A|B) = — P(B) ^ 


P(B) » 0, (16.2.15) 


where P(An B) is the joint probability of A and B. Similarly, 


. P(An B) 
P(B|A) = —P P(A) » 0. (16.2.16) 
Therefore, 
P(An B) = P(A|B)P(B) = P(B|A)P(A), (16.2.17) 
and we obtain the famous Bayes’ rule 
p(Ajp) = POBLADO) (16.2.18) 


P(B) 
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e Example 16.2.11 


Consider a box containing 10 pencils. Three of the pencils are defective with broken 
lead. If we draw 2 pencils out at random, what is the chance that we will have selected 
nondefective pencils? 

There are two possible ways of selecting our two pencils: with and without replacement. 
Let Event A be that the first pencil is not defective and Event B be that the second pencil 
is not defective. Regardless of whether we replace the first pencil or not, P(A) = i because 
each pencil is equally likely to be picked. If we then replace the first pencil, we have the 
same situation before any selection was made and P(B|A) = P(A) = i. Therefore, 


P(AnB) = P(A)P(B|A) = 0.49. (16.2.19) 


On the other hand, if we do not replace the first selected pencil, P(B|A) — S because 
there is one fewer nondefective pencil. Consequently, 


P(An B) = P(A)P(B|A) = Ka x 2 = = < 0.49. (16.2.20) 

10 9 30 
Why do we have a better chance of obtaining defective pencils if we don't replace the 
first one? Our removal of that first, nondefective pencil has reduced the uncertainty because 
we know that there are relatively more defective pencils in the remaining 9 pencils. This 
reduction in uncertainty must be reflected in a reduction in the chances that both selected 
pencils will be nondefective. 


| Law of total probability 


Conditional probabilities are useful because they allow us to simplify probability cal- 
culations. Suppose we have n mutually exclusive events A1, Ag, ..., An whose probabilities 
sum to unity, then 


P(B) = P(B|Ai)P(A)) + P(B|A3)P(A2) +--+ P(BJAn)P(An), (16.2.21) 


where B is an arbitrary event, and P(B|A;) is the conditional probability of B assuming 
Aj. In other words, the law (or formula) of total probability expresses the total probability 
of an outcome that can be realized via several distinct events. 


e Example 16.2.12 


There are three boxes, each containing a different number of light bulbs. The first box 
has 10 bulbs, of which 4 are dead. The second has 6 bulbs, of which one is dead. Finally, 
there is a third box of eight bulbs, of which 3 bulbs are dead. What is the probability of 
choosing a dead bulb if a bulb is randomly chosen from one of the three boxes? 

'The probability of choosing a dead bulb is 


P(D) = P(D|B,)P(B1) + P(D|B3)P(B3) + P(D|Bs)P(Bs) (16.2.22) 


-( (Go) * (3) (6) * (5) (8) = s 06225 
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If we had only one box with a total 24 bulbs, of which 8 were dead, then our chance of 
choosing a dead bulb would be 1/3 > 113/360. 


| Independent events 


If events A and B satisfy the equation 


P(ANB) = P(A)P(B), (16.2.24) 


they are called independent events. From Equation 16.2.15 and Equation 16.2.16, we see 
that if Equation 16.2.24 holds, then 


P(A|B)= P(A), . P(B|A) = P(B), (16.2.25) 


assuming that P(A) Z 0 and P(B) # 0. Therefore, the term “independent” refers to the 
fact that the probability of A does not depend on the occurrence or non-occurrence of B, 
and vice versa. 


e Example 16.2.13 


Imagine some activity where you get two chances to be successful (for example, jumping 
for fruit still on a tree or shooting basketballs). If each attempt is independent and the 
probability of success 0.6 is the same for each trial, what is the probability of success after 
(at most) two tries? 

There are two ways of achieving success. We can be successful in the first attempt 
with P(S1) = 0.6 or we can fail and then be successful on the second attempt: P(Fin 
S2) = P(Fi)P(S$3) = (0.4)(0.6) = 0.24, since each attempt is independent. Therefore, 
the probability of achieving success in two tries is 0.6 + 0.24 = 0.84. Alternatively, we 
can compute the probability of failure in two attempts: P(Fı N F2) = 0.16. Then the 
probability of success with two tries would be the complement of the probability of two 
failures: 1 — 0.16 — 0.84. 


e Example 16.2.14 


Consider the tossing of a fair die. Let event A denote the tossing of a 2 or 3. Then 
P(A) = P({2,3}) = 1. Let event B denote tossing an odd number, B = {1,3,5}. Then 
PB) =<. 

Now AN B = (3) and P(An B) = $. Because P(An B) = P(A)P(B), events A and 
B are independent. 


Often we can characterize each outcome of an experiment consisting of n experiments 
as either a “success” or a “failure.” If the probability of each individual success is p, 
then the probability of k successes and n — k failures is p*(1 — p)"~*. Because there are 
n!/[k!(n — k)!] ways of achieving these k successes, the probability of an event having k 
successes in n independent trials is 


(16.2.26) 
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where p is the probability of a success during one of the independent trials. 
e Example 16.2.15 


What is the probability of having two boys in a four-child family? 
Let us assume that the probability of having a male is 0.5. Taking the birth of one 
child as a single trial, 


4 
P4(2) = a (5) = : (16.2.27) 
Note that this is not 0.5, as one might initially guess. 

Problems 
1. For the following experiments, describe the sample space: 
(a) flipping a coin twice. 
(b) selecting two items out of three items {a,b,c} without replacement. 


(c) selecting two items out of three items {a, b,c} with replacement. 


(d) selecting three balls, one by one, from a box that contains four blue balls and five green 
balls without replacement. 


(e) selecting three balls, one by one, from a box that contains four blue balls and five green 
balls with replacement. 


2. Consider two fair dice. What is the probability of throwing them so that the dots sum 
to seven? 


3. In throwing a fair die, what is the probability of obtaining a one or two on the top side 
of the cube? 


4. What is the probability of getting heads exactly (a) twice or (b) thrice if you flip a fair 
coin 6 times? 


5. An urn contains six red balls, three blue balls, and two green balls. Two balls are 
randomly selected. What is the sample space for this experiment? Let X denote the 
number of green balls selected. What are the possible values of X? Calculate P(X — 1). 


6. Consider an urn with 30 blue balls and 50 red balls in it. These balls are identical except 
for their color. If they are well mixed and you draw 3 balls without replacement, what is 
the probability that the balls are all of the same color? 


7. A deck of cards has 52 cards, including 4 jacks and 4 ten's. What is the probability of 
selecting a jack or ten? 


8. Two boys and two girls take their place on a stage to receive an award. What is the 
probability that the boys take the two end seats? 
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9. A lottery consists of posting a 3-digit number given by selecting 3 balls from 10 balls, 
each ball having the number from 1 to 10. The balls are not replaced after they are drawn. 
What are your chances of winning the lottery if the order does not matter? What are your 
chances of winning the lottery if the order does matter? Write a short MATLAB code and 
verify your results. You may want to read about the MATLAB intrinsic function randperm. 


10. A circle of radius 1 is inscribed in a square with sides of length 2. A point is selected 
at random in the square in such a manner that all the subsets of equal area of the square 
are equally likely to contain the point. What is the probability that it is inside the circle? 


11. In a rural high school, 2096 of the students play football and 1096 of them play football 
and wrestle. If Ed, a randomly selected student of this high school, played football, what 
is the probability that he also wrestles for his high school? 


12. You have a well-shuffled card deck. What is the probability the second card in the deck 
is an ace? 


13. We have two urns: One has 4 red balls and 6 green balls, the other has 6 red and 4 
green. We toss a fair coin. If heads, we pick a random ball from the first urn, if tails from 
the second. What is the probability of getting a red ball? How do your results compare 
with the probability of getting a red ball if all of the red and green balls had been placed 
into a single urn? 


14. A customer decides between two dinners: a “cheap” one and an “expensive” one. The 
probability that the customer chooses the expensive meal is P(E) — 0.2. A customer who 
chooses the expensive meal likes it with a 80% probability P(L|E) = 0.8. A customer who 
chooses the cheap meal dislikes it with 70% probability P(D|C) = 0.7. 


(a) Compute the probability that a customer (1) will choose a cheap meal, (2) will be 
disappointed with an expensive meal, and (3) will like the the cheap meal. 


(b) Use the law of total probability to compute the probability that a customer will be 
disappointed. 


(c) If a customer found his dinner to his liking, what is the probability that he or she chose 
the expensive meal? Hint: Use Bayes' theorem. 


15. Suppose that two points are randomly and independently selected from the interval 
(0, 1). What is the probability the first one is greater than 1/4, and the second one is less 
than 3/4? Check your result using rand in MATLAB. 


16. A certain brand of electronics chip is found to fail prematurely in 196 of all cases. If 
three of these chips are used in three independent sets of equipment, what is the probability 
that (a) all three will fail prematurely, (b) that two will fail prematurely, (c) that one will 
fail prematurely, and (d) that none will fail? 


Project: Experimenting with MATLAPB's Intrinsic Function rand 


The MATLAB function rand can be used in simulations where sampling occurs with 
replacement. If we write X = rand(1,100), the vector X contains 100 elements whose values 
vary between 0 and 1. Therefore, if you wish to simulate a fair die, then we can set up the 
following table: 
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0 « X « 1/6 die with one dot showing 
1/6 « X « 1/3 die with two dots showing 
1/3 « X « 1/2 die with three dots showing 
1/2 « X « 2/3 die with four dots showing 
2/3 « X « 5/6 die with five dots showing 
5/6«X«1 die with six dots showing. 


We can then write MATLAB code that counts the number of times that we obtain a one or 
two. Call this number n. Then the probability that we would obtain one or two dots on a 
fair die is n/100. Carry out this experiment and compare your answer with the result from 
Problem 2. What occurs as you do more and more experiments? 


Project: Experimenting with 
MATLAPB's Intrinsic Function randperm 


MATLAB’s intrinsic function randperm(m) creates a random ordering of the numbers 
from 1 to m. If you execute perm = randperm(365), this would produce a vector of length 
365 and each element has a value lying between 1 and 365. If you repeat the process, you 
would obtain another list of 365 numbers but they would be in a different order. 

Let us simulate the birthday problem. Invoking the randperm command, use the first 
element to simulate the birthday of student 1 in a class of N students. Repeatedly invoking 
this command, create vector birthdays that contains the birthdays of the N students. Then 
find out if any of the days are duplicates of another. (Hint: You might want to explore the 
MATLAB command unique.) Repeating this experiment many times, compute the chance 
that a class of size N has at least two students that have the same birthday. Compare your 
results with Equation 16.2.4. What occurs as the number of experiments increases? 


16.3 DISCRETE RANDOM VARIABLES 


In the previous section we presented the basic concepts of probability. In high school 
algebra you were introduced to the concept of a variable - a quantity that could vary unlike 
constants and parameters. Here we extend this idea to situations where the variations are 
due to randomness. 

A random variable is a single-valued real function that assigns a real number, the value, 
to each sample point t of S. The variable can be discrete, such as the flipping of a coin, or 
continuous, such as the lifetime of a light bulb. The sample space S is the domain of the 
random variable X(t), and the collection of all numbers X(t) is the range. Two or more 
sample points can give the same value of X (t) but we will never allow two different numbers 
in the range of X(t) for a given t. 

The term “random variable" is probably a poor one. Consider the simple example of 
tossing a coin. À random variable that describes this experiment is 


o J1, sı = head, 
X[si] = { 0, d (16.3.1) 


An obvious question is: What is random about Equation 16.3.1? If a head is tossed, we 
obtain the answer one; if a tail is tossed, we obtain a zero. Everything is well defined; 
there is no element of chance here. The randomness arises from the tossing of the coin. 
Until the experiment (tossing of the coin) is performed, we do not know the outcome of 
the experiment and the value of the random variable. Therefore, a random variable is a 
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Figure 16.3.1: The probability mass function for a fair die. 


variable that may take different values if a random experiment is conducted and its value is 
not known in advance. 
We begin our study of random variables by focusing on those arising from discrete 
events. If X is discrete, X assumes only finitely many or countably many values: £1, £2, £3, 
. For each possible value of z;, there is a corresponding positive probability px [ai] = 
P(X = zı), px|z2] = P(X = 22),... given by the probability mass function. For values of 
x different from x;, say xı < x < zs, the probability mass function equals zero. Therefore, 
we have that 


Pi, T = Ti, 
px [ei] = { 0, otherwise, (16.3.2) 
where i = 1,2,3,.... A family of discrete random variables having the same probability 


mass family is called identically distributed. ‘ 
e Example 16.3.1 
Consider a fair die. We can describe the results from rolling this fair die via the discrete 


random variable X, which has the possible values x; = 1,2,3,4,5,6 with the probability 
px [ui] = i each. Note that 0 < px[v;] < 1 here. Furthermore, 


6 
2px [ag] = 1. (16.3.3) 


Figure 16.3.1 illustrates the probability mass function. 


e Example 16.3.2 


Let us now modify Example 16.3.1 so that 
1, si = 1,2, 
X[si] = i 2, 5i = 3,4, (16.3.4) 
; = 95,6 


The probability mass function becomes 


px(1] = px [2] = »x[3] = 5. (16.3.5) 


Probability 819 


Some Properties of the Probability Mass Function px [xj] 


0 € px [rx] < 1, px|[r]—- 0 if xÆ Zk, Rae ad 
S px[2n] =1 


Fx (2) = P(X € zx) = X` pxlos] 


ERST 
P(a < z <b) = 5 px [zk] 
alt «b 
e Example 16.3.3 
Consider the probability mass function: 
|] k(1/2)”, n=0,1,2,..., 
px [tn] = { 0, otherwise. 1039) 


Let us (a) find the value of k, (b) find P(X = 2), (c) find P(X x 2), and (d) P(X > 1). 
From the properties of probability mass function, 


Sei 1 
kN (3) =k = 2k = 1. (16.3.7) 
n=0 


Therefore, k = i. Note that 0 € px [v] € 1. 
Having found k, we immediately have 


P(X = 2) = px[v2] = §, (16.3.8) 
P(X < 2) = px [zo] + px [zi] + px [v2] = §, (16.3.9) 

and 
P(X >1)=1- Px =o = 5. (16.3.10) 


Having introduced the probability mass function, an alternative means of describing the 
probabilities of a discrete random variable is the cumulative distribution function. It is 
defined as 


Fy (x) = P(X < x), —oo < T < oo. (16.3.11) 
It is computed via 
= X pxļe]= J pi (16.3.12) 
Ti KT Ti KT 


Consequently, combining Equation 16.3.11 and Equation 16.3.12, we obtain 


Pa@<esb)= X` pi (16.3.13) 


a«aixb 


Equation 16.3.13 gives the probability over the interval (a, b]. 
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e Example 16.3.4 


A Bernoulli experiment is a random experiment, the outcome of which is a success 
or failure. Consider now a sequence of independent Bernoulli trials with probability p of 
success from trial to trial. This sequence is observed until the first success occurs. Let X 
denote a random variable that equals the trial number on which the first success occurs. 
The probability mass function is then 


px(tn] = (1 — p)*^!p, n=1,2,3,.... (16.3.14) 


Let us compute the cumulative distribution function. 
For geometric series, we begin by noting that 


a 


n n—1 
J = J = 1. 16.3.15 
ze a 1 —r! Irl < ( ) 


Next we check Equation 16.3.14 and determine whether it is a valid probability mass func- 
tion. It is because 


2x [£n] = 2.0 —p)"""p= eae =1, (16.3.16) 
where we used Equation 16.3.15. Next, we note that 
P(X >m)= 5 (1— pip = g ur = (1— p)”. (16.3.17) 
n=m+1 —(1—p) 
Therefore, 
Fx(r)—- P(X <m)=1-P(X >m)=1-(1-p)”, (16.3.18) 


where m = 1,2,3,.... 


e Example 16.3.5: Generating discrete random variables via MATLAB 


In this example we show how to generate a discrete random variable using MATLAB's 
intrinsic function rand. This MATLAB command produces random, uniformly distributed 
(equally probable) reals over the interval (0, 1). How can we use this function when in the 
case of discrete random variables we have only integer values, such as k = 1, 2,3,4,5,6, in 
the case of tossing a die?” 

Consider the Bernoulli random variable X — k, k — 0,1. As you will show in your 
homework, it has the cumulative distribution function of 


0, c <0, 
Fx(x)-— | l—-p, 0<a<l, (16.3.19) 
1 Lag, 


, 


5 This technique is known as the inverse transform sampling method. See pages 85-102 in Devroye, L., 
1986: Non-Uniform Random Variable Generation. Springer-Verlag, 843 pp. 
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Figure 16.3.2: The cumulative distribution function for a Bernoulli random variable. 


See Figure 16.3.2. 

Imagine now a program that includes the MATLAB function rand, which yields the 
value t. Then, if 0 < t < 1 — p, Figure 16.3.2 gives us that X = 0. On the other hand, if 
l1—p<t< 1, then X = 1. Thus, to obtain M realizations of the Bernoulli random variable 
X, the MATLAB code would read for a given p: 


clear; 
for i = 1:M 
t - rand(1,1); 
if (t <= 1-p) X(1,1) = 0; 
else 
X(i,1) = 1; 
end; end 


'The end product of this code creates a vector X of length M consisting of a random variable 
with either zeros or ones. This is shown in Figure 16.3.3(a) when p — 0.4. 

Once we have generated this random variable, we can use its relative frequency to 
compute its probability mass function and cumulative distribution function from 


Number of outcomes equal to k 
M , 


Px [xr] = (16.3.20) 


and 
Number of outcomes < x 


M 
In Figure 16.3.3(b) we have computed the value of fx [1]. Clearly it should equal p. As this 
figure shows, we obtain poor results when M is small, with Dx [1] moving randomly above 
and below the correct answer. As M becomes larger, our estimate improves. 


Fx(xz) = 


(16.3.21) 


Problems 
1. The Bernoulli distribution has the probability mass function 


px [tp] = P(X =k) = p*(1—p)'-*, k=0,1, 
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Figure 16.3.3: (a) Outcomes of the Bernoulli random variable generated by the MATLAB function rand. 
(b) The computed value of the probability mass function px [1] as a function of M realization of the Bernoulli 
random variable. The dashed line is the line for the exact answer p — 0.4. 


where 0 € p € 1. (a) Show that this distribution is a valid probability mass function. (b) 
Find its cumulative distribution function. 


2. An experiment is performed where a digit, ranging from 0 to 9, is repeatedly and 
randomly chosen. If X denotes the times that this experiment must be repeated until the 
digit 0 is selected, find P(X). 


3. A scientific company needs a programmer who knows an unusual programming language. 
If only 596 of programmers know this language, how many programmers should the company 
interview to have a 75% chance of finding such a programmer? 


16.4 CONTINUOUS RANDOM VARIABLES 


In the previous section we examined random variables that can assume only certain 
discrete values. Here we extend the concept of random variables so that they can take on 
values over a continuous interval. Typical examples of continuous random variables include 
the noisy portion of the voltage within an amplifier, the phase of a propagating wave, and 
the amount of precipitation. 

An important quantity that we introduced in the previous section was the probability 
mass function. What is the corresponding function for continuous random variables? From 
the fundamental concepts of probability, we know that the probability of a continuous 
variable assuming one specific value out of its possible range values equals zero; it is merely 
one point out of an infinite number of points in the sample space. On the other hand, there 
is a finite probability that the value assumed by the random variable X will lie within an 
arbitrarily small interval dx and this probability will depend on the length of the interval. 

Another factor that should influence the probability is the value of x. There is no 
reason why the probability of X should be independent of x. Consequently, an equation 
for probability in the interval x < X < x + dz requires a function px (a), which acts as a 
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Figure 16.4.1: A probability density function. 


weighting function and models the relative frequency behavior of X. For these reasons, the 
probability that a continuous random variable X will assume a value lying between x and 
x + dz is given by 

P(x < X < z + dr) = px (x) dz. (16.4.1) 


Figure 16.4.1 illustrates a possible example of px(x) where the shaded area equals the 
probability P(x < X € xz + dz). Clearly the function px(x) = P(x < X < x + dx)/dx 
has the dimension of probability per infinitesimal interval dz and is called, for that reason, 
the probability density. Furthermore, although px(r)dr < 1, this does not mean that 
px(z) € 1. A family of random variables having the same probability density is identically 
distributed. 

The function px (z) must also satisfy several additional conditions. Because probability 
cannot be negative, px (x) > 0 of all x. Furthermore, as Figure 16.4.1 suggests, if we add 
up all of the possible values of x, then we have a certain event. We can express this 
mathematically by 


a px(z) dz — 1. (16.4.2) 


Thus, a probability density has the properties given by Equation 16.4.1 and Equation 16.4.2. 
It must also be a single-valued function of x. Note that these conditions do not require that 
px(x) is a continuous function of z. 

Let us now consider the probability P(a < X < b) where a and b are constants. If 
we subdivide the range of x between a and b into infinitesimal intervals (r,x + dx), the 
probability that the random variable will assume a value from one such interval is given by 
Equation 16.4.1. The probability that the variable will assume a value in the interval (a, b) 
equals the sum of the probabilities from each subinterval between a and b and is given by 
the area under the curve p(x) between x = a and x = b. Therefore, 


b 


P(a« X <b) =| px (x) da. (16.4.3) 
If a — —oo, we have that 
b 
P(X <b)= / px (x) da. (16.4.4) 


Alternatively, setting b — oo, 


P(a« X)— 1. px (x) da. (16.4.5) 
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Some Properties of the Probability Density Function px (2) 


From Equation 16.4.3 we also have 


P(X >a) =1-P(X <a)=1 -J px (x) dx = i px (x) da. (16.4.6) 
From Equation 16.4.4 we now define 
Fe(0)= PX <2) =f px (Gab. (16.4.7) 


This function Fx (x) is called the cumulative distribution function, or simply the distribution 
function, of the random variable X. Clearly, 


px(z) = Fx (2). (16.4.8) 


Therefore, from the properties of px (x), we have that (1) Fx (x) is a nondecreasing function 
of x, (2) Fx(—oo) = 0, (3) Fx(co) = 1, and (4) P(a < X x b) = Fx(b) — Fx (a). 


e Example 16.4.1 


The continuous random variable X has the probability density function 


k (x — 2?) 0cz«l 
= i : 16.4.9 
px(2) { 0, otherwise. ( ) 


What must be the value of k? What is the cumulative distribution function? What is 
P(X < 1/2)? 
From Equation 16.4.2, we have that 


oo 1 2 3 k 
i px(a)de =k f (a — a?) dr=k > -Z| =, (16.4.10) 
MS 0 2 3l; 6 
Therefore, k must equal 6. 
Next, we note that 
Fe(0)=P(X <2) =f pxO d. (16.4.11) 
If x < 0, Fx(x) 20. For 0 € x <1, then 
x e £3 L2 
Fx (x) = 6 | (E- £) dé =6 (S - >) = 3x? — 22°. (16.4.12) 
0 0 
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Figure 16.4.2: The cumulative distribution function for an exponential random variable. 


Finally, if x > 1, 


1 
Fy(x) = 6 | (£ — €?) d£ — 1. (16.4.13) 
0 
In summary, 
0, 0m, 
Fx(a)— | 3x7 — 223, 0<z<1, (16.4.14) 
1, l«zx. 


Because P(X < x) = Fx(z), we have that P(X < 2) = 3 and P(X > 2) 2 1— P(X < 
1) — 1 
-1 


e Example 16.4.2: Generating continuous random variables via MATLAB 


In the previous section we showed how the MATLAB function rand can be used to gen- 
erate outcomes for a discrete random variable. Similar considerations hold for a continuous 
random variable. 

Consider the exponential random variable X. Its probability density function is 


0, cz «0, 
px(«) = lacis es (16.4.15) 


where À > 0. For homework you will show that the corresponding cumulative distribution 


function is 
0, x <0, 


Fx(z)— { 1— e^ eu. (16.4.16) 


Figure 16.4.2 illustrates this cumulative density function when A = 1. How can we use these 
results to generate a MATLAB code that produces an exponential random variable? 

Recall that both MATLAB function rand and the cumulative distribution function pro- 
duce values that vary between 0 and 1. Given a value from rand, we can compute the 
corresponding X = x, which would give the same value from the cumulative distribution 
function. In short, we are creating random values for the cumulative distribution function 
and using those values to give the exponential random variable via 


X = y = — ln (1 — rand) /A, (16.4.17) 


6 This technique is known as the inverse transform sampling method. See pages 27-39 in Devroye, L., 
1986: Non-Uniform Random Variable Generation. Springer-Verlag, 843 pp. 
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Figure 16.4.3: (a) Outcomes of a numerical experiment to generate an exponential random variable using 
the MATLAB function rand. (b) The function fx(x) given by Equation 16.4.18 as a function of x for 
an exponential random variable with M = 1000. The dashed black line is the exact probability density 
function. 


where we have set Fx(z) = rand. Therefore, the MATLAB code to generate exponential 
random variables for a particular lambda is 


clear; 
for i = 1:M 

t - rand(1,1); 

X(i,1) = -log(1-t) / lambda; 
end 


where M is the number of experiments that we run. In Figure 16.4.3(a) we illustrates the first 
200 outcomes from our numerical experiment to generate an exponential random variable. 

'To compute the probability density function we use the finite difference approximation 
of Equation 16.4.1, or 


. Number of outcomes in [xo — Az/2, xo + Ax/2] 
B(zo) = MAr , 


(16.4.18) 


where Az is the size of the bins into which we collect the various outcomes. Figure 16.4.3(b) 
illustrates this numerical estimation of the probability density function in the case of an 
exponential random variable. The function jx (x) was created from the MATLAB code: 


clear; 
delta x = 0.2; lambda = 1; M = 1000; % Initialize Az, A and M 
^4 sample M outcomes from the uniformly distributed distribution 
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t = rand(M,1); 

^ generate the exponential random variable 

x = - log(1-t)/lambda; 

^ create the various bins [ro — Ax/2, xo + Az/2] 
bincenters=[delta_x/2:delta_x:5]; 
bins-length(bincenters); % count the number of bins 
^ now bin the M outcomes into the various bins 

[n,x out] = hist(x,bincenters); 

n =n / (delta x*M); 4 compute the probability per bin 
bar.h = bar(x_out,n); % create the bar graph 

bar child = get(bar h,'Children?); 

set (bar_child,’CData’ ,n); 

colormap (Autumn) ; 


Problems 
1. The probability density function for the exponential random variable is 


0, x <0, 
px (x) = der 0czc 


with A > 0. Find its cumulative distribution function. 
2. Given the probability density function 


E ka, 0cz«2, 
Bur 0, otherwise, 


where k is a constant, (a) compute the value of k, (b) find the cumulative density function 
Fx (x), and (c) find the P(1 < X < 2). 


3. Given the probability density function 


 fkü-l,  leļ<1, 
px(a) = | 0, l| > 1, 


where k is a constant, (a) compute the value of k and (b) find the cumulative density 
function Fx (x). 


Project: Central Limit Theorem 


Consider the sum S = (X4 + X2 + X3 +--+- + X190)/100, where X; is the ith sample 
from a uniform distribution. 


Step 1: Write a MATLAB program to compute the probability density function of S. See 
Figure 16.4.4. 


Step 2: The central limit theorem states the distribution of the sum (or average) of a 
large number of independent, identically distributed random variables will be approximately 
normal, regardless of the underlying distribution. Do your numerical results agree with this 
theorem? 
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probability density 


0.35 0.4 0.45 0.5 0.55 0.6 0.65 


Figure 16.4.4: Computed probability density function for the sum S = (X1 + X2 + Xa +---+X100)/100, 
where X; is the ith sample from a uniform distribution. 


16.5 MEAN AND VARIANCE 


In the previous two sections we explored the concepts of random variable and distribu- 
tion. Here we introduce two parameters, mean and variance, that are useful in characterizing 
a distribution. 

The mean py is defined by 


5 £k Ppx[£k], X discrete, 
px -E(X)24 6 (16.5.1) 
/ xpx(a)dx, X continuous. 


— oo 


The mean provides the position of the center of the distribution. The operator E(X), which 
is called the expectation of X , gives the average value of X that one should ezpect after many 
trials. 

Two important properties involve the expectation of the sum and product of two ran- 
dom variables X and Y. The first one is 


E(X +Y) = E(X) - E(Y). (16.5.2) 
Second, if X and Y are independent random variables, then 
E(XY) = E(X)E(Y). (16.5.3) 


The proofs can be found elsewhere." 
'The variance provides the spread of a distribution. It is computed via 


o% = Var(X) = E([X — E(X)?), (16.5.4) 


T For example, Kay, S. M., 2006: Intuitive Probability and Random Processes Using MATLAB. Springer, 
833 pp. See Sections 7.7 and 12.7. 
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Or 
s uxYpx|m], X discrete, 
o% = us (16.5.5) 
1 (r— ux)°px(x)dz, X continuous. 
If we expand the right side of Equation 16.5.4, an alternative method for finding the variance 
is 


oy = Var(X) = E(X?) - |[E(X)P?, (16.5.6) 


where 
5 x, Px|Le], X discrete, 
k 


E(X”) = có (16.5.7) 
J x"px(r)dx, X continuous. 
e Example 16.5.1: Mean and variance of M equally likely outcomes 
Consider the random variable X = k where k = 1,2,..., M. If each event has an 


equally likely outcome, px |z] = 1/M. Then the expected or average or mean value is 


M 


1 M(M+1) M+1 
ux = 7p = E a (16.5.8) 


Note that the mean does not equal any of the possible values of X. Therefore, the expected 
value need not equal a value that will be actually observed. 
Turning to the variance, 


Var(X) = (M + 1) (GM + 1)/6 — (M +1)/4] (16.5.9) 
= (M - 1) 4M - 2- 3M - 3] /12 (16.5.10) 
= (M +1)(M - 1)/12 = (M? - 1)/12, (16.5.11) 


because 


M 
EQ = zi |. M(M uu ) _ (M+ DOM 1) TIS 


We used Equation 16.5.6 to compute the variance. 


e Example 16.5.2 


Let us find the mean and variance of the random variable X whose probability density 
function is 


_ J ka, 0<a<l, 
px (2) = { 0, otherwise. 659) 
From Equation 16.5.1, we have that 
1 311l 
k k 
ux = E(X) =| (kx) dz = ©] = =. (16.5.14) 
0 3 |) 3 
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From Equation 16.5.6, the variance of X is 


1 2 A; 2 2 
o% = Var(X) = EX*)-LEQOF = f Phom a -F 2 (16.5.15) 
0 


e Example 16.5.3: Characteristic functions 
The characteristic function of a random variable is defined by 
x(w) = Elexp(iwX)]. (16.5.16) 


If X is a discrete random variable, then 
ox(w)= M px[zalei. (16.5.17) 


On the other hand, if X is a continuous random variable, 


ox(w) = T px (x)e* dz, (16.5.18) 


— oo 


the inverse Fourier transform (times 27) of the Fourier transform, px (x). 
Characteristic functions are useful for computing various moments of a random variable 
via r 
1 d” w 


16.5.19 
in dw” ( ) 


w=0 
This follows by taking repeated differentiation of Equation 16.5.16 and then evaluating the 
differentiation at w = 0. 

Consider, for example, the exponential probability density function px(x) = Ae? 
with x, > 0. A straightforward calculation gives 


À 
ox(w) = -. (16.5.20) 
À — wi 
Substituting Equation 16.5.20 into Equation 16.5.19 yields 
n! 
In particular, 
1 2 2 
E(X) = ` and E(X*) = Pos (16.5.22) 


Consequently, wx = 1/A and Var(X) = E(X?) — u% =1/)?. 
e Example 16.5.4: Characteristic function for a Gaussian distribution 


Let us find the characteristic function for the Gaussian distribution and then use that 
characteristic function to compute the mean and variance. 
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Because 


1 
px(x) = e (eH /Qe^) (16.5.23) 
2T O 


the characteristic function equals 


1 e ] 
éx(u) = Fe J e- 8-1 (Qe?) +iwe qu. (16.5.24) 
"DEN 1 SP (x — u — iwo?)? 
— wu- w /2 J ex | dx 16.5.25 
Us m€— 20? l 
E cien o? w? [2 (16.5.26) 


because the quantity within the wavy brackets equals one. 
Given this characteristic function, Equation 16.5.26, we have that 


ylw) = (ip — ^u eion-etut [a (16.5.27) 
Therefore, $^. (0) = ip and from Equation 16.5.19, ux = E(X) = u. Furthermore, 


n 


(o) = (i — 0? 


wylewu ut [2 S2 iut [2 (16.5.28) 
Consequently, 94 (0) = —4? — o? and Var(X) = E(X?) — uł = 0°. 


Problems 


1. Let X(s) denote a discrete random variable associated with a fair coin toss. Then 


0, s — tail, 
X(s) = { s = head. 


Find the expected value and variance of this random variable. 

2. The geometric random variable X has the probability mass function: 
px |x] = P(X =k) = p(1—p)*, k —1,2,3,.... 

Find its mean and variance. Hint: 


kr 1- k(k — 1)? = xc Leg 


and E(X?) = E[X(X — 1) + E(X). 
3. Given 


OE kz(2 — x) 0 « x «2, 
PAN 0, otherwise, 


(a) find k and (b) its mean and variance. 


4. Given the probability density 
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find its characteristic function using integral tables. 


For the following distributions, first find their characteristic functions. Then compute the 
mean and variance using Equation 16.5.19. 


5. Binomial distribution: 
[9 k n-k 
pxl] = (p) x, 0<p<1, 


where q = 1 — p. Hint: Use the binomial theorem to simplify Equation 16.5.17. 


6. Poisson distribution: : 
À 
—A Em 


(OSA. 


px [tr] =e 


7. Geometric distribution: 
px[zk] =p, 0<p<1, 


where q = 1 — p. 
8. Uniform distribution: 


H(x — a) — H(x — b) 


b—a , 


px (x) = 


Project: MATLAB’s Intrinsic Function mean and var 


MATLAB has the special commands mean and var to compute the mean and variance, 
respectively, of the random variable X. Use the MATLAB command randn to create a 
random variable X(n) of length N. Then, find the mean and variance of Xn). How do these 
parameters vary with N? 


Project: Monte Carlo Integration and Importance Sampling 


Consider the integral / — J V1— z? dx = 1/4. If we were to compute it numerically 
by the conventional midpoint rule, the approximate value is given by 


In => fs) (1) 


where f(x) = V1 — x? and x, = (n — 1/2)/N. For N = 10, 50, 100, and 500, the absolute 

value of the relative error is 2.7 x 107°, 2.4 x 1074, 8.6 x 1075, and 7.7 x 107, respectively. 
Monte Carlo integration is a simple alternative method for doing the numerical inte- 

gration using random sampling. It is a particularly powerful technique for approximating 

complicated integrals. Here you will explore a simple one-dimensional version of this scheme. 
Consider the random variable: 


Iu — v; fem), (2) 
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M- 10 M - 50 
6 14, 
5| 12 
E 
2, p i 
o 
o 8 
EE 
s] 6 
c 
Q2 
9 4 
a 
" 
Ex 02 Ex -0.05 0 0.05 0.1 
M - 100 M - 500 
20, = 50, 
215 Er 
E] 
2 
29 30 
EET, 
E 20 
Q 
Ss 
e 10) 
-Oo -0.06 70.04 =0.02 0 0.02 0.04 , 0.06 0.08 oa -0.03 70.02 0.01 0 001 0.02 , 0.03 0.04 
approximate — exact value of the integral approximate — exact value of the integral 
Figure 16.5.1: The probability density function arising from using Monto Carlo integration to compute 


fh V1 — x? dx for various values of M. 


where £m is the mth sample point taken from the uniform distribution. Im is a random 
variable because it is a function of the random variable £m. Therefore, 


M M a 


E(u) = 57 Y, Elem) = 57 Df ftp) de 
M 1 1 
- p flayae= f f(a)ae=1, 


because p(x), the probability of the uniform distribution, equals 1. Furthermore, as we 
increase the number of samples M, Im approaches I. By the strong law of large numbers, 
this limit is guaranteed to converge to the exact solution: P(limy. so Z4 — I) = 1. Equation 
(2) is not the midpoint rule because the uniform grid x, has been replaced by randomly 
spaced grid points. 


Step 1: Write a MATLAB program that computes Iw for various values of M when £m is 
selected from a uniform distribution. By running your code thousands of times, find the 
probability density as a function of the difference between Ij, and J. Compute the mean 
and variance of Im. How does the variance vary with M? See Figure 16.5.1. 


'The reason why standard Monte Carlo integration is not particularly good is the fact 
that we used a uniform distribution. A better idea would be to sample from regions where 
the integrand is larger. This is the essence of the concept of importance sampling: That 
certain values of the input random variable £m in a simulation have more impact on the 
parameters being estimated than others. 

We begin by noting that 


[p _ f f) 
=f f(x)dx = ; nay 
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where pi(r) is a new probability density function that replaces the uniform probability 
distribution and is relatively larger when f(x) is larger and relatively smaller when f(z) is 
smaller. 

The question now becomes how to compute pi(x). We shall use the VEGAS algo- 
rithm, which constructs p;(x) by sampling f(x) K times, where K < M. Within each kth 
subinterval we assume that there are M/K uniformly distributed points. Therefore, 


K f (Sm) 
Ba 


where s; is the center point of the kth subinterval within which the mth point is located. 
For each m, we must find £m. This is done in two steps: First we randomly choose the kth 
subinterval using a uniform distribution. Then we randomly choose the point r,, within 
that subinterval using a uniform distribution. Therefore, our modified integration scheme 


pi (Lm) = 


becom 
ecomes : m PE 
WM 2 pi(zm): (3) 
Now, 
1 © dis 1 4 1 ig 
Elm) = 55 E E. 2, n M Cm (r)potz) dx 


1 M 1 1 
"uj fG)ds = f f(x) dz = I, 


because po(x) = 1. 


Step 2: Write a MATLAB program that computes I, for various values of K for a fixed value 
of M. Recall that you must first select the subdivision using the MATLAB function rand 
and then the value of £m within the subdivision using a uniform distribution. By running 
your code thousands of times, find the probability density as a function of the difference 
between Im and J. Compute the mean and variance of Im. How does the variance vary 
with M? See Figure 16.5.2. 


16.6 SOME COMMONLY USED DISTRIBUTIONS 


In the previous sections we introduced the concept of probability distributions and their 
description via mean and variance. In this section we focus on some special distributions, 
both discrete and continuous, that appear often in engineering. 


| Bernoulli distribution 


Consider an experiment where the outcome can be classified as either a success or 
failure. The probability of a success is p and the probability of a failure is 1 — p. Then these 
“Bernoulli trials” have a random variable X associated with them where the probability 
mass function is given by 


px|z.] = P(X =k) = pF(1— p), k=0,1, (16.6.1) 
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Figure 16.5.2: The probability density function arising from using importance sampling with Monto Carlo 


integration to compute s V1 — x? dz for various values of K and M = 100. 
8 0 


where 0 € p € 1. From Equation 16.3.12 the cumulative density function of the Bernoulli 
random variable X is 


0, c <0, 
Fy(x) = | l—p, mamai, (16.6.2) 
E, 1<a. 


The mean and variance of the Bernoulli random variable X are 


ux = E(X) =p, and o% = Var(X) = p(1— p). (16.6.3) 


e Example 16.6.1 


A simple pass and fail process is taking a final exam, which can be modeled by a 
Bernoulli distribution. Suppose a class passed a final exam with the probability of 0.75. If 
X denotes the random variable that someone passed the exam, then 


E(X)=p=0.75, and ~~ Var(X) = p(1 — p) = (0.75) (0.25) = 0.1875. — (16.6.4) 


| Geometric distribution 


Consider again an experiment where we either have success with probability p or failure 
with probability 1—p. This experiment is repeated until the first success occurs. Let random 
variable X denote the trial number on which this first success occurs. Its probability mass 
function is 


pxlzą] = P(X =k) = p(1— p), ees en (16.6.5) 

From Equation 16.3.12 the cumulative density function of this geometric random variable 
X is 

Fx(xz)— P(X € x) 21— (1— p)". (16.6.6) 
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The mean and variance of the geometric random variable X are 


ux = E(X)=-, and =o = Var(X) = —-. (16.6.7) 


1 
p p 
e Example 16.6.2 

A particle within an accelerator has the probability 0.01 of hitting a target material. 
(a) What is the probability that the first particle to hit the target is the 50th? (b) What is 
the probability that the target will be hit by any particle? 


P(first particle to hit is the 50th) = 0.01(0.99)*° = 0.0061. (16.6.8) 


50 
P(target hit by any of first 50th particles) = » 0.01(0.99)"—! = 0.3950. (16.6.9) 
n=1 


e Example 16.6.3 


The police ticket 5% of parked cars. Assuming that the cars are ticketed independently, 
find the probability of 1 ticket on a block with 7 parked cars. 
Each car is a Bernoulli trial with P(ticket) = 0.05. Therefore, 


P(1 ticket on block) = P(1 ticket in 7 trials) = @ (0.95)°(0.05) = 0.2573. (16.6.10) 


| Binomial distribution 


Consider now an experiment in which n independent Bernoulli trials are performed and 
X represents the number of successes that occur in the n trials. In this case the random 
variable X is called binomial with parameters (n, p) with a probability mass function given 
by 


px|z,] = P(X =k) = i p'(10—p7*,  k=0,1,...,n, (16.6.11) 


where 0 < p < 1, and 


n n! 
k) T Hn K id 
( ) ( kj (16.6.12) 


the binomial coefficient. The term p* arises from the k successes while (1 — p)"-^ is due 


to the failures. The binomial coefficient gives the number of ways that we pick those k 
successes from the n trials. 
The corresponding cumulative density function of X is 


Fx(z) 2 M. (g) -p n€Xz«mn-l. (16.6.13) 
k=0 
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The mean and variance of the binomial random variable X are 
ux = E(X) =np, and o? = Var(X) = np(1 — p). (16.6.14) 


A Bernoulli random variable is the same as a binomial random variable when the parameters 
are (1,n). 


e Example 16.6.4 


Let us find the probability of rolling the same side of a die (say, the side with N dots 
on it) at least 3 times when a fair die is rolled 4 times. 

During our 4 tosses, we could obtain no rolls with N dots on the side (k = 0), one roll 
with N dots (k = 1), two rolls with N dots (k = 2), three rolls with N dots (k = 3), or 
four rolls with N dots (k — 4). If we define A as the event of rolling a die so that the side 
with N dots appears at least three times, then we must add the probabilities for k = 3 and 
k — 4. Therefore, 


P(A) = px [za] + px [v4] = (S) P qiu (1) p*(1— p)? (16.6.15) 


a! 
=e eps 


4(1 — p)? = 0.0162 (16.6.16) 


because p = L, 


e Example 16.6.5 


If 10 random binary digits are transmitted, what is the probability that more than 
seven l’s are included among them? 
Let X denote the number of 1’s among the 10 digits. Then 


P(X > 7) = P(X =8) a 9) + P(X = i» = px [zs] Dix px Eno] (16.6.17) 
-(3) G) G) +(2) G) (9 * (5) @) G) aem 


= (45+ 10+ 1) G) (16.6.19) 


2 


| Poisson distribution 


The Poisson probability distribution arises as an approximation for the binomial dis- 
tribution as n — oo and p > 0 such that np remains finite. To see this, let us rewrite the 
binomial distribution as follows: 


n! AM AVTE n(n—1)(n—2)--(n— k-- 1) A^ (1— A/n)^ 
( ) ( 2) E nk nl (1 An)! 
(16.6.20) 
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if A = np. For finite A, 


AY ! 22 NN 
lim [1-—-] >1, lim [1-—-] e^, (16.6.21) 
n— oo n n= 0 TL 
ps 1 2 k+1 
js ee aati E d. (16.6.22) 
n— oo Ty 


Therefore, for large n, small p and moderate A, we can approximate the binomial distribution 
by the Poisson distribution: 


AF 
px [tz] = P(X = k) z x k=0,1,.... (16.6.23) 
The corresponding cumulative density function of X is 
ANM 
Fx(r) =X qp BS ent. (16.6.24) 
k=0 


The mean and variance of the Poisson random variable X are 
ux = E(X)=), and oy = Var(X) =). (16.6.25) 


In addition to this approximation, the Poisson distribution is the probability distribution 
for a Poisson process. But that has to wait for the next chapter. 


e Example 16.6.6 


Consider a student union on a campus. On average 3 persons enter the union per 
minute. What is the probability that, during any given minute, 3 or more persons will 
enter the union? 

To make use of Poisson’s distribution to solve this problem, we must have both a large 
n and a small p with the average A = np = 3. Therefore, we divide time into a large number 
of small intervals so that n is large while the probability that someone will enter the union 
is small. Assuming independence of events, we have a binomial distribution with large n. 
Let A denote the event that 3 or more persons will enter the union, then 


P(A) = pxl0] + px[1] + px[2] = e^? (s TE =) = 0.423. (16.6.26) 


| Uniform distribution 


The continuous random variable X is called uniform if its probability density function 


_ f 1/(b — a), à « z «b, 
Pale) = { 0, otherwise. oot) 
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The corresponding cumulative density function of X is 


0, ra, 
Fy(x) = | (a — a)/(b — a), a«z«b, (16.6.28) 
b «x. 


The mean and variance of a uniform random variable X are 


px — E(X)— l(acb) and | of = Var(X) = (16.6.29) 


Uniform distributions are used when we have no prior knowledge of the actual probability 
density function and all continuous values in some range appear equally likely. 


| Exponential distribution 


The continuous random variable X is called exponential with parameter A > 0 if its 
probability density function is 


—Ac 
px(z) = M : ig s (16.6.30) 


The corresponding cumulative density function of X is 


_ p` ÀT 
Fx(z)— { 1 3 , dio (16.6.31) 


'The mean and variance of an exponential random variable X are 
ux cd and — o$ = Var(X) = 1/X*. (16.6.32) 


This distribution has the interesting property that is “memoryless.” By memoryless, 
we mean that for a nonnegative random variable X, then 


P(X >s+t|X 5t) — P(X >s), (16.6.33) 


where x,t > 0. For example, if the lifetime of a light bulb is exponentially distributed, then 
the light bulb that has been in use for some hours is as good as a new light bulb with regard 
to the amount of time remaining until it fails. 

To prove this, from Equation 16.2.4, Equation 16.6.33 becomes 


P(X » s-F-t and X >t) 


PX — P(X » s), (16.6.34) 


P(X >s+tand X >t) = P(X > t)P(X > s), (16.6.35) 
since P(X > s +t and X >t) = P(X > s+ t). Now, because 


P(X >s+t)=1-— [1 = ener) = 780, (16.6.36) 
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P(X > 8)=1-— (1 -e> ) Se, (16.6.37) 


and 
P(X >t)=1- (Ie) ee, (16.6.38) 


Therefore, Equation 16.6.35 is satisfied and X is memoryless. 
e Example 16.6.7 


A component in an electrical circuit has an exponentially distributed failure time with 
a mean of 1000 hours. Calculate the time so that the probability of the time to failure is 
less than 107°. 

Let the exponential random variable X = k have the units of hours. Then A = 107°. 
From the definition of the cumulative density function, 


Fx(z,)— P(X € a) = 0.001, and 1 — exp(—Az,) = 0.001. (16.6.39) 


Solving for x+, 


x, = —1n(0.999)/A = 1. (16.6.40) 


e Example 16.6.8 


A computer contains a certain component whose time (in years) to failure is given by 
the random variable T distributed exponentially with A = 1/5. If 5 of these components 
are installed in different computers, what is the probability that at least 2 of them will still 
work at the end of 8 years? 

The probability that a component will last 8 years or longer is 


P(T > 8) = e-8/5 = 0.2019, (16.6.41) 


because A — 1/5. 
Let X denote the number of components functioning after 8 years. Then, 


P(X >2)=1=P(X =0) = P(X «1) (16.6.42) 
=1- (à) (0.2019)9 (0.7981)? — E (0.2019)! (0.7981)* (16.6.43) 
— 0.2666. (16.6.44) 


Normal (or Gaussian) distribution 


'The normal distribution is the most important continuous distribution. It occurs in 
many applications and plays a key role in the study of random phenomena in nature. 
A random variable X is called a normal random variable if its probability density 
function is 
e- (cu /Qc) 


px(«) = ze =? (16.6.45) 
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where the mean and variance of a normal random variable X are 
ux = E(X) =p, and oy = Var(X) = 0°. (16.6.46) 


The distribution is symmetric with respect to x = p and its shape is sometimes called 
“bell shaped.” For small c? we obtain a high peak and steep slope while with increasing c? 
the curve becomes flatter and flatter. 

The corresponding cumulative density function of X is 


1 x 1 (xp) /o 
Fx (2) = a= | e 6-H)? /(20") ge = =z | e €? qe, (16.6.47) 


The integral in Equation 16.6.46 must be evaluated numerically. It is convenient to introduce 
the probability integral: 


ys ee 
(z) = =z fe d£. (16.6.48) 
Note that ®(—z) = 1 — ®(z). Therefore, 
Fx(z)— o( - r) : (16.6.49) 
and 
P(a < X € b) = Fx (b) — Fx (a). (16.6.50) 


Consider now the intervals consisting of one c, two c, and three o around the mean p. 
'Then, from Equation 16.6.50, 


P(u—o« X < u+) — 0.68, (16.6.51) 
P(u—2o < X < u+ 20) = 0.955, (16.6.52) 

and 
P(u— 30 < X < u +30) = 0.997. (16.6.53) 


Therefore, approximately 2 of the values will be distributed between u — o and u 4 c, 
approximately 95% of the values will be distributed between u — 2e and +20, and almost 
all values will be distributed between u — 3e and u + 3c. For most uses, then, all values 
will lie between u — 3e and u + 3c, the so-called “three-sigma limits.” 

As stated earlier, the mean and variance of a normal random variable X are 


ux = E(X)— py, and or = Var(X) = 0°. (16.6.54) 


The notation N (u; o) commonly denotes that X is normal with mean p and variance 
c?. The special case of a normal random variable Z with zero mean and unit variance, 


N (0,1), is called a standard normal random variable. 
Problems 
1. Four coins are tossed simultaneously. Find the probability function for the random 


variable X that gives the number of heads. Then compute the probabilities of (a) obtaining 
no heads, (b) exactly one head, (c) at least one head, and (d) not less than four heads. 
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2. A binary source generates the digits 1 and 0 randomly with equal probability. (a) What 
is the probability that three 1’s and three 0’s will occur in a six-digit sequence? (b) What 
is the probability that a£ least three 1’s will occur in a six-digit sequence? 


3. Show that the probability of exactly n heads in 2n tosses of a fair coin is 


1-390292] 
2-4-6-2 ` 


4. If your cell phone rings, on average, 3 times between noon and 3 P.M., what is the 
probability that during that time period you will receive (a) no calls, (b) 6 or more calls, and 
(c) not more than 2 calls? Assume that the probability is given by a Poisson distribution. 


5. A company sells blank DVDs in packages of 10. If the probability of a defective DVD is 
0.001, (a) what is the probability that a package contains a defective DVD? (b) what is the 
probability that a package has two or more defective DVDs? 


6. A school plans to offer a course on probability in a classroom that contains 20 seats. 
From experience they know that 95% of the students who enroll actually show up. If the 
school allows 22 students to enroll before the session is closed, what is the probability of 
the class being oversubscribed? 


7. The lifetime (in hours) of a certain electronic device is a random variable T having a 
probability density function pr (t) = 100 H(t — 100)/t?. What is the probability that exactly 
3 of 5 such devices must be replaced within the first 150 hours of operation? Assume that 
the events that the ith device must be replaced within this time are independent. 


16.7 JOINT DISTRIBUTIONS 


In the previous sections we introduced distributions that depended upon a single ran- 
dom variable. Here we generalize these techniques for two random variables. The range of 
the two-dimensional random variable (X, Y) is Rxy = {(x,y);€ € S and X (£) = x, Y (£) = 


y}- 


| Discrete joint distribution 


Let X and Y denote two discrete random variables defined on the same sample space 
(jointly distributed). The function pxv[r;,y;] = P[X = ri, Y = yj] is the joint probability 
mass function of X and Y. As one might expect, pxy [v;, y;] > 0. 

Let the sets of possible values of X and Y be A and B. If x; ¢ A or y; ¢ B, then 
pxy [zi, yj] = 0. Furthermore, 


M opxvizoy] 21. (16.7.1) 
ri€ A,yj€B 


The marginal probability functions of X and Y are defined by 


px [zi] = D9 pxy vi. v;]; (16.7.2) 
yjEB 
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and 


prlu] = >> pxy lzi yj]. (16.7.3) 
zıEA 


If X and Y are independent random variables, then pxy|xi, y;] = px [vi]: pv [yj]. 
e Example 16.7.1 


A joint probability mass function is given by 


k(zj + 29;), $;—1,2,3,9; = 1,2; 
pxy |i, yj] ={ ( 0 vs) EEE 9j (16.7.4) 
Let us find the value of k, px [;], and py [y,]. 
From Equation 16.7.1, we have that 
3 2 
k 3 (mi + 2y;) =1, (16.7.5) 
zi=l yj=1 
or 
k[(1 +2) + (1-4) + (2-2) - (2-4) - (82) - (384) =1. (16.7.6) 


Therefore, k — 1/30. 
Turning to px [x;] and py [y;], 


2 
E (x; + 2y;) = k(x; + 2) + k(z; + 4) = k(2z; + 6) = (x; -3)/15, (16.7.7) 
where x; = 1,2,3, and 


3 
py [yy] Ez (xi + 2y;) = k(1 + 2y;) + k(2 + 2y;) + k(3 + 2y;) = k(6 + 6y;) = (1 + yj)/5. 


(16.7.8) 


where y; — 1,2. 


e Example 16.7.2 


Consider an urn that contains 1 red ball, 2 blue balls, and 2 green balls. Let (X,Y) 
be a bivariate random variable where X and Y denote the number of red and blue balls, 
respectively, chosen from the urn. There are 18 possible ways that three balls can be drawn 
from the urn: rbb, rbg, rgb, rgg, brb, brg, bbr, bbg, bgr, bgb, bgg, grb, grg, gbr, gbb, gbg, ggr, 
and ggb. 

The range of X and Y in the present problem is Rx y — ((0, 1), (0, 2), (1,0), (1,1), (1,2)}. 
The joint probability mass function of (X, Y) is given by pxv [ri yj] = P(X = üY = j), 
where z; = 0,1 and y; = 0,1,2. From our list of possible drawings, we find that pxy [0,0] = 
0, pxv [0,1] = 1/6, pxy [0,2] = 1/6, pxy [1,0] = 1/6, pxv [1 1] = 1/3, and pxv[1, 2] = 1/6. 
Note that all of these probabilities sum to one. 

Given these probabilities, the marginal probabilities are px[0] = 1/3, px[1] = 2/3, 
py [0] = 1/3, py [1] = 1/2, and py [2] = 1/3. Because pxy [0,0] 4 px[0]py [0], X and Y are 
not independent variables. 


844 Advanced Engineering Mathematics with MATLAB 


e Example 16.7.3 


Consider a community where 5096 of the families have a pet. Of these families, 6096 
have one pet, 30% have 2 pets, and 1096 have 3 pets. Furthermore, each pet is equally 
likely (and independently) to be a male or female. If a family is chosen at random from 
the community, then we want to compute the joint probability that his family has M male 
pets and F female pets. 

'These probabilities are as follows: 


P(F — 0, M — 0) = P{no pets} = 0.5, (16.7.9) 
P{F =1,M = 0} = P{1 female and total of 1 pet} (16.7.10) 
= P{1 pet}P{1 female|1 pet} (16.7.11) 
1 

= (0.5)(0.6) x c= 0.15, (16.7.12) 
P{F = 2, M = 0) = P{2 females and total of 2 pets} (16.7.13) 
= P{2 pets} P(2 females|2 pets} (16.7.14) 

1\2 
= (0.5)(0.3) x (5) = 0.0375, (16.7.15) 

and 

P{F = 3, M = 0) = P{3 females and total of 3 pets} (16.7.16) 
= P{3 pets} P(3 females|3 pets} (16.7.17) 

1\3 
= (0.5)(0.1) x (5) = 0.00625. (16.7.18) 


The remaining probabilities can be obtained in a similar manner. 


| Continuous joint distribution 


Let us now turn to the case when we have two continuous random variables. In analog 
with the definition given in Section 16.4, we define the two-dimensional probability density 
pxy (x,y) by 


P(x < X < z+dz,y<Y € y+ dy) = pxy (v, y) dz dy. (16.7.19) 
Here, the comma in the probability parentheses means “and also.” 


Repeating the same analysis as in Section 16.4, we find that pxy (x,y) must be a 
single-valued function with pxy (x, y) > 0, and 


Í L Dxy (, y) dz dy — 1. (16.7.20) 
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The joint distribution function of X and Y is 
vo py 
Fey) PXzzYip-f f rwn. (16.7.21) 
Therefore, 


b pd 
Pa<X<be<Y<d= f / pxy (€, m) d£ dn. (16.7.22) 


The marginal probability density functions are defined by 


px(z)- d. pxv(x,y)dy, and py(y)= T pxy (x,y) dz. (16.7.23) 


— oo 


An important distinction exists upon whether the random variables are independent or not. 
Two variables X and Y are independent if and only if 


pxy (x,y) = px (z)pv (y). (16.7.24) 
and conversely. 
e Example 16.7.4 


The joint probability density function of bivariate random variables (X,Y) is 


_ J kay, O0<y<a<l, 
pxy (#4) = { 0, otherwise, (16/525) 


where k is a constant. (a) Find the value of k. (b) Are X and Y independent? 
The range Rxy for this problem is a right triangle with its sides given by x = 1, y = 0, 
and y = x. From Equation 16.7.20, 


oo oo 1 x 1 2T 
/ J pxy (x,y) dx dy = Zi x / yd! dx = I xr L| de (16.7.26) 
—oo J —oo 0 0 0 2 0 
k fi 3 k yr k 


Therefore, k = 8. 
To check for independence we must first compute px(r) and py (y). From Equation 
16.7.23 and holding x constant, 


px (x) = s | y dy = 4z?, e<l; (16.7.28) 
0 


px (x) = 0 otherwise. From Equation 16.7.23 and holding y constant, 


1 
py(y) — sy f a dx = 4y(1— y?), 0<y<l. (16.7.29) 
y 


Because pxv (x,y) Z px(x)py (y), X and Y are not independent. 
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(a) intersection (b) no intersection 


Figure 16.7.1: Schematic of Buffon's needle problem showing the random variables X and O. 
e Example 16.7.5: Buffon's needle problem 


A classic application of joint probability distributions is the solution of Buffon's needle 
problem: Consider an infinite plane with an infinite series of parallel lines spaced a unit 
distance apart. A needle of length L « 1 is thrown upward and we want to compute the 
probability that the stick will land so that it intersects one of these lines. See Figure 16.7.1. 

There are two random variables that determine the needle's orientation: X, the distance 
from the lower end O of the needle to the nearest line above and O, the angle from the 
vertical to the needle. Of course, we assume that the position where the needle lands is 
random; otherwise, it would not be a probability problem. 

Let us define X first. Its possible values lie between 0 and 1. Second, X is uniformly 
distributed on (0, 1) with the probability density 


; O<a<l, 
Px (2) = { 0, otherwise. POTD) 


Turning to O, its value lies between —7/2 to 7/2 and is uniformly distributed between these 
values. Therefore, the probability density is 


_ jf Am, —n/2«0«mJ[2, 
pe(8) = { 0, otherwise. ast) 


The probability p that we seek is 
p = P{needle intersects line} = P{X < Lcos(O)}. (16.7.32) 


Because X and © are independent, their joint density equals the product of the densities 
for X and ©: pxe(z,0) = px (x)pe(0). 

The final challenge is to use pxo(x, 0) to compute p. In Section 16.2 we gave a geometric 
definition of probability. The area of the sample space is 7 because it consists of a rectangle 
in (X,0) space with 0 < x < 1 and —7/2 < 0 < m/2. The values of X and © that 


8 First posed in 1733, its solution is given on pages 100-104 of Buffon, G., 1777: Essai d'arithmétique 
morale. Histoire naturelle, générale et particulière, Supplément, 4, 46-123. 
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lead to the intersection with a parallel line is 0 < x < Lcos(0) where —7/2 < 0 < 7/2. 
Consequently, from Equation 16.2.5, 


1/2 L cos(0 n/2 L cos(0) 
n n esf x, 0) dx d0 -f n px (x)po(0) dz dé (16.7.33) 


p = 
m/2 1/2 
n/2 L cos(0 2L 
= if y gn do = TNI Lcos(0) d0 = —. (16.7.34) 
7/2 T J—«/2 T 


Consequently, given L, we can perform the tossing either physically or numerically, measure 
p, and compute the value of 7. 


| Convolution 


It is often important to calculate the distribution of X + Y from the distribution of 
X and Y when X and Y are independent. We shall derive the relationship for continuous 
random variables and then state the result for X and Y discrete. 

Let X have a probability density function px(x) and Y has the probability density 
py(y). Then the cumulative distribution function of X + Y is 


Gx+y(a) = P(r--y €a)— Ioni Dy (y) da dy (16.7.35) 
=f [_ pxtorr(waeay = / ~ | [xtorae] ovans (16.7.36) 
= f Fxla- yoriy) de (16.7.37) 

Therefore, 
pirlo = || Fiet oto] = [^ osa). (16.7.38) 


In the case when X and Y are discrete, 


px+y [ax] "S px [vi]pv lax — xi). (16.7.39) 


i——oo 
| Covariance 


In Section 16.5 we introduced the concept of variance of a random variable X. There 
we showed that this quantity measures the dispersion, or spread, of the distribution of X 
about its expectation. What about the case of two jointly distributed random numbers? 

Our first attempt might be to look at Var(X) and Var(Y). But this would simply 
display the dispersions of X and Y independently rather than jointly. Indeed, Var(X) 
would give the spread along the z-direction while Var(Y) would measure the dispersion 
along the y-direction. 
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Consider now Var(a.X + bY), the joint spread of X and Y along the (ax + by)-direction 
for two arbitrary real numbers a and b. Then 


Var(aX + bY) = E[(aX + bY) — E(aX + bY)]? 16.7.40) 
= E((aX + bY) — E(aX) — E(bY)P? 16.7.41) 
= E(a[X — E(X)] - Y — E(Y)|P 16.7.42) 
= E(aà?|X — E(X)? + ?|Y — E(Y)P + 2ab[X — E(X)]Y — E(Y)]) 
16.7.43) 
= a° Var(X) + ? Var(Y) + 2abE([X — E(X)|Y — E(Y)])- 16.7.44) 


Thus, the joint spread or dispersion of X and Y in any arbitrary direction ax + by depends 
upon three parameters: Var( X), Var(Y), and E([X — E(X)]|Y — E(Y)]}. Because Var(X) 
and Var(Y) give the dispersion of X and Y separately, it is the quantity E([X — E(X)]|Y — 
E(Y)]) that measures the joint spread of X and Y. This last quantity, 


Cov(X,Y) = E{[X — E(X)]Y - E(Y)]), (16.7.45) 


is called the covariance and is usually denoted by Cov(.X, Y) because it determines how X 
and Y covary jointly. It only makes sense when we have two different random variables 
because in the case of a single random variable, Cov(X, X) = o2, = Var(X). Furthermore, 
Cov(X,Y) € oxoy. In summary, 


Var(aX + bY) = a? Var(X) + b? Var(Y) + 2ab Cov(X, Y ). (16.7.46) 


An alternative method for computing the covariance occurs if we recall that wx = E(X) 
and py = E(Y). Then 


Cov(X, Y) = E[(X — ux)(Y — ny)] = E(XY — uxY — uv X + xuy) (16.7.47) 
= E(XY) - ux E(Y) - ny E(X) + uxuv (16.7.48) 
= E(XY) — uxpy — uy ux + uxnuy (16.7.49) 
= E(XY) - uxuy = E(XY) - E(X)E(Y), (16.7.50) 
where 
5 LiY; pxy |ti. Vj]. X discrete, 
BOM) = PS (16.7.51) 
J / xy pxy (x, y) dz dy, X continuous. 
Therefore, 
Cov(X,Y) = E(XY) — E(X)E(Y). (16.7.52) 
It is left as a homework assignment to show that 
Cov(aX + b, cY + d) = ac Cov(X, Y). (16.7.53) 


In general, Cov(X,Y) can be positive, negative, or zero. For it to be positive, X 
and Y decrease together or increase together. For a negative value, X would increase 
while Y decreases, or vice versa. If Cov(X, Y) > 0, X and Y are positively correlated. If 
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Cov(X,Y) < 0, X and Y are negatively correlated. Finally, if Cov( X, Y) = 0, X and Y are 
uncorrelated. 


e Example 16.7.6 


The following table gives a discrete joint density function: 


Ti 
pxy [£i, yj] 0 1 2 py [v;] 
3 9 3 15 
0 28 B3 38 38 
3 3 3 
yj 1 l4 ii 0 7 
1 1 
2 28 0 0 us 
px) 5 Bb E 
Because 
2 2 " 
E(XY) = 22i yj pxy [£i uj] = a’ (16.7.54) 


2 
3 1 
px = E(X) = 3 x px[2i] = p md py = E(Y) = So uj py ly] = z> (16.7.55) 
cum 


i=0 
then 
Cov(X, Y) = E(XY) — E(X)E(Y) = ME. (16.7.56) 
iU vs 14 42^ 56 m 
'Therefore, X and Y are negatively correlated. 
e Example 16.7.7 
The random variables X and Y have the joint probability density function 
£z +y, 0<a<10<y<l, 
pxy(®,y) = { 0 : otherwise. i (oO) 
Let us compute the covariance. 
First, we must compute px (x) and py (y). We find that 
1 1 
px) =f pxv(ou)ay= [e+ y)dy=24} (16.7.58) 
0 0 


for 0 < z < 1, and 


py (y) = ‘i pxy (x,y) dz = li (et+y)de=yts (16.7.59) 
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forü0c y «1. 
Because 
r3 r? > 
E(XY) -[ [om z-4y)drdy- f esl V3] +y = j^ 16.7.60) 
0 
y y? y y] 
Z4% )dy=%+>| =- 16.7.61 
MEDE aR. ) 
1 1 T 7 
ux = B(x) = | zpx() dz = | (x? +5) dz = —, 16.7.62) 
o R 2 12 
1 1 y 7 
py = EY) = f yoru) dy = f (v +2) dy = i5 16.7.63) 
0 0 
then 
Cov(X,Y) = E(XY) E(X)EY) =} SN 16.7.64) 
M ur E ^3 14 14 Rh 


Therefore, X and Y are negatively correlated. 


| Correlation 


Although the covariance tells us how X and Y vary jointly, it depends upon the same 
units in which X and Y are measured. It is often better if we free ourselves of this nuisance, 
and we now introduce the concept of correlation. 

Let X and Y be two random variables with 0 < 0% < oo and 0 < of < oo. The 
correlation coefficient p(.X, Y) between X and Y is given by 


(X,Y) = Cov | a) daro PRESA ITO (16.7.65) 


Ox Oy OXOY 


It is noteworthy that |p(.X, Y)| € 1. 


| Random Vectors 


It is often useful to express our two random variables X and Y as a two-dimensional 
random vector V = (X Y)T. Then, the covariance can be written as 2 x 2 covariance 
matriz, given by 


[E 


These considerations can be generalized into the n-dimensional random vector consisting of 
n random variables that are all associated with the same events. 


e Example 16.7.7 


Using MATLAB, let us create two random variables by invoking X = randn(N,1) and Y 
= randn(N,2), where N is the sample size. If N = 10, we would find that using the MATLAB 
command cov(X,Y) would yield 
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Figure 16.7.2: Scatter plot of points (X;, Y;) given by the random vector V in Example 16.7.7 when N = 
1000. 


>> ans = 


3.1325 0.9748 
0.9748 1.4862 


(If you do this experiment, you will also obtain a symmetric matrix but with different 
elements.) On the other hand, if N = 1000, we find that cov(X,Y) equals 


>> ans = 


0.9793 —0.0100 
—0.0100 0.9927 ' 


The interpretation of the covariance matrix is as follows: The variance (or spread) 
of data given by X and Y is (essentially) unity. The correlation between X and Y is 
(essentially) zero. These results are confirmed in Figure 16.7.2 where we have plotted X 
and Y as the data points (X;,Y;) when N = 1000. We can see the symmetric distribution 
of data points. 


Problems 


1. A search committee of 5 is selected from a science department that has 7 mathematics 
professors, 8 physics professors, and 5 chemistry professors. If X and Y denote the number 
of mathematics and physics professors, respectively, that are selected, calculate the joint 
probability function. 


2. In an experiment of rolling a fair die twice, let Z denote a random variable that equals 
the sum of the results. What is pz[z;|? Hint: Let X denote the result from the first toss 
and Y denote the result from the second toss. What you must find is Z = X +Y. 


3. Show that Cov(a.X + b, cY + d) = ac Cov( X,Y). 
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0.7 


0.6 


Estimated pz) 


Convolution Project 
Project: Convolution 


Consider two independent, uniformly distributed random variables (X,Y) that are summed 
to give Z = X +Y with 


om 1, 0<a<l, 
PEE deg otherwise, 
and 
EA 0cyc«1l, 
pie { 0, otherwise. 
Show that 
2 0<z<l, 
se [22s l2 
0, otherwise. 


Then confirm your results using MATLAB’s intrinsic function rand to generate (r;) and 
{y;} and computing pz(z). You may want to review Example 11.5.1. 


Further Readings 
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Hsu, H., 1997: Probability, Random Variables, € Random Processes. McGraw-Hill, 306 pp. 
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25. 


Chapter 17 


Random Processes 


In the previous chapter we introduced the concept of a random variable X. There X 
assumed various values of z according to a probability mass function px [k] or probability 
density function px (zr). In this chapter we generalize the random variable so that it is also 
a function of time t. As before, the values of z assumed by the random variable X(t) at a 
certain time is still unknown beforehand and unpredictable. 

Our random, time-varying variable X(t;£) is often used to describe a stochastic or 
random process. In that case, X(t) is the state of the process at time t. The process can 
be either discrete or continuous in t. 

A random process is not one function but a collection or family of functions, called 
sample functions, with some probability assigned to each. When we perform an experiment, 
we observe only one of these functions that is called a realization or sample path of the 
process. To observe more than a single function, we must repeat the experiment. 

'The state space of a random process is the set of all possible values that the random 
variable X (t) can assume. 

We can view random processes from many perspectives. First, it is a random function of 
time. This perspective is useful when we wish to relate an evolutionary physical phenomenon 
to its probabilistic model. Second, we can focus on its aspect as a random variable. This is 
useful in developing mathematical methods and tools to analyze random processes. 

Another method for characterizing a random process examines its behavior as t and € 
vary or are kept constant. For example, if we allow t and £ to vary, we obtain a family or 
ensemble of X(t). If we allow t to vary while £ is fixed, then X(t) is simply a function of 
time and gives a sample function or realization for this particular random process. On the 
other hand, if we fix t and allow € to vary, X(t) is a random variable equal to the state of 
the random process at time t. Finally, if we fix both t and £, then X(t) is a number. 
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1r ae 
0.5; 
-0.5 
+ uem 
0 20 40 í 60 80 100 
Figure 17.0.1: A realization of the random telegraph signal. 


e Example 17.0.1 


Consider a random process X(t) = A, where A is uniformly distributed in the interval 
(0, 1]. A plot of sample functions of X(t) (a plot of X(t) as a function of t) consists of 
horizontal straight lines that would cross the ordinate somewhere between 0 and 1. 


e Example 17.0.2 


Consider the coin tossing experiment where the outcomes are either heads H or tails 
T. We can introduce the random process defined by 


X(t; H) =sin(t), and X(t; T) = cos(t). (17.0.1) 


Note that the sample functions here are continuous functions of time. 


e Example 17.0.3: Random telegraph signal 


Consider a signal that switches between —a and +a at random times. Suppose the 
process starts (at time t = 0) in the —a state. It then remains in that state for a time 
interval Ti at which point it switches to the state X(t) = a. The process remains in that 
state until t = T5, then switches back to X(t) = —a. The switching time is given by a 
Poisson process, a random process that we discuss in Section 17.6. Figure 17.0.1 illustrates 
the random telegraph signal. 


Of all the possible random processes, a few are so useful in engineering and the physical 
sciences that they warrant special names. Some of them are: 


e Bernoulli process 


Imagine an electronics firm that produces electronic devices that either work (a success 
denoted by “S” ) or do not work (a failure or denoted “F”). We can model the production 
line as a series of independent, repeated events where p denotes the probability of producing 
a working device and q = 1 — p is the probability of producing a faulty device. Thus, the 
production line can be modeled as random process, called a Bernoulli process, which has 
discrete states and parameter space. 
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Figure 17.0.2: Three realization or sample functions of a Bernoulli random process with p = 0.4. The 
realization starts at k = 0 and continues forever. The dashed box highlights the values of the random 
variable X([21, s]. 


If we denote each discrete trial by the integer k, a Bernoulli process generates successive 
outcomes at times k — 0,1,2,.... Mathematically we can express this discrete random 
process by X[k, s] where k denotes the time and s denotes the number of the realization 
or sample function. Furthermore, this random process maps the original experimental 
sample space ((F,F,S,..), (S, F, F,...), (F, F, F,...),...} to the numerical sample space 
{(0,0,1,...),(1,0,0,...),(0,0,0,...),...}. Unlike the Bernoulli trial that we examined in 
the previous chapter, each simple event now becomes an infinite sequence of S’s and F’s. 

Figure 17.0.2 illustrates three realizations or sample functions for a Bernoulli random 
variable when p — 0.4. In each realizations s — 1,2,..., the abscissa denotes time where 
each successive trial occurs at times k = 0,1,2,.... When we fix the value of k, the quantity 
X [k, s] is à random variable with a probability mass function of a Bernoulli random variable. 


e Markov process 


Communication systems transmit either the digits 0 or 1. Each transmitted digit often 
must pass through several stages. At each stage there is a chance that the digit that enters 
one stage will be changed by the time when it leaves. 

A Markov process is a stochastic process that describes the probability that the digit 
will or will not be changed. It does this by computing the conditional distribution of any 
future state X4,4 by considering only the past states Xo, X4,..., X4, 4 and the present 
state Xn. In Section 17.4 we examine the simplest possible discrete Markov process, a 
Markov chain, when only the present and previous stages are involved. An example is the 
probabilistic description of birth and death, which is given in Section 17.5. 


e Poisson process 


The prediction of the total number of “events” that occur by time t is important to 
such diverse fields as telecommunications and banking. The most popular of these counting 
processes is the Poisson process. It occurs when: 
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1. The events occur “rarely.” 
2. The events occur in nonoverlapping intervals of time that are independent of each other. 
3. The events occur at a constant rate A. 

In Section 17.6 we explore this random process. 


e Wiener process 


A Wiener process W; is a random process that is continuous in time and possesses the 

following three properties: 

1. Wo = 0, 

2. Wi is almost surely continuous, and 

3. W; has independent increments with a distribution W; — Ws ~ N(0,t—s) for 0 € s € t. 
As a result of these properties, we have that 

1. the expectation is zero, E(W;) = 0, 

2. the variance is E(W?) — E?(W;) = t, and 

3. the covariance is cov(W;, W+) = min(s, t). 

Norbert Wiener (1894-1964) developed this process to rigorously describe the physical 
phenomena of Brownian motion - the apparent random motion of particles suspended in 
a fluid. In a Wiener process the distances traveled in Brownian motion are distributed 
according to a Gaussian distribution and the path is continuous but consists entirely of 
sharp corners. 


Project: Gambler's Ruin Problem 


Pete and John decide to play a coin-tossing game. Pete agrees to pay John 10 cents 
whenever the coin yields a “head” and John agrees to pay Pete 10 cents whenever it 
is a “tail.” Let S, denote the amount that John earns in n tosses of a coin. This 
game is a stochastic process with discrete time (number of tosses). The state space is 
(0, +10, +20, - --} cents. A realization occurs each time that they play a new game. 


Step 1: Create a MATLAB code to compute a realization of Sn. Plot several realizations 
(sample functions) of this random process. See Figure 17.0.3. 


Step 2: Suppose Pete has 10 dimes. Therefore, there is a chance he will run out of dimes at 
some n — N. Modify your MATLAB code to construct a probability density function that 
gives the probability Pete will run out of money at time n = N. See Figure 17.0.3. This is 
an example of the famous gambler’s ruin problem. 


17.1 FUNDAMENTAL CONCEPTS 


In Section 16.5 we introduced the concepts of mean (or expectation) and variance as 
it applies to discrete and continuous random variables. These parameters provide useful 
characterizations of a probability mass function or probability density function. Similar 
considerations hold in the case of random processes and we introduce them here. 


| Mean and variance 


We define the mean of the random process X(t) as the expected value of the process 
- that is, the expected value of the random variable defined by X(t) for a fixed instant of 
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Figure 17.0.3: (a) Top frame: John’s gains or losses as the result of the three different coin tossing games. 
(b) The probability density function for John’s winning 10 dimes as a function of the number of tosses that 
are necessary to win 10 dimes. 


time. Note that when we take the expectation, we hold the time as a nonrandom parameter 
and average only over the random quantities. We denote this mean of the random process 


by x(t), since, in general, it may depend on time. The definition of the mean is just the 
expectation of X (t): 


ux) = Ext) f epx(tsx) dn. (17.1.1) 


In a similar vein, we can generalize the concept of variance so that it applies to random 
processes. Here variance also becomes a time-dependent function defined by 


c2. (t) = Var[X(t)] = B{[Xx(t) = ux Or) ; (17.1.2) 


e Example 17.1.1: Random linear trajectories 
Consider the random process defined by 
X(t) — A4 Bt, (17.1.3) 
where A and B are uncorrelated random variables with means u4 and upg. Let us find the 
mean of this random process. 


From the linearity property of expectation, we have that 


ux(t) = E[X(t)) = E(A + Bt) = E(A) + E(B)t = pa + unt. (17.1.4) 
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e Example 17.1.2: Random sinusoidal signal 
A random sinusoidal signal is one governed by X(t) = Acos(wot + ©), where A and 
© are independent random variables, A has a mean p4 and variance c4, and © has the 
probability density function pe(x) that is nonzero only over the interval (0, 27). 
The mean of X(t) is given by 
px (t) = E[X(t)] = E[Acos(wot + O)|] = E[A]E|cos(wot + 9)]. (17.1.5) 


We have used the property that the expectation of two independent random variables equals 
the product of the expectation of each of the random variables. Simplifying Equation 17.1.5, 


pox (t) = uA E cos(wot + z)pe(x) dz. (17.1.6) 
A common assumption is that po(x) is uniformly distributed in the interval (0, 27), namely 
polz) = — 0 « z «2m. (17.1.7) 

Substituting Equation 17.1.7 into Equation 17.1.6, we find that 


2T 
ux(t)= taf cos(wot + x) dx = 0. (17.1.8) 
0 


e Example 17.1.3: Wiener random process or Brownian motion 
A Wiener (random) process is defined by 
t 
X(t) = i: U(E) d£, t>0, (17.1.9) 
0 


where U(t) denotes white Gaussian noise. It is often used to model Brownian motion. To 
find its mean, we have that 


E[x(t] = elf U(£) as = iB E([U (£)] dé = 0, (17.1.10) 


because the mean of white Gaussian noise equals zero. 


| Autocorrelation function 


When a random process is examined at two time instants t = tı and t = t2, we obtain 
two random variables X(t;) and X(t2). A useful relationship between these two random 
variables is found by computing their correlation as a function of time instants tı and to. 
Because it is a correlation between the values of the same process sampled at two different 
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instants of time, we shall call it the autocorrelation function of the process X(t) and denote 
it by Rx(t1,t2). It is defined in the usual way for expectations by 


Rx (ti, t2) = EX (&)X (ta)]. (17.1.11) 


Just as in the two random variables case, we can define the covariance and correlation 
coefficient, but here the name is slightly different. We define the autocovariance function 
as 


Cx (tr, ta) = E([X(&5) — ux (tX (to) — ux (12)]) (17.1.2) 
= Rx(ti,t2) — ux(ti)ux (t2). (17.1.13) 


Note that the variance of the process and its average power (the names used for the average 
of [X (t) 2 ux (t)? and [X (t)]?, respectively) can be directly obtained for the autocorrelation 
and the autocovariance functions, by simply using the same time instants for both tı and 
ta: 

E([X(0]?) = Rx(t,t), (17.1.14) 


and 
e(t) = EX[X(t) — ux (01) = Cx(t,t) = Rx (tt) — 02 (0. (17.1.15) 


Therefore, the average power, Equation 17.1.14, and the variance, Equation 17.1.15, of 
the process follows directly from the definition of the autocorrelation and autocovariance 
functions. 


e Example 17.1.4: Random linear trajectories 


Let us continue Example 17.1.1 and find the autocorrelation of a random linear trajec- 
tory given by X(t) = A+ Bt. From the definition of the autocorrelation, 


Rx (t1,t2) = E[X (ti) X(t2)| = E{[A + Bt ][A + Bt2]) (17.1.16) 
= E(4?) + E(AB)(t, + t3) + E(B’) tite (17.1.17) 
= (o + uà) + Mata (tr + t2) + (05 + wp) tite, (17.1.18) 


where o7, and o% are the variances of the random variables A and B. We can easily find 
the autocovariance by 


Cx (t1, t2) = Rx (t1, te) — ux (t1) x (te) = 03, + od tite. (17.1.19) 


e Example 17.1.5: Random sinusoidal signal 


We continue to examine the random sinusoidal signal given by X(t) = Acos(wot + O). 
The autocorrelation function is 


Rx (t4, t2) = E[X (t1)X(t2)] = E[A cos(uwot + O)A cos(wote + 0)] (17.1.20) 
E(A?)E[cos(wot2 = woty) + cos(wote + Wot, + 20)] (17.1.21) 


Nie 


= 3(04+ nà) f costul 13] +f , cos|wo(t2 + t1) + 2z]pe (x) 4s). 071.22) 
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In our derivation we used (1) the property that the expectation of A? equals the sum of 
the variance and the square of the mean, and (2) the first term involving the cosine is 
not random because it is a function of only the time instants and the frequency. From 
Equation 17.1.22 we see that autocorrelation function may depend on both time instants 
if the probability density function of the phase angle is arbitrary. On the other hand, if 
pe(r) is uniformly distributed, then the last term in Equation 17.1.22 vanishes because 
integrating the cosine function over the interval of one period is zero. In this case we can 
write the autocorrelation function as a function of only the time difference. The process 
also becomes wide-sense stationary with 


Rx(r) = E(X(t) X (t  7)] = à(03 + uå) cos(wor). (17.1.23) 


| Wide-sense stationary processes 


A process is strictly stationary if its distribution and density functions do not depend 
on the absolute values of the time instants tı and t9 but only on the difference of the time 
instants, |t1 — t|. However, this is a very rigorous condition. If we are concerned only with 
the mean and autocorrelation function, then we can soften our definition of a stationary 
process to a limited form, and we call such processes wide-sense stationary processes. A 
wide-sense stationary process has a constant mean, and its autocorrelation function depends 
only on the time difference: 


ux(t) = E[X(t)] = ux, (17.1.24) 


and 
Rx (ti, te) = E[X (t1)X (t3)] = Rx (t2 — tı). (17.1.25) 


Because time does not appear in the mean, we simply write it as a constant mean value uy. 


Similarly, because the autocorrelation function is a function only of the time difference, we 
can write it as a function of a single variable, the time difference r: 


Rx(r) = E[X(t)X(t 4 7)]. (17.1.26) 


We can obtain similar expressions for the autocovariance function, which in this case de- 
pends only on the time difference as well: 


Cx(r) = E([X(t) - nx]X(t 7) - ux] = Rx(7) - vx. (17.1.27) 
Finally, the average power and variance for a wide-sense stationary process are 
E([X()P) = Rx(0), (17.1.28) 


and 
ot, = Cx(0) = Rx(0) — pe, (17.1.29) 


respectively. Therefore, a wide-sense stationary process has a constant average power and 
constant variance. 
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Problems 


1. Find ux(t) and oł (t) for the random process given by X(t) = Acos(wt), where w is 
a constant and A is a random variable with the Gaussian (or normal) probability density 
function 


2. Consider a sine-wave random process X (t) = A cos(wt+ ©), where A and w are constants 
with A > 0. The phase function © is a random, uniform variable on the interval [—7, 7]. 
Find the mean and autocorrelation for this random function. 


3. Consider a countably infinite sequence {X,,n = 0,1,2,3,...} of a random variable 
defined by 


y= 1, for success in the nth trial, 
"^ 10, for failure in the nth trial, 


with the probabilities P(X, = 0) = 1 — p and P(X, = 1) = p. Thus, X, is a Bernoulli 
process. For this process, E( X4) = p and Var(X4) = p(1—p). Show that the autocorrelation 
is 


P, tı = t2, 
Rx (ti, t2) = 
x(t t2) Us tı Æ t2; 
and the autocovariance is 
1 — p) ty = t2 
C t t — p( , , 
ON { " 2 


Project: Computing the Autocorrelation Function 


In most instances you must compute the autocorrelation function numerically. The 
purpose of this project is to explore this computation using the random telegraph signal. 
The exact solution is given by Equation 17.2.24. You will compute the autocorrelation two 
ways: 


Step 1: Using Example 17.6.1, create MATLAB code that generates 500 realizations of the 
random telegraph signal. 


Step 2: Choosing an arbitrary time ts, compute X;(ts)Xz(ts +7) for 0 € 0 € Tmax and 
k = 1,2,3,...,500. Then find the average value of X&(ts)Xx(ts + T). Plot Rx(r) asa 
function of 7 and include the exact answer for comparison. Does it matter how many sample 
functions you use? 


Step 3: Now introduce a number of times tm = mAt, where m = 0,1,2,..., M. Using only 
a single realization k = K of your choice, compute X (mAt) x Xy (mAt 4- 7). Then find 
the average value of X y (mAt) X  (mAt 4- 7) and plot this result as a function of 7. On the 
same plot, include the exact solution. Does the value of At matter? See Figure 17.1.1 
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Figure 17.1.1: The autocorrelation function Rx(r) for the random telegraph signal as a function of T 
when A = 2. The dashed line gives the exact solution. In frame (a) Xz (tg) Xy (ts--T) has been averaged over 
500 realizations when tg = 2. In frame (b) X2oo(mAt) Xooo(mAt + T) has been averaged with M = 1200 
and At — 0.01. 


17.2 POWER SPECTRUM 


In earlier chapters we provided two alternative descriptions of signals, either in the 
time domain, which provides information on the shape of the waveform, or in the frequency 
domain, which provides information on the frequency content. Because random signals do 
not behave in any predictable fashion nor are they represented by a single function, it is 
unlikely that we can define the spectrum of a random signal by taking its Fourier transform. 
On the other hand, the autocorrelation of random signals describes in some sense whether 
the signal changes rapidly or slowly. In this section we explain and illustrate the concept 
of power spectrum of random signals. 

For a wide-sense stationary random signal X(t) with autocorrelation function Rx (7), 
the power spectrum Sx (tw) of the random signal is the Fourier transform of the autocorre- 
lation function: 


Sx(w) = A Rx (T)e *** dr. (17.2.1) 


Consequently, the autocorrelation can be obtained from inverse Fourier transform of the 
power spectrum, or 


Rx(r) = = | Sx (we? dw. (17.2.2) 


As with any Fourier transform, it enjoys certain properties. They are: 


1. The power spectrum is real and even: Sx(—w) = Sx(w) and S$ (w) = Sx(w), where 
S% (w) denotes the complex conjugate value of S (w). 


2. The power spectrum is nonnegative: Sx (uw) > 0. 


3. The average power of the random signal is equal to the integral of the power spectrum: 


E((X(0P) = Rx(0) = =f S x (w) dw. (17.2.3) 
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4. If the random signal has nonzero mean ux, then its power spectrum contains an impulse 
at zero frequency of magnitude 271%. 


5. The Fourier transform of the autocovariance function of the random process is itself 
also a power spectrum and usually does not contain an impulse component in zero 
frequency. 


Consider the following examples of the power spectrum: 
e Example 17.2.1: Random sinusoidal signal 
The sinusoidal signal is defined by 
X(t) = Acos(wot + O), (17.2.4) 


where the phase is uniformly distributed in the interval [0,27]. If the amplitude A has a 
mean of zero and a variance of c?, then the autocorrelation function is 


Rx (rT) = 1o? cos(uoT) = Rx (0) cos(uor). (17.2.5) 


The power spectrum of this signal is then 


Sx(w) = T Rx (0) cos(wor)e™™" dr = Rx (0)m [ó(w — wo) + lw + wo)] . (17.2.6) 


— oo 


Because this signal contains only one frequency wo, its power spectrum is just two impulses, 
one at wo and one at —wo. Since the negative frequency appears due only to the even 
property of the power spectrum, it is clear that all power is concentrated at the frequency 
of the sinusoidal signal. While this is a very simple example, it does illustrate that the 
power spectrum indeed represents the way the power in the random signal is distributed 
among the various frequencies. We shall see later that if we also use linear systems in 
order to amplify or attenuate certain frequencies, the results mirror what we expect in the 
deterministic case. 


e Example 17.2.2: Modulated signal 


Let us now examine a sinusoidal signal modulated by another random signal that 
contains low frequencies. This random process is described by 


Y (t) = X(t) cos(wot + O), (17.2.7) 


where the phase angle in Equation 17.2.7 is a random variable that is uniformly distributed 

in the interval [0, 27] and is independent of X(t). Then the autocorrelation function of Y (t) 

is given by 

Ry(r) = E[Y (HY (t - 7)] = E{ X(t) cos(wot + 9)X (t + 7) cos[wo(t + 7) + O]} (17.2.8) 
= E(X (t) X (t + 7)|E{cos(wot + 9) cos[wo(t + 7) + O]} = 3 Rx (7) cos(wot). (17.2.9) 


Let us take Rx(r) = Rx(0)e-?^7l, the autocorrelation function for a random telegraph 
signal (see Equation 17.2.22). In this case, 


Ry (T) = 1Rx (0)e ?^l'! cos(uot). (17.2.10) 
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Turning to the power spectrum, the definition gives 


ree "M 1 Rx (7) cos(uot)e ^7 dr (17.2.11) 
p Hir) (e07 + e tror] eo" dr (17.2.12) 
1 [Sx (w — wo) + Sx (w + wo)]. (17.2.13) 


Thus, the resulting power spectrum is shifted to the modulating frequency wo and its 
negative value, with peak values located at both w = wọ and w = —wpo. 


e Example 17.2.3: White noise 


There are instances when we want to approximate random signals where the autocor- 
relation function is very narrow and very large about 7 — 0. In those cases we construct an 
idealization of the autocorrelation function by using the impulse or delta function ó(7). 

In the present case when Rx (7) = C ó(r), the power spectrum is 


w) =| Có(r)e *** dr = C. (17.2.14) 


Thus, the power spectrum here is a flat spectrum whose value is equal to C. Because the 
power spectrum is a flat for all frequencies, it is often called “white noise” since it contains 
all frequencies with equal weight. 

An alternative derivation involves the random telegraph that we introduced in Example 
17.0.3. As the switching rate becomes large and the rate À approaches infinity, its amplitude 
increases as VÀ. Because Rx(0) increases linearly with A, the autocorrelation function 
becomes 


Rx(r) = CAexp(-2A[7 |). (17.2.15) 
'The resulting power spectrum equals 
4 2 
Bo ee i aE (17.2.16) 


X> W2 +42 a> 1 + [w/ (24)? 


The power spectrum is again flat for all frequencies. 

'The autocorrelation for white noise is an idealization because it has infinite average 
power. Obviously no real signal has infinite power since in practice the power spectrum 
decays eventually. Nevertheless, white noise is still quite useful because the decay usu- 
ally occurs at such high frequencies that we can tolerate the errors of introducing a flat 
spectrum. 


e Example 17.2.4: Random telegraph signal 


In Example 17.0.3 we introduced the random telegraph signal: X(t) equals either +h 
or —h, changing its value from one to the other in Poisson-distributed moments of time. 
The probability of n changes in a time interval T is 


(A7) e^ 


n! 


P,(n) = 


, (17.2.17) 
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where denotes the average frequency of changes. 

To compute the power spectrum we must first compute the correlation function via 
the product X(t)X(t +7). This product equals h? or —h?, depending on whether X(t) = 
X(t+7) or X(t) = —X(t + 7), respectively. These latter relationships depend on the 
number of changes during the time interval. Now, 


P[X (t) = X(t +7)] = P,(n even) = e^ V7 


n=1 


= e^ cosh(Ar), (17.2.18) 


Sy =e smh(Ar) — (17.2.19) 


— (2n + 1) 
Therefore, 
E|X (t) X (t +7)] = h? P, (n even) — h? P, (n odd) (17.2.20) 
= h?e^* [cosh(A7) — sinh(AT)] (17.2.21) 
= keal, (17.2.22) 


We have introduced the absolute value sign in Equation 17.2.24 because our derivation was 
based on t2 > tı and the absolute value sign takes care of the case tg < t1. 
Using Problem 1, we have that 


Ah? A 


we + 4)2° (17.2.23) 


Sx(w) = ue f e247 cos(Ar) dr = 
0 


Problems 


1. Show that EM 
Sx(w) — 2 | Rx (7) cos(wr) dr. 
0 


17.3 TWO-STATE MARKOV CHAINS 


A Markov chain is à probabilistic model in which the outcomes of successive trials 
depend only on its immediate predecessors. The mathematical description of a Markov 
chain involves the concepts of states and state transition. If X,, — i, then we have a process 
with state i and time n. Given a process in state 7, there is a fixed probability P;; that state 
i will transition into state j. In this section we focus on the situation of just two states. 

Imagine that you want to predict the chance of rainfall tomorrow.! From close obser- 
vation you note that the chance of rain tomorrow depends only on whether it is raining 
today and not on past weather conditions. From your observations you find that if it rains 
today, then it will rain tomorrow with probability o, and if it does not rain today, then 
the chance it will rain tomorrow is 8. Assuming that these probabilities of changes are 
stationary (unchanging), you would like to answer the following questions: 


1 See, for example, Gabriel, K. R., and J. Neumann, 1962: A Markov chain model for daily rainfall 
occurrence at Tel Aviv. Quart. J. R. Met. Soc., 88, 90-95. 
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1. Given that is raining (or not raining), what are the chances of it raining in eight days? 


2. Suppose the day is rainy (or dry), how long will the current weather remain before it 
changes for the first time? 


3. Suppose it begins to rain during the week, how long does it take before it stops? 


If the weather observation takes place at noon, we have a discrete parameter process; 
the two possible states of the process are rain and no rain. Let these be denoted by 0 for no 
rain and 1 for rain. The four possible transitions are (0 > 0), (0 > 1), (1 > 0), and (1 > 1). 
Let X, be the state of the process at the nth time point. We have X, — 0,1. Clearly, 
[X4,n = 0,1,2,...} is a two-state Markov chain. Therefore questions about precipitation 
can be answered if all the properties of the two-state Markov chains are known. Let 


PUM =P = a), dgycued mew (17.3.1) 


pue "^ denotes the probability that the state of the process at the nth time point is j given 
that it was at state 7 at the mth time point. Furthermore, if this probability is larger for 
i = j than when i Æ j, the system prefers to stay or persist in whatever state it is. When 
n — m 4 1, we have that 


POE =P Kini = 9 Bes ent). (17.3.2) 


This is known as the one-step transition probability, given that the process is at i at time 
m. 

There are two possibilities: either po mL) depends on m or Py is independent 
of m, where m is the initial value of the time parameter. Our precipitation model is an 
example of a second type of process in which the one-step transition probabilities do not 
change with time. Such processes are known as time homogeneous. Presently we shall 
restrict ourselves only to these processes. Consequently, without loss of generality we can 
use the following notation for the probabilities: 


(m,m+1) . 


Pij = P(Xm41 = g|Xm = i) for all mM, (17.3.3) 


and 


ne =P(Xmin =j|Xm =i) forall m. (17.3.4) 


| Chapman-Kolmogorov equation 


The Chapman?-Kolmogorov® equations provide a mechanism for computing the tran- 
sition probabilities after n steps. The n-step transition probabilities PE? denote the prob- 
ability that a process in state i will be in state j after n transitions, or 


PP =P  nz0, 4,920. (17.3.5) 


2 Chapman, S., 1928: On the Brownian displacements and thermal diffusion of grains suspended via 
non-uniform fluid. Proc. R. Soc. London, Ser. A, 119, 34-54. 


3 Kolmogorov, A. N., 1931: Über die analytischen Methoden in der Wahrscheinlichkeitsrechnung. Math. 
Ann., 104, 415-458. 
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Therefore, Pe ^- Pjj. The Chapman-Kolmogorov equations give a method for computing 


these n-step transition probabilities via 


PONS -5P PO nmz0, (17.3.6) 


for all ¿ and 7. Here Pd represents the probability that the ith starting process will 
go to state j in n 4- m transitions via a path that takes it into state k at the nth transition. 
Equation 17.3.6 follows from 


py spa ga Se] = SP Xen j, Xn = k|Xo = d] (17.3.7) 


> 
tl 
fon) 


=y PD = j[X, = 5 Xo o4] PX, = [Xo =] -5 POPS. (PTS) 
k=0 


| Transmission probability matrix 


Returning to the task at hand, we have that 


PË = POHD = P. p= P?, (17.3.9) 


and by induction 
PO = p(n-tt1) 2 p(n-D,. p= pn. (17.3.10) 


where P(?) denotes the transition matrix after n steps. 

From our derivation, we see that: (1) The one-step transition probability matrix com- 
pletely defines the time-homogeneous two-state Markov chain. (2) All transition probability 
matrices show the important property that the elements in any of their rows add up to one. 
'This follows from the fact that the elements of a row represent the probabilities of mutually 
exclusive and exhaustive events on a sample space. 

For two-state Markov processes, this means that 


PO + PM 21, and P+ PO =1. (17.3.11) 


Furthermore, with the one-step transmission probability matrix: 
p=(*5° ron 0<a,b<1, |l—-a-b|«1l, (17.3.12) 


then the n-step transmission probability matrix is 


pin) pm b paled" a nan)" 
io PAM T x ww. ous 17.3.13 
Gi i») > D ee ale 


This follows from the Chapman-Kolmogorov equation that 


PY) 21-a, (17.3.14) 
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Table 17.3.1: The Probability of Rain on the nth Day. 


n Poo Pro Poi Pu 
1 0.7000 0.2000 0.3000 0.8000 
2 0.5500 0.3000 0.4500 0.7000 
3 0.4750 0.3500 0.5250 0.6500 
4 0.4375 0.3750 0.5625 0.6250 
5 0.4187 0.3875 0.5813 0.6125 
6 0.4094 0.3938 0.5906 0.6063 
7 0.4047 0.3969 0.5953 0.6031 
8 0.4023 0.3984 0.5977 0.6016 
9 0.4012 0.3992 0.5988 0.6008 
10 0.4006 0.3996 0.5994 0.6004 
oo 0.4000 0.4000 0.6000 0.6000 
and 
P -ü-aPCU.b.p n1 (17.3.15) 
=b+(1—a—b) Pe, (17.3.16) 


since Po =1- Pm, Solving these equations recursively for n = 1,2,3,... and simplifying, 


we obtain Equation 17.3.13 as long as both a and b do not equal zero. 
e Example 17.3.1 


Consider a precipitation model where the chance for rain depends only on whether it 
rained yesterday. If we denote the occurrence of rain by state 0 and state 1 denotes no rain, 
then observations might give you a transition probability that looks like: 


0.7 0.3 
P= [o bs (17.3.17) 


Given that the atmosphere starts today with any one of these states, the probability of 
finding that it is raining on the nth day is given by P". Table 17.3.1 illustrates the results 
as a function of n. T'hus, regardless of whether it rains today or not, in ten days the chance 
for rain is 0.4 while the chance for no rain is 0.6. 


| Limiting behavior 


As Table 17.3.1 suggests, as our Markov chain evolves, it reaches some steady-state. 
Let us explore this limit of n — oo because it often provides a simple and insightful repre- 
sentation of a Markov process. 

For large values of n it is possible to show that the limiting probability distribution of 
states is independent of the initial value. In particular, for |l — a — b| < 1, we have that 


b2 a 
lim P™ = E >) ; (17.3.18) 
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This follows from lim; ,5,(1 — a — b)” > 0 since |1 — a— b| < 1. From Equation 17.3.13 the 
second term in each of the elements of the matrix tends to zero as n — oo. 

Let us denote these limiting probabilities by m; = lim, ss P . Then, from Equation 
17.3.18, 


Too = 710 = = Tho, and Tol = 711 = 71; (17.3.19) 


a d 
a+b a+b 


and these limiting distributions are independent of the initial state. 


| Number of visits to a certain state 


When a random process visits several states we would like to know the number of visits 
to a certain state. Let ni denote the number of visits the two-state Markov chain {Xn} 


makes to state j, starting initially at state i, in n time periods. If ue denotes the expected 
number of visits that the process makes to state j in n steps after it originally started at 
state i, and the transition probability matrix P of the two-state Markov chain is 


1—a a 
P= ( b a (17.3.20) 


with |l — a — b| < 1, then 


(n) nb. + aedi Gae na el-e- mcd 
leg = a GOSS e404): na , BUR bee (17.3.21) 
a+b (a+b)? a+b ' (a+b)? 
To prove Equation 17.3.21, we introduce a random variable aie where 
(m) J 1, if X, = j and Xo = i, 
d x { 0, otherwise, ura) 


for à, j = 0,1. This random variable yj? gives the time at which the process visits state j. 
The probability distribution of Y,\") for fixed k is 


(n) 
Y; 0 1 
Probability 1- PO p 


Thus, we have that 
EYP] =PP, ij-201; k-21,2,..,n. (17.3.23) 


Because Y equals 1 whenever the process is in state j and 0 when it is not in j, the 


number of visits to j, starting originally from i, in n steps is 


(2) |y) 4 y@) (n) 
NS Y HYE + + YQ”. (17.3.24) 
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Taking the expected values and using the property that the expectation of a sum is the sum 
of expectations, 


i = E[ND| = PY + PP +--+ PP SIBI. (17.3.25) 


From Equation 17.3.13, we substitute for each p and find 


(k) Z | a(1— a — b)* 
uo = lod =) ee pec e 17.3.26) 
k=1 
2 1—a-b)* 
=J Py S est 17.3.2 
n -5 01 SIL. EE 3.27) 
(n) _ a wef è b(1-a-b)* 
Hio = > Pio = >, F "E XUL ; 17.3.28) 
and : . 
(n) . a b(1 —0— b) 
m 17.3.29 
Hii yei F l: Tb + Te ; ( ) 
finally, noting that 
27 -— (17.3.30) 
a+b a+b 
k=1 
and 
^. a(1— a — b)* a " 
2 1 b 17.3.31 
a+b 2:248 3.31) 
k=l k=1 
x m [(1—a-5)-*(1-a-5b*--*(1-a-0)"] (17.3.32) 
a(1— a — b) zl 
E 1+0 b)+- + (1-a b)" 17.3.3 
gyp (A+ (a8) +--+ (1-a b)"] (17.3.33) 
a(l — a — b) [1 — (1 — a — b)”] 
= 17.3.34 
(a+b) - (1 a - 0] ) 
Here we used the property of a geometric series that 
n-1 
1— g^" 
SEEMS (17.3.35) 
l-r 


e Example 17.3.2 


Let us continue with our precipitation model that we introduced in Example 17.3.1. If 
we wish to know the expected number of days within a week that the atmosphere will be 
in a given state, we have from Equation 17.3.21 that 


(7) 7b a(l-a-b)1-(1—-a-985)] | 
jog: Pag ad (a+b)? = 3.3953, (17.3.36) 
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(m) Tb (1—-a-b)i-(1-a-95)] | 2.4081 
= L—2. 17.3.37 
Hio = q Fb (a +b)? : ) 
mo Ta a(l-a-b)1-(1—-a-95)] | 4 
a =. 17.3.38 
Hn +b (a + 0)? SOE ) 
RS (=od 
(7) 7a 1—a 1 l-a 
a = 4. 17.3.39 
wy? =— + Cay 5969, 3.39) 


since a = 0.3 and b = 0.2. 


| Duration of stay 


In addition to computing the number of visits to a certain state, it would also be useful 
to know the fraction of the discrete time that a process stays in state j out of n when the 
process started in state i. These fractions are: 


lim —— = lim — = mo, (17.3.40) 
noo n n> Nn 
and 
mu, mu 
lig = lim == 7. (17.3.41) 
noo n n> N 


Thus, the limiting probabilities also give the fraction of time that the process spends in the 
two states in the long run. 

If the process is in state i (i = 0,1) at some time, let us compute the number of 
additional time periods it stays in state ? until it moves out of that state. We now want to 
show that this probability distribution o, i = 0, 1, is 


P(ag =n) = a(1— a)", (17.3.42) 
and 
P(oi1 =n) = b(1— b)”, (17.3.43) 
where n — 1,2,3,.... Furthermore, 
E(ag) = (1 — a)/a, E(o1) = (1 — b) /b, (17.3.44) 
and 
Var(ao) = (1 — a)/a?, Var(a1) = (1 — b)/P?, (17.3.45) 


where the transition probability matrix P of the Markov chain {Xn} equals 


1—a a 
P= ( b =) (17.3.46) 


with |1 — a — b| < 1. Clearly a or b cannot equal to zero. 

To prove this we note that at every step the process has two choices: either to stay in 
the same state or to move to the other state. Suppose the process is in state 0 at some time. 
The probability of a sequence of outcomes of the type {00---01} is required. Because of the 


TL 
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property of Markov-dependence, we therefore have the realization of a Bernoulli process with 
n consecutive outcomes of one type followed by an outcome of the other type. Therefore, 
the probability distribution of ap is geometric with (1— a) as the probability of “failure,” 
and the distribution of o4 is geometric with (1— b) as the probability of failure. Thus, from 
Equation 16.6.5, we have that 


P(aog =n) = a(1— a)", (17.3.47) 
and 

P(oi =n) = b(1— b)”, (17.3.48) 
where n = 0,1,2,.... The expression for the mathematical expectation and variance of ag 


and o, easily follow from the corresponding expressions for the geometric distribution. 
e Example 17.3.3 


Let us illustrate our expectation and variance expressions for our precipitation model. 
From Equation 17.3.44 and Equation 17.3.45, we have that 


E(og) = (1 — a)/a = 2.3333,  E(ay) = (1 — b)/b = 4, (17.3.49) 


and 
Var(ag) = (1— a)/a? = 7.7777, | Var(aı) = (1 — b)/b? = 20, (17.3.50) 


since a = 0.3 and b = 0.2. 


Problems 


P= (tn wo) 


(a) compute P” and (b) find limpo P". 


1. Given 


2. Suppose you want to model how your dog learns a new trick. Let Fido be in state 0 if 
he learns the new trick and in state 1 if he fails to learn the trick. Suppose that if he learns 
the trick, he will retain the trick. If he has yet to learn the trick, there is a probability a 
of him learning it with each training session. (a) Write down the transition matrix. (b) 
Compute PC? where n is the number of training sessions. (c) What is the steady-state 
solution? Interpret your result. (d) Compute the expected amount of time that Fido will 
spend in each state during n training sessions. 


17.4 BIRTH AND DEATH PROCESSES 


In the previous section we considered two-state Markov chains that undergo n steps. As 
the time interval between each step tends to zero, the Markov process becomes continuous 
in time. In this section and the next, we consider two independent examples of continuous 
Markov processes. 

We began Chapter 16 by showing that the deterministic description of birth and death 
is inadequate to explain the extinction of species. Here we will fill out the details of our 
analysis and extend them to population dynamics and chemical kinetics. As Section 1.2 
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showed, deterministic models lead to first-order ordinary differential equations, and this 
description fails when the system initially contains a small number of particles. 
Consider a population of organisms that multiply by the following rules: 


1. The sub-populations generated by two co-existing individuals develop completely in- 
dependently of one another; 

2. an individual existing at time t has a chance  dt+o(dt) of multiplying by binary fission 
during the following time interval of length dt; 

3. the “birth rate" A is the same for all individuals in the population at any time t; 

4. an individual existing at time t has a chance u dt + o(dt) of dying in the following time 
interval of length dt; and 

5. the “death rate" u is the same for all individuals at any time t. 


Rule 3 is usually interpreted in the sense that in each birth just one new member is added 
to the population, but of course mathematically (and because the age structure of the 
population is being ignored) it is not possible to distinguish between this and an alternative 
interpretation in which one of the parents dies when the birth occurs and is replaced by 
two children. 

Let no be the number of individuals at the initial time t = 0 and let p, (t) denote the 
probability that the population size N(t) has the value n at the time t. Then 


dpn 
Fr = (n= DS SRA + H)Pn +UI DPap n21, (17.4.1) 
and dpo(t) 
E = upi(t), (17.4.2) 
subject to the initial condition that 
j=l oo P UK 17.4.8 
mme qu n X ng. (17.4.3) 


Equation 17.4.1 through Equation 17.4.3 constitute a system of linear ordinary equa- 
tions. The question now turns on how to solve them most efficiently. To this end we 
introduce a probability-generating function: 


oo 


$(2,t) = M z"p(t). (17.4.4) 


n=0 


Summing Equation 17.4.1 from n = 1 to oo after we multiplied it by z" and using Equation 
17.4.2, we obtain 


oo „dpn oo 
b» 2 2 n—1)z"p, 1(t) — (A4- ui) > nz"p,(t)--u 3 (n+1)z”pn+1(t). (17.4.5) 
Because 

= dpn OQ 
DE ELI 17.4. 
dU? dt ot’ ( 6) 


So nz™pn(t) = 2 5) nz" pa(t) = get (17.4.7) 
n=1 n=0 
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Son — 1)z"pn_i(t) = 5 kz" pit) = 2? » kz"-!py(t) = pot 
n=1 k=0 k=0 d 
and 
oo oo a5 8 
$n t 1)2"^ Pn (t) = 5 kz^- py (t) = 2. kz"! p(t) = E 
n=0 k=1 k=0 


subject to the initial condition 


(17.4.8) 


(17.4.9) 


(17.4.10) 


(17.4.11) 


(17.4.12) 


$(z,0) = 27°. 
Equation 17.4.10 is an example of a first-order partial differential equation of the general 
form a ə 
u u 
Ox Oy 


This equation has solutions! of the form u(x, y) = f(£) where f(-) is an arbitrary function 


that is differentiable and £(z, y) = constant are solutions to 


dx dy 


P(x,y) Q(z,y) 


In the present case, 


dt dz dz dz 


1^ pene- A-D Q0-mÓm6-nuAY 


Integrating Equation 17.4.14, 


or 
Elz, t) = yle) O, 
where a 
pz) = ==. 
z—1l 


'Therefore, the general solution is 


$(5t) = [oieee]. 


(17.4.13) 


(17.4.14) 


17.4.15) 


17.4.16) 


17.4.17) 


(17.4.18) 


Our remaining task is to find f(-). From the initial condition, Equation 17.4.11, we 


have that 
$(2,0) = f[v(z)) = z^". 


(17.4.19) 


^ See Webster, A. G., 1966: Partial Differential Equations of Mathematical Physics. Dover, 446 pp. 


See Section 22. 
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Because z = [u — v(z)]/[A — v(z)], then 


f(y) = =y ; (17.4.20) 


Therefore, 


pv (e T (17.4.21) 


$0 = eec 


Once we find ó(z,t), we can compute the probabilities of each of the species from the 
probability generating function. For example, 


P{N(t) = 0|N(0) = no) = po(t) = &(0, t). (17.4.22) 


From Equation 17.4.17 we have (0) = u and 


no 
ED [1 = ér ioc) 
$(0,t) = [E ; AZ (17.4.23) 
and 2 
Se” es (17.4.24) 
UY NT NM ^ Pa zu 
An important observation from Equation 17.4.23 and Equation 17.4.24 is that 
jim po(t) = 1, à<, (17.4.25) 
—oo 
and 
lim po(t) = (5 A» (17.4.26) 
mue S 


This limit can be interpreted as the probability of extinction of the population in a finite 
time. Consequently there will be “almost certain" extinction whenever A < jq. These 
results, which are true whatever the initial number of individuals may be, show very clearly 
the inadequacy of the deterministic description of population dynamics. 

Finally, let us compute the mean and variance for the birth and death process. The 
expected number of individuals at time t is 


m(t) = ENH] = So npa(t) = So npn(t). (17.4.27) 
n=0 n=1 
Now 
d 2 dp s 
T ue n(A + p)pa + u(n + 1)Pngtl (17.4.28) 
ZARI (n — 1) pn- AS n.— Dpa-i—(A- u) 5 9 TN (n+ 1? ps1 
n=l n=1 n=1 n=1 
EC 1)Pn+1 (17.4.29) 
n=1 


=-A+p) Sopp EAM Ppm uM ke’ pet AS ipic pw S kpr (17.4.30) 
n=1 1=0 k=2 1=0 k=2 


878 Advanced Engineering Mathematics with MATLAB 


In the first three sums in Equation 17.4.30, terms from i, k, n — 2, and onward cancel and 
leave —(A + u)pi + Apy = —upi. Therefore, 


m min - DNI A -— p) mt — u)m (17.4.31) 


If we choose the initial condition m(0) — no, the solution is 
m(t) = noeh =t. (17.4.32) 


This is the same as the deterministic result, Equation 1.2.9, with the birth rate b replaced 
by A and the death rate d replaced by u. Furthermore, if A = u, the mean size of the 
population is constant. 

The second moment of N (t) is 


t) =X npt). (17.4.33) 
n=0 
Proceeding as before, we have that 
dM 2 dpn 
P M s n? [A(n — 1)pn-1 — (A+ p)n pn + u(n + 1)pn+1] (17.4.34) 


2 n= H 


+u A +1)? pn -2u aC +1)? pny + pb aC +1)pn41 (17.4.35) 


n=1 n1 n=1 
— AM Ep, +S kpr EA kp — (AFH) S npa 
k=1 k=1 k—1 n=1 
+u> Pp- 24V > Pp + nO ipi. (17.4.36) 
1—2 1—2 i=2 


The three sums, which contain either i? or k? or n? in them, cancel when i, k,n = 2 and 
onward; these three sums reduce to —upı. The sums that involve i? or k? can be written 
in terms of M(t). Finally, the sums involving i and k can be expressed in terms of m(t). 
Therefore, Equation 17.4.36 becomes the first-order ordinary differential equation 


—— —2(A — u)M = (A+ u)m(t) = (A+ unge ^t, (17.4.37) 
with M (0) = n. 
Equation 17.4.37 can be solved exactly using the technique of integrating factors from 
Section 1.5. Its solution is 


M (t) = n2g?o-9t S gc Capes = 1| l (17.4.38) 
-p 


From the definition of variance, Equation 16.6.5, the variance of the population in the birth 
and death process equals 


Var[N ()] = nog me Coes = 1| , ek (17.4.39) 
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or 
Var[N (t)] = 2Anot, À = p. (17.4.40) 


e Example 17.4.1: Chemical kinetics 


The use of Markov processes to describe birth and death has become quite popular. 
Indeed, it can be applied to any phenomena where something is being created or destroyed. 
Here we illustrate its application in chemical kinetics. 

Let the random variable X (t) be the number of A molecules in a unimolecular reaction 
A — B (such as radioactive decay) at time t. A stochastic model that describes the decrease 
of A can be constructed from the following assumptions: 


1. The probability of transition from n to n — 1 in the time interval (t, t + At) is nAAt + 
o(At) where A is a constant and o(At) denotes that o(At)/At > 0 as At > 0. 


2. The probability of a transition from n to n — j, j > 1, in the time interval (t, t + At) is 
at least o(At) because the time interval is so small that only one molecule undergoes 
a transition. 


3. The reverse reaction occurs with probability zero. 
The equation that governs the probability that X(t) — n is 
pa (t + At) = (n+ 1)AAtps qa (t) + (1 — AnAt)pn(t) + o(At). (17.4.41) 


Transposing p,,(t) from the right side, dividing by At, and taking the limit At — 0, we 
obtain the differential-difference equation? 


dpn 


P^ — (n+ Dp (€) = Apa (Ë). (17.4.42) 


Equation 17.4.42 is frequently called the stochastic master equation. The first term on the 
right side of this equation vanishes when n = ng. 

The solution of Equation 17.4.42 once again involves introducing a generating function 
for pn(t), namely 


no 
F(nt- v se Jet (17.4.43) 
n=0 


Summing Equation 17.4.42 from n = 0 to no after multiplying it by z”, we find 


no no—1 no 
P» P » (n + De pa (t) = 2 pa (t). (17.4.44) 
Because 
no 
dp, OF 
RUE ur 17.4.4 
2? dt — at’ ( 5) 
n=0 
ES i OF 
Dn" pnt = 2 - A (17.4.46) 


5 McQuarrie, D. A., 1963: Kinetics of small systems. I. J. Chem. Phys., 38, 433-436. 
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and 
no—1 no no OF 
So (n+ Dz" piat) = V hz Ip (t) = X k2**p, (t) = 35 (17.4.47) 
n=0 k=1 k=0 


Equation 17.4.44 becomes the first-order partial differential equation 


OF Xn 
i aaz 


(17.4.48) 


The solution of Equation 17.4.48 follows the method used to solve Equation 17.4.10. 
Here we find ¿(z,t) via 
dt dz 


17 Xe- (17.4.49) 

li £(z,t) = (z-De™. (17.4.50) 
Therefore, 

F(z,t) = f[(z-De~™’] . (17.4.51) 


To find f(-), we use the initial condition that F(z, 0) = 2"°. This yields f(y) = (1+y)”° 
and 
F(z,t) = [1--(z-1)e *]"^. (17.4.52) 


Once again, we can compute the mean and variance of this process. Because 


a e = 30, (17.4.53) 
the mean is given by 
Ex (o = 28 mE (17.4.54) 
To compute the variance, we first compute the second moment. Since 
T = Yon (17.4.55) 
and m 
2 (m) = 2n 7 p (17.4.56) 
we have that n 
Y n?p,(t) = PO JN oT i (17.4.57) 
n=0 


From Equation 16.6.5, the final result is 


(17.4.58) 


Var[x(t)] = 2 me ii Zaun Ba | 


Upon substituting Equation 17.4.52 into Equations 17.4.54 and 17.4.58, the mean and 
variance for this process are 


E[X(t)] = noe, and Var[X (t)] = noe * (1- e) , (17.4.59) 
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Because the expected value of the stochastic representation also equals the deterministic 
result, the two representations are “consistent in the mean." Further study shows that this 
is true only for unimolecular reactions. Upon expanding Equation 17.4.52, we find that 


Palt) = (t) ee ae (17.4.60) 


TL 


An alternative method to the generating function involves Laplace transforms. To 
illustrate this method, we again examine the reaction A — B. The stochastic master 
equation is 

dpn 
dt 
Pn(t) = 0 for 0 < n < no, where pn(t) denotes the probability that we have n particles of 
B at time t. The initial condition is that p,,(0) = 1 and p,,(0) = 0 for m z no where no 
denotes the initial number of molecules of B. 
Taking the Laplace transform of Equation 17.4.61, we find that 


= (n — 1)Apn_i(t) — nàpn (t), no € n < oo, (17.4.61) 


sP,(s) = (n — 1)AP, 1(s) — nAP,(s), no < n < oo, (17.4.62) 


and 
s Pno (8) — 1 = —nAPno(s)- (17.4.63) 


Therefore, solving for P, (s), 


|». (n—1)A ee A(n- T s E 
P,(s) = ar P,_-1(s) = D I RA), (17.4.64) 
From partial fractions, 
| (n-1) c (—1)*-"% 
Puis) = ee cy 2 E ie (1AB) 


Taking the inverse Laplace transform, 


(n — 1)! j'- E Em 
Yom 17.4.66 
Pat) no — 1)'( TET k — ng)! Yin — ky! © ) 


=e "E (1) (n — no)! oss 
se aAa (71 (n — no)! =at 17.4.67) 
no — 1)!(n — no)! pur j!(n — ng — j)! 


- {n= 1)le- net LM nno 
"mcis cx (1—e7*) 17.4.68) 


where we introduced j — k — no and eliminated the summation via the binomial theorem. 
Equation 17.4.68 is identical to results" given by Delbrück using another technique. 


6 Ishida, K., 1969: Stochastic model for autocatalytic reaction. Bull. Chem. Soc. Japan, 42, 564—565. 


T Delbrück, M., 1940: Statistical fluctuations in autocatalytic reactions. J. Chem. Phys., 8, 120-124. 
See his Equation 7. 
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e Example 17.4.2 
A 
In the chemical reaction rA 2 B, r molecules of A combine to form one molecule of B. 
H 
If X(t) = n is the number of B molecules, then the probability pn(t) = P{X(t) = n} of 


having n molecules of B is given by 


= — [ny + (N = n)A] pn + (N — n + 1)Apn-1 + (n + 1)upasaa, (17.4.69) 


where 0 < n < N, rN is the total number of molecules of A, A is the rate at which r 
molecules of A combine to produce B, and p is the rate at which B decomposes into A. 
Multiplying Equation 17.4.69 by z” and summing from n = —1 to N + 1, 


N+1 N+1 N+1 N+1 


dpn 
So Pe = NA fmt- p) Y; npe NA, rea 
n=—1 n=—1 n-——1 n=—-1 
N+1 N41 
—A »» (n—1)z"p, icu 5 (n + 1)z" Pn+1. (17.4.70) 
n=—1 n=-1 
Defining 
N+1 
Peg wes cpm (17.4.71) 
n=-1 


with p_1 = pw441 = 0, we have that 


N+1 
OF n dpn 
--1 
ər NH N l N+1 
BE = so nz" lp, = 5 (i + 1)z'piii = 5 (i + 1)2"piad, (17.4.73) 
n=—1 i--2 i=—1 
N+1 
OF 
ta = 5 nz” Pn, (17.4.74) 
e n=-1 
ər NH N42 l N41 
om = »2 nz” Tip, = 5 (i — 1)z'pi-1 = 5 (i — D)z'pia, (17.4.75) 
n=—1 i=0 i--1 
and 
N+1 N+2 N+1 
= 5 gly, = 5 z'Dii-— »2 z^pi-1. (17.4.76) 
n2-1 i=0 i--1 


'Therefore, the differential-difference equation, Equation 17.4.70, can be replaced by 


OF 
Oz 


— = NX(z-1)F + |u — (u — à)z — Az?] (17.4.77) 


Using the same technique as above, this partial differential equation can be written as 


dt dz dF 
-1 (-z(nu-Ae -NAXG-1) (17.4.78) 
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Equation 17.4.78 yields the independent solutions 


1— 
=o = €(z,t) = constant, (17.4.79) 

and 
(u +A) F(z,t) = n(z, t) = another constant, (17.4.80) 


where f(-) is an arbitrary, differentiable function. If there are m units of B at t = 0, 
0<m<N, the initial condition is F(z,0) = 2”. Then, 


l-z ym 
((; + x) BESON (17.4.81) 
or 
OS ar *Ax)tt. (17.4.82) 


After some algebra, we finally find that 


gegeben e pene 


x fu + re“ HIME p Az [1 = pu y" (17.4.83) 


F(z,t)= 


Computing the mean and variance, we obtain 


m N —m)A 
B(X) = [at we a] p Em bL wen] 17.4.84 
2 ax Pte ]* "ES | : | ( ) 
and 
m a, — 
Var(X) = oe È + pe (ran [1 =e al 
"DES lu sa dem A} [1 _ ec] (17.4.85) 
Problems 


1. During their study of growing cancerous cells (with growth rate o), Bartoszyński et 
al? developed a probabilistic model of a tumor that has not yet metastasized. In their 
mathematical derivation a predictive model gives the probability p,(t) that certain nth 
type of cells (out of N) will develop. This probability can change in two ways: (1) Each of 
the existing cells has the probability AnAt + o(At) of mutating to another type between t 
and t4- At. (2) The probability that cells in state n at time t will shed a metastasis between 
t and t + At is unce"/^ At + o(At), where ji is a constant and c is the size of a single cell. 
Setting p = Ac/N and v = uc, the governing equations for pn (t) is 


dpn 


dt B —(p 4- v)ne!/*p,, + p(n + 1e *p, ,1, n= 0,1,2: 44 — 1, (1) 


8 Bartoszyński, R., B. F. Jones, and J. P. Klein, 1985: Some stochastic models of cancer metastases. 
Commun. Statist.-Stochastic Models, 1, 317—339. 
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and 
dpv 
dt 


with the initial conditions py (0) = 1 and p,(0) 2 0 i£ n Z N. 


= -(p- v)Ne'l^py, (2) 


Step 1: Introducing the generating function 


$(z, t) = 5 z”Pn(t), O<z<l, (3) 


show that our system of linear differential-difference equations can be written as the first- 
order partial differential equation 


en 
üt 


= p - (p eye TE (4) 


with $(z,0) = zN. 
Step 2: Solve the partial differential equation in Step 1 and show that 


N 


s= (A) fi (: Exe exp[-alp +») (4^ -1)]] - (5) 


Project: Stochastic Simulation of Chemical Reactions 


Most stochastic descriptions of chemical reactions cannot be attacked analytically and 
numerical simulation is necessary. The purpose of this project is to familiarize you with 
some methods used in the stochastic simulation of chemical reactions. In particular, we will 
use the Lokta reactions given by the reaction equations: 


AO ox. (1) 
X +Y ŻY, (2) 
Yz (3) 


Surprisingly, simple numerical integration of the master equation is not fruitful. This occurs 
because of the number and nature of the independent variables; there is only one master 
equation but N reactants and time for independent variables. 

An alternative to integrating the master equation is a direct stochastic simulation. In 
this approach, the (transition) probability for each reaction is computed: p, = k,axAt, 
po = koxyAt, and p3 = k3yAt, where At is the time between each consecutive state and 
a is the constant number of molecules of A. The obvious question is: Which of these 
probabilities should we use? 

Our first attempt follows Nakanishi:? Assume that At is sufficiently small so that 
pı + po-- pa < 1. Using a normalized uniform distribution, such as MATLAB’s rand, 
compute a random variable r for each time step. Then march forward in time. At each 
time step, there are four possibilities. If 0 < r < pı, then the first reaction occurs and 


9 This is the technique used by Nakanishi, T., 1972: Stochastic analysis of an oscillating chemical 
reaction. J. Phys. Soc. Japan, 32, 1313-1322. 
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Figure 17.4.1: The temporal variation of the molecules in a Lokta reaction when At = 1075, kia = 10, 
k2 = 0.01, ka = 10, and x(0) = y(0) = 1000. 


x(t+ At) = a(t) +1, y(t + At) = y(t). If pı < r € pı + pz, then the second reaction occurs 
and z(t + At) = a(t io 1, y(t + At) = y(t) +1. If pı + pa < r < pi t pa + pa, then the third 
reaction occurs and z(t + At) = x(t), y(t + At) = y(t) — 1. Finally, if pı - po - p3 « r € 1, 
then no reaction occurs and a(t + At) = a(t), y(t + At) = y(t). 

For the first portion of this project, create MATLAB code to simulate our chemical 
reaction using this simulation technique. Explore how your results behave as you vary x(0), 
y(0) and especially At. See Figure 17.4.1. 

One of the difficulties in using Nakanishi’s method is the introduction of At. What 
value should we choose to ensure that pı + po + p3 < 1? Several years later, Gillespie! 
developed a similar algorithm. He introduced three parameters, a, = ky, ax, ag = koxy, and 
a3 = kay, along with ag = a4 + a2 + a3. These parameters a4, a2, and aa are similar to the 
probabilities p1, po, and pa. Similarly, he introduced a random number rz that is chosen 
from a normalized uniform distribution. Then, if 0 < rsao < a1, the first reaction occurs 
and «(t+ At) = x(t) +1, y(t+ At) = y(t). If a1 < reap € a1 + a, then the second reaction 
occurs and z(t + At) = z(t) — 1, y(t4- At) = y(t) - 1. If a1 +a2 < raao € ao, then the third 
reaction occurs and z(t + At) = x(t), y(t + At) = y(t) — 1. Because of his selection criteria 
for the reaction that occurs during a time step, one of the three reactions must take place. 
See Figure 17.4.2. 

'The most radical difference between the Nakanishi and Gillespie schemes involves the 
time step. It is no longer constant but varies with time and equals At = In(1/r4)/ao, where 
rı is a random variable selected from a normalized uniform distribution. The theoretical 
justification for this choice is given in Section III of his paper. 


10 Gillespie, D. T., 1976: A general method for numerically simulating the stochastic time evolution 
of coupled chemical reactions. J. Comput. Phys., 22, 403-434; Gillespie, D. T., 1977: Exact stochastic 
simulation of coupled chemical reactions. J. Phys. Chem., 81, 2340-2361 
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Figure 17.4.2: Same as Figure 17.4.1 except that Gillespie’s method has been used. 


For the second portion of this project, create MATLAB code to simulate our chemical 
reaction using Gillespie’s technique. You might like to plot x(t) vs y(t) and observe the 
patterns that you obtain. 

Finally, for a specific time, compute the probability density function that gives the 
probability that x and y molecules exist. See Figure 17.4.3. 


17.5 POISSON PROCESSES 


The Poisson random process is a counting process that counts the number of occur- 
rences of some particular event as time increases. In other words, for each value of t, there 
is a number N(t), which gives the number of events that occurred during the interval [0, t]. 
For this reason N (t) is a discrete random variable with the set of possible values (0, 1, 2,...}. 
Figure 17.5.1 illustrates a sample function. We can express this process mathematically by 


= Y^ H(t- Tin) (17.5.1) 
n=0 


where T'[n] is the time to the nth arrival, a random sequence of times. The question now 
becomes how to determine the values of Tn]. The answer involves three rather physical 
assumptions. They are: 


1. N(0) — 0. 


2. N(t) has independent and stationary increments. By stationary we mean that for any 
two equal time intervals At; and Atz, the probability of n events in At, equals the 
probability of n events in Ats. By independent we mean that for any time interval 


Random Processes 887 


1.2% 10 1.2% 10 
1 f 1 

0.8 Wt 0.8 
LL zs 
a " E 
n L 
€ E 
2o06 0.6 : 
E n h 
DB 
Im 

04 0.4 

0.2 0.2 

1000 2000 3000 % 1000 2000 3000 
number of x molecules number of y molecules 


Figure 17.4.3: The estimated probability density function for the chemical reactions given by Equations 
(1) through (3) (for X on the left, Y on the right) at time t = 10. Five thousand realizations were used in 
these computations. 


(t, t + At) the probability of n events in (t,t + At) is independent of how many events 
have occurred earlier or how they have occurred. 


3. 
1— AAt, k — 0, 
PIN (t + At) - N(t) =k] = | AA, k=l, (17.5.2) 
0, k >l, 


for all t. Here A equals the expected number of events in an interval of unit length of 

time. Because E[N(t)] = A, it is the average number of events that occur in one unit 

of time and in practice it can be measured experimentally. 

We begin our analysis of Poisson processes by finding P[N(t) = 0] for any t > 0. If 
there are no arrivals in [0, t], then there must be no arrivals in [0, t — At] and also no arrivals 
in (t — At, t]. Therefore, 

P[N(t) 20] = P[N(t — At) 20, N(t) — N(t — At) = 0]. (17.5.3) 
Because N (t) is independent, 
P[N(t) 20] = P[N(t — At) = 0]P[N(t) - N(t— At) =O]. (17.5.4) 
Furthermore, since N(t) is stationary, 
P[N(t) = 0] = P[N(t — At) = O] P[N(t + At) - N(t) 20]. (17.5.5) 


Finally, from Equation 17.5.2, 


P[N(t) = 0] = P[N(t — At) = 0](1 — AAt). (17.5.6) 
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Figure 17.5.1: Schematic of a Poisson process. 
Let us denote P[N(t) = 0] by P(t). Then, 

Po(t) = Po(t — At)(1 — AAt), (17.5.7) 
or 


Polt) - Polt — At) 
At 


Taking the limit as At — 0, we obtain the (linear) differential equation 


= —ABy(t— At). (17.5.8) 


dPo(t) 
dt 


= —APo(t). (17.5.9) 
The solution of Equation 17.5.9 is 
PojtbedXe75 (17.5.10) 


where C is an arbitrary constant. To evaluate C, we have the initial condition P5(0) = 
P[N(0) = 0] = 1 from Axion 1. Therefore, 


P[N(t) 20] = P(t) =e". (17.5.11) 


Next, let us find P; (t) = P[N(t) = 1]. We either have no arrivals in [0, £ — At] and one 
arrival in (t — At, t| or one arrival in [0, t — At] and no arrivals in (t — At, t]. These are the 
only two possibilities because there can be at most one arrival in a time interval At. The 
two events are mutually exclusive. Therefore, 


PIN) 21] = P[N(t— At) = 0, N(t) - N(t— At) = 1] 
+ PING- At) = 0, N(t) — N(t— At) = 0] (17.5.12) 
= P[N(t — At) = 0JP[N(t) - N(t— At) = 1] 
+ P[N(t — At) = 1]P[N(t) - N(t — At) =0] (17.5.13) 
= P[N(t — At) = 0|]P[N (t + At) - N(t) =1] 
+ P[N(t — At) = 1]P[N(t- At) - N(t) = 0] (17.5.14) 
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Equation 17.5.13 follows from independence while Equation 17.5.14 follows from stationar- 
ity. Introducing Pi(t) in Equation 17.5.14 and using Axion 3, 


P (t) = Po(t — At)AAt + P(t — At)(1 — AA), (17.5.15) 
ial Pt PA(C— At 
(0) = = ue AP, (t — At) + APo(t — At). (17.5.16) 
Taking the limit as At — 0, we obtain 
dP. 
i) + AB,(t) = AP (t). (17.5.17) 
In a similar manner, we can prove that 
dP, (t 
ari) + \P;,(t) = APe_1(t), (17.5.18) 


where k = 1,2,3,... and Px(t) = P[N(t) = k]. 
This set of simultaneous linear equations can be solved recursively. Its solution is 


(At) 


Pelt) = exp A) 7, 


k 20,1,2,..., (17.5.19) 


which is the Poisson probability mass function. Here A is the average number of arrivals 
per second. 

In the realization of a Poisson process, one of the important quantities is the arrival 
time, tn, shown in Figure 17.5.1. Of course, the arrival time is also a random process and 
will change with each new realization. A related quantity Z; = t; — t; 4, the time intervals 
between two successive occurrences (interoccurrence times) of Poisson events. We will now 
show that the random variables Z1, Z2, etc., are independent and identically distributed 
with 

P(E,€u)el—g^. £20, n=1,2,3,... (17.5.20) 


We begin by noting that 
P(Z, > t) = PIN (t) = 0] = e~ (17.5.21) 


from Equation 17.5.19. Therefore, Zı has an exponential distribution. 
Let us denote its probability density by pz, (zı). From the joint conditional density 


function, 
& 


P(Z3 > t) = | P(Z» > tZ = 21)DzZ, (21) dz, (17.5.22) 
0 


where 0 < & < t. If Zi = 21, then Z > t if and only if N(t + 21) — N(z1) = 0. Therefore, 
using the independence and stationary properties, 


P(Zs > tZ) = PIN + 1) - N(x) = 0]} = P[N(t) = 0] =e. (17.5.23) 


Consequently, 
P(Z d eg (17.5.24) 
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showing that Zə is also exponential. Also, Zə is independent of Z1. Now, let us introduce 
Pz,(z2) as the probability density of Z1 + Za. By similar arguments we can show that Za 
is also exponential. The final result follows by induction. 


e Example 17.5.1: Random telegraph signal 


We can use the fact that interoccurrence times are independent and identically dis- 
tributed to realize the Poisson process. An important application of this is in the generation 
of the random telegraph signal: X(t) = (—1)N(9. However, no one uses this definition to 
compute the signal; they use the arrival times to change the signal from +1 to —1 or vice 
versa. 

We begin by noting that T; = T; 4-- Zi, with i = 1,2,..., To = 0, and T; is the ith 
arrival time. Each Z; has the same exponent probability density function. From Equation 
16.4.17, 


1 1 
Z;2-l 17.5.25 
n n(; = z) ( ) 
where the U;'s are from a uniform distribution. The realization of a random telegraphic 
signal is given by the MATLAB code: 


clear 
N = 100; % number of switches in realization 
lambda = 0.15; % switching rate 
Re L J; 
^ generate N uniformly distributed random variables 
S = rand(1,N); 
^4 transform S into an exponential random variable 
T = - log(S)/lambda; 
cumsum(T); ^ compute switching times 
[0.01:0.01:100]; % create time array 
icount - 1; amplitude - -1; ^ initialize X(t) 
for k - 1:10000 
if ( t(k) >= V(icount) ) % at each switching point 
icount - icount -* 1; 


ct ox 
uo gd 


amplitude = - amplitude; % switch sign 
end 
X(k) = amplitude; % generate X(t) 


end 

plot(t,X) % plot results 
xlabel(’\it t’,’FontSize’ ,25); 
ylabel(’\it X(t)/a’,’FontSize’ ,25) ; 
axis([O max(t) -1.1 1.1]) 


This was the MATLAB code that was used to generate Figure 17.5.2. 


e Example 17.0.2 


It takes a workman an average of one hour to put a widget together. Assuming that 
the task can be modeled as a Poisson process, what is the probability that a workman can 
build 12 widgets during an eight-hour shift? 
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The probability that n widgets can be constructed by time t is 


P[N(t) =n] = ee (17.5.26) 


Therefore, the probability that 12 or more widgets can be constructed in eight hours is 


PING S 12] = e8 y = = 0.1119, (17.5.27) 
n: 


n=12 


since À = 1. 
We could have also obtained our results by creating 12 exponentially distributed time 
periods and summed them together using MATLAB: 


t uniform = rand(1,12); 
T = - log(1-t uniform); 
total time = sum(T); 


Then, by executing this code a large number N of times and counting the number icount 
of times that total time «- 8, the probability equals icount / N. 


Problems 
1. Use the generating function 
F(z, t) =X pas, den 
n=0 


with F(z,0) = 1 to solve Equation 17.5.18 by showing that F(z,t) = e*@-!). Then, by 
expanding F'(z,t), recover Equation 17.5.19. 


Project: Output from a Filter 
When the Input Is a Random Telegraphic Signal !! 


In the study of many systems, such as linear filters, the output y(-) can be written as 


y(t) — L W(t — r)a(7) dr, 


where W(-) is the weight function and x(-) is the input. The purpose of this project is 
to explore the probability density P(y) of the output when x(t) is the random telegraphic 
signal, a Poisson random process. You will filter this input two ways: (1) ideal integrator 
with finite memory: W(t) = H(t) — H(t — 71), 1 > 0, and (2) simple RC = 1 low-pass 
filter W(t) = e * H(t). 


Step 1: Use MATLAB to code x(t) where the expected time between the zeros is A. 
Step 2: Develop MATLAB code to compute y(t) for each of the weight functions W(t). 
Step 3: Compute P(y) for both filters. How do your results vary as À varies? 


11 Suggested by a paper by McFadden, J. A., 1959: The probability density of the output of a filter when 
the input is a random telegraphic signal: Differential-equation approach. IRE Trans. Circuit Theory, 6, 
228-233. 
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Figure 17.5.1: The probability density P(y) of the output from an ideal integrator with finite memory 
when the input is a random telegraphic signal when At = 0.01, A = 2, and 7, = 10. 


P(y) 


Figure 17.5.2: The probability density P(y) of the output from a simple RC = 1 filter, when the input is 
a random telegraphic signal, when A — 1, and At — 0.05. 


Further Readings 
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Chapter 18 


Itó's Stochastic Calculus 


In Chapter 1 we studied the solution to first-order ordinary differential equations 


» — a(t, x), z(0) = zo. (18.0.1) 


There we showed that Equation 18.0.1 has the solution 


t 
x(t) = x(0) «f aln, x ()] dn. (18.0.2) 
0 
Consider now the analogous stochastic differential equation: 
dX (t) = alt, X (t)] dt, X(0) = Xo. (18.0.3) 


Although Equations 18.0.1 and 18.0.3 formally appear the same, an immediate question is 
what is meant by d.X(t). In elementary calculus the concept of the infinitesimal involves 
limits, continuity, and so forth. As we shall see in Section 18.2, Brownian motion, a very 
common stochastic process, is nowhere differentiable. Here we can merely say that dX(t) = 
X (t 4- dt) — X(t). 

Consider now a modification of Equation 18.0.3 where we introduce a random forcing: 


dX(t) = alt, X(t)] dt + blt, X(£)) dB(,  X(0) = Xo. (18.0.4) 


Here dB(t) = B(t + dt) — B(t), B(t) denotes Brownian motion and aft, X (t)] and bft, X (t)] 
are deterministic functions. Consequently, changes to X (t) result from (1) the effects of the 
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initial conditions and (2) noise generated by Brownian motion (the driving force). Stochastic 
processes governed by Equation 18.0.4 are referred to as It6 processes. 

Following the methods used to derive 18.0.2 we can formally write the solution to 
Equation 18.0.4 as 


Speedo | a[n, X (5)] dn + f bin, X(n)] dB(n). (18.0.5) 


The first integral in Equation 18.0.5 is the conventional Riemann integral from elementary 
calculus and is well understood. The second integral, however, is new and must be treated 
with care. It is called /tó's stochastic integral and treated in Section 18.3. 

In summary, a simple analog to first-order ordinary differential equations for a single 
random variable X (t) raises several important questions. What is meant by the infinitismal 
and the integral in stochastic calculus? In this chapter we will focus on Itó processes and 
the associated calculus. Although Itó's calculus is an important discipline, it is not the only 
form of stochastic calculus. The interested student is referred elsewhere for further study. 


Problems 


1. The Poisson random process N (t) is defined by 
n=1 


where tn is a sequence of independent and identically distributed inter-arrival times tn. 
A graphical representation of N(t) would consist of ever-increasing steps with the edges 
located at t = tn. Use the definition of dN (t) = N(t + dt) — N(t) to show that 


1, for t — t4, 
otherwise. 
2. The telegraph signal is defined by X(t) = (—1)N(9, where N(t) is given by the Poisson 
random distribution in Problem 1. Show! that 
dX(t) = X(t + dt) - X(t) = (-1)% [De -1| 2-2x(t)aN(. 
Hint: Consider dN (t) at various times. 
3. If X(t) and Y (t) denote two stochastic processes, use the definition of the derivative to 


show that (a) d[cX (t)] = cdX(t), where c is a constant, (b) d[X (t) EY (t)] = dX (t) E aY (t), 
and (c) d XOY (t)] = X(t) aY (t) + Y (t) aX (t) + aX (t) dY (t). 


— 


18.1 RANDOM DIFFERENTIAL EQUATIONS 


A large portion of this book has been devoted to solving differential equations. Here 
we examine the response of differential equations to random forcing where the differential 


1 Taken from Janaswamy, R., 2013: On random time and on the relation between wave and telegraph 
equation. [EEE Trans. Antennas Propag., 61, 2735-2744. 
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equation describes a nonrandom process. This is an important question in the sciences and 
engineering because noise, a random phenomenon, is ubiquitous in nature. 

Because the solution to random differential equations can be found by conventional 
techniques, we can use them to study the effect of randomness on the robustness of a 
solution to a differential equation subject to small changes of the initial condition. Although 
this may be of considerable engineering interest, it is really too simple to develop a deep 
understanding of stochastic differential equations. 


e Example 18.1.1: LR circuit 


One of the simplest differential equations that we encountered in Chapter 1 involves 
the mathematical model for an LR electrical circuit: 
dI 
Lu + RI = E(t), (18.1.1) 
where I(t) denotes the current within an electrical circuit with inductance L and resistance 
R, and E(t) is the mean electromotive force. If we solve this first-order ordinary differential 
equation using an integrating factor, its solution is 


I(t) = 1(0) ew( 4 + zo +) [ro e(t) d (18.1.2) 


Clearly, if the electromotive forcing is random, so is the current. 

In the previous chapter we showed that the mean and variance were useful parameters 
in characterizing a random variable. This will also be true here. If we find the mean of the 
solution, 


E[(t)] = I(0) exp (-£) (18.1.3) 


provided E[F(t)| = 0. Thus, the mean of the current is the same as that for an ideal LR 
circuit. 
Turning to the variance, 


oit) = E[P (0) - CEU CO] (18.1.4) 


Es E['() se (-52:)] 


| 2) ew( y [ aro) dr 


| E ew( y [ [prore exp| MFT) dede (18.1.5) 
— I? (0) exp (-35) 
= ze) [ [prore exp| SH dr dr. (18.1.6) 


To proceed further we need the autocorrelation E|F(r)F(r’)|. In papers by Ornstein 
et al.? and Jones and McCombie,? they adopted a random process with the autocorrelation 


2 Ornstein, L. S., H. C. Burger, J. Taylor, and W. Clarkson, 1927: The Brownian movement of a 
galvanometer and the influence of the temperature of the outer circuit. Proc. Roy. Soc. London, Ser. A, 
115, 391-406. 


? Jones, R. V., and C. W. McCombie, 1952: Brownian fluctuations in galvanometer and galvanometer 
amplifiers. Phil. Trans. Roy. Soc. London, Ser. A, 244, 205-230. 
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function 
E|F(r)F(7)| = 2Dé(r — 7^). (18.1.7) 


'The advantage of this process is that it is mathematically the simplest because it possesses a 
white power spectrum. Unfortunately this random process can never be physically realized 
because it would possess infinite mean square power. All physically realizable processes 
involve a power spectrum that tends to zero at sufficiently high frequencies. If ®(w) denotes 
the power spectrum, this condition can be expressed as 


J $ (uw) dw < oo. (18.1.8) 
0 
In view of these considerations, let us adopt the autocorrelation 
Rx(r-7)- | (w) cos[w(r — 7’)| dw, (18.1.9) 
0 


where ®(w) is the power spectrum of F(T). Therefore, the variance becomes 


a?(t if [px ae] TES exp me) cos|w(t — 7^)] dw dr dr’. 
(18.1.10) 


Reversing the ordering of integration, 


o?(t) = — af x c) f f el- RET] cost - dr dr do. (18.1.11) 


We can evaluate the integrals involving 7 and 7’ exactly. Equation 18.1.11 then becomes 


?* w) i - 
a?(t) -f Ja RJE [1 pe 2RHUL _ Je RU E cos(wt)| dw. (18.1.12) 


Let us now consider some special cases. As t — 0, o7(t) — 0 and the variance is 
initially small. On the other hand, as t > oo, 


gri l DID dw. (18.1.13) 


Thus, the variance grows to a constant value, which we would have found by using Fourier 
transforms to solve the differential equation. 

Consider now the special case ®(w) = 2D/7, a forcing by white noise. Ignoring the 
defects in this model, we can evaluate the integrals in Equation 18.1.13 exactly and find 
that p 

eit) = => (1- e 90m), (18.1.14) 
These results are identical to those found by Uhlenbeck and Ornstein? in their study of a 
free particle in Brownian motion. 


^ Uhlenbeck, G. E., and L. S. Ornstein, 1930: On the theory of the Brownian motion. Phys. Review, 
36, 823-841. See the top of their page 828. 
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e Example 18.1.2: Damped harmonic motion 


Another classic differential equation that we can excite with a random process is the 
damped harmonic oscillator: 


y" + 2éwoy’ + wey = F(t), (18.1.15) 


where 0 < € < 1, y denotes the displacement, t is time, wł = k/m, 2wo = B/m, m is 
the mass of the oscillator, k is the linear spring constant, and 8 denotes the constant of a 
viscous damper. In Chapter 2 we showed how to solve this second-order ordinary differential 
equation. Its solution is 


NN —Ewot Ewo : y (0) —Ewt „a: i 
y(t) = y(0)e cos(w1t) + S sin(wit)| + pU sin(wit) + | h(t — T)F(7) dr, 
1 1 0 
(18.1.16) 
where w1 = woy 1 — €?, and 
e Swot 
h(t) = sin(wıt)H (t). (18.1.17) 
Wy 


Again we begin by finding the mean of Equation 18.1.16. It is 


1 0 t 

E[y(t)] = y(0)e 8" [eost + em sint) UO) oes sin(uit)4- f h(t-rT)E[F(r)] dr. 
1 1 0 

(18.1.18) 

If we again choose a random process where E[F(t)| = 0, the integral vanishes and the 


stochastic mean of the motion only depends on the initial conditions. 
Turning to the variance, 


o7(t) = Ely’ H] - (E[y(0]P? = | J h(t — r)h(t - T)E|F(T)F(T')|drdr’. (18.1.19) 
0 Jo 
If we again adopt the autocorrelation function 
Rx(r—-7)- l (w) cos[w(r — 7")] dw, (18.1.20) 


where ®(w) is the power spectrum of F(T), then 


o?(t) = T i gré tenor!) in[wj (t — 7)] sin[w (t — r’)] cos (T — r’)] dr dr’ dw 
i-f "Af sinan (t — v) sinl (t — 7^)] cos — 7*)] dr dr! dw. 


(18.1.21) 
Carrying out the integrations in 7 and 7’, we finally obtain 


c(t) = l e (: + gn + 260 sin(w t) cos(wit) 


I (7)? Wy 
2Ew 
Wy 


2 
— evost E cos(w1t) + sinant) cos(wt) — e5^ot E sin(w;it) sin(wt) 
Wy 


2,2 ,2 2 
T soia RM sin? (ot) }) dw, (18.1.22) 
wy 
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Figure 18.1.1: The mean and variance of the response for the differential equation y’ + y = f(t) when 
forced by Gaussian random noise. The parameters used are y(0) = 1 and Ar = 0.01. 


where |Q(w)|? = (uà — w?)? + 4w?wé?. 
As in the previous example, o? (t) — 0 as t — 0 and the variance is initially small. The 
steady-state variance now becomes 


To Sw) 
i= n vs dw (18.1.23) 
! o IR) 
Finally, for the special case ®(w) = 2D/r, the variance is 
D e 26wot 
o7(t) = z41 5— [wi + wowr€ sin(2u1t) + 2° wh sin?(uit)] >. (18.1.24) 
2w wi 


These results are identical to those found by Uhlenbeck and Ornstein? in their study of a 
harmonically bound particle in Brownian motion. 


Project: Low-Pass Filter with Random Input 
Consider the initial-value problem 
y ty = f(t), y(0) = vo. 
It has the solution 


y(t) = yoe * + e e' f(r) dr. 


This differential equation is identical to that governing an RC electrical circuit. This circuit 
has the property that it filters out high frequency disturbances. Here we explore the case 
when f(t) is a random process. 


5 Tbid. See their pages 834 and 835. 
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Figure 18.1.2: The probability density function for the response to the differential equation y’ + y = f(t) 
when f(t) is a Gaussian distribution. Twenty thousand realizations were used to compute the density 
function. Here the parameters used are y(0) = 0 and A7 = 0.01. 


oo 


D 
A 


Estimated PDF 
A 
m 


N 
Estimated PDF 


e 


Step 1: Using the MATLAB intrinsic function randn, generate a stationary white noise 
excitation of length N. Let deltat denote the time interval At between each new forcing so 
that n = 1 corresponds to t = 0 and n = N corresponds to the end of the record t = T. 


Step 2: Using the Gaussian random forcing that you created in Step 1, develop a MATLAB 
code to compute y(t) given y(0) and f(t). 


Step 3: Once you have confidence in your code, modify it so that you can generate many 
realizations of y(t). Save your solution as a function of t and realization. Use MATLAB’s 
intrinsic functions mean and var to compute the mean and variance as a function of time. 
Figure 18.1.1 shows the results when 2000 realizations were used. For comparison the mean 
and variance of the forcing have also been included. Ideally this mean and variance should 
be zero and one, respectively. We have also included the exact mean and variance, given 
by Equation 18.1.3 and Equation 18.1.14, when we set L = R = 1 and D = At/2. 


Step 4: Now generalize your MATLAB code so that you can compute the probability density 
function of finding y(t) lying between y and y -- dy at various times. Figure 18.1.2 illustrates 
four times when y(0) = 0 and Ar = 0.01. 


Step 5: Modify your MATLAB code so that you can compute the autocovariance. See Figure 
18.1.3. 


Project: First-Passage Problem with Random Vibrations ? 


In the design of devices, it is often important to know the chance that the device will 
exceed its design criteria. In this project you will examine how often the amplitude of a 


6 Based on a paper by Crandall, S. H., K. L. Chandiramani, and R. G. Cook, 1966: Some first-passage 
problems in random vibration. J. Appl. Mech., 33, 532-538. 
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autocovariance 


Figure 18.1.3: The autocovariance function for the differential equation y' + y = f(t) when f(t) is a 
Gaussian distribution. Twenty thousand realizations were used. The parameters used here are y(0) = 0 
and Ar = 0.01. 
simple, slightly damped harmonic oscillator 

y" t 26woy' + wey = f(t), 0<¢<l, (1) 


will exceed a certain magnitude when forced by white noise. In the physical world this 
transcending of a barrier or passage level leads to “bottoming” or “short circuiting.” 


Step 1: Using the MATLAB command randn, generate a stationary white noise excitation 
of length N. Let deltat denote the time interval At between each new forcing so that n = 
1 corresponds to t = 0 and n = N corresponds to the end of the record t = T. 


Step 2: The exact solution to Equation (1) is 


y(t) = y(O)e eot [es (virer + Tae E] 
" er BÉ sin( V/A - Cunt) (2) 
+ | E sin[V1 - Pwt- 7) pz) 
0 1-9 wo 


= y(0)e "t [e (Env + qo i 
y (0) —Cwot a: 
+ ee e sin( V1 — wot) (3) 
: sin( v 1— C wot 
Vine 


d(woT) 


e $0 


f eio cos( V/1 — 2 wor) fe d(wor) 
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Figure 18.1.4: A realization of the random function y(t) governed by Equation (1) when forced by the 
Gaussian random forcing shown in the top frame. The parameters used here are y(0) = 1, y'(0) = 0.5, 
C = 0.1, and wâr = 0.02. 


A f eso" sin( YT- C wor) Q dari. 


Because you will be computing numerous realizations of y(t) for different f(t)'s, an efficient 
method for evaluating the integrals must be employed. Equation (3) is more efficient than 
Equation (2). 

Using the Gaussian random forcing that you created in Step 1, develop a MATLAB code 
to compute y(t) given y(0), y'(0), ¢ and f(t). Figure 18.1.4 illustrates a realization where 
the trapezoidal rule was used to evaluate the integrals in Equation (3). 


e Swot 


Step 3: Now that you can compute y(t) or y (n) for a given Gaussian random forcing, 
generalize your code so that you can compute irun realizations and store them in y (n,m) 
wherem = 1:irun. For a specific n or wot, you can use MATLAB’s commands mean and var 
to compute the mean jx (t) and the variance oł (t). Figure 18.1.5 shows the results when 
1000 realizations were used. For comparison the mean and variance of the forcing have also 
been included. Ideally this mean and variance should be zero and one, respectively. The 
crosses give the exact results that 


ux (t) = y(0)e- set [erra + A so( VIB) 


QE) tune sin( V1 Punt) (4) 


Qo 1- e 


and Equation 18.1.25 when D — woAt/2. 


Step 4: Finally generalize your MATLAB code so that you store the time T(m) that the 
solution y (n) exceeds a certain amplitude b > 0 for the first time during the realization m. 
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Figure 18.1.5: The mean px (t) and variance c2. (t) of a slightly damped simple harmonic oscillator when 


forced by the Gaussian random noise. The parameters used here are y(0) = 1, y'(0) = 0.5, ¢ = 0.1, and 
woAT = 0.02. 


Of course, you can do this for several different b's during a particular realization. Once you 
have this data you can estimate the probability density function using histc. Figure 18.1.6 
illustrates four probability density functions for b = 0.4, b = 0.8, b = 1.2, and b= 1.6. 


Project: Wave Motion Generated by Random Forcing” 


In the previous projects we examined ordinary differential equations that we forced 
with a random process. Here we wish to extend our investigation to the one-dimensional 
wave equation 


Oui Oru 

—— — a = cos — X(t 

BB Da cos(uwt)ó x (t)], 
subject to the boundary conditions 

lim u(x,t) > 0, 0<t, 
|x|—-0o 
and initial conditions 
u(z,0) = uz(a,0) = 0, —oo < T < oo. 


Here w is a constant and X(t) is a stochastic process. 
In Example 15.4.4 we show that the solution to this problem is 


ulz, t) = jd H[t — 7 —|X(r) — z|] cos(wr) dr. 


" Based on a paper by Knowles, J. K., 1968: Propagation of one-dimensional waves from a source in 
random motion. J. Acoust. Soc. Am., 43, 948-957. 
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Figure 18.1.6: The probability density function that a slightly damped oscillator exceeds b at the time 
woT. Fifty thousand realizations were used to compute the density function. The parameters used here are 
y(0) = 0, y'(0) = 0, ¢ = 0.05, and woAr = 0.05. The mean value of woT is 10.7 when b = 0.4, 41.93 when 
b = 0.8, 188.19 when b = 1.2, and 1406.8 when b = 1.6. 


When the stochastic forcing is absent X(t) = 0, we can evaluate the integral and find that 


u(x,t) = >H — |z|) sin[w(t — |æ|)]. 


Step 1: Invoking the MATLAB command randn, use this Gaussian distribution to numeri- 
cally generate an excitation X (t). 


Step 2: Using the Gaussian distribution from Step 1, develop a MATLAB code to compute 


u(x,t). Figure 18.1.7 illustrates one realization where the trapezoidal rule was used to 
evaluate the integral. 


Step 3: Now that you can compute u(z, t) for a particular random forcing, generalize your 
code so that you can compute irun realizations. Then, for particular values of x and t, 
you can compute the corresponding mean and variance from the irun realizations. Figure 
18.1.8 shows the results when 10,000 realizations were used. 


Step 4: Redo your calculations but use a sine wave with random phase: X(t) = Asin(Ot--£), 


where A and 2 are constants and € is a random variable with a uniform distribution on 
[0, 27]. 


18.2: RANDOM WALK AND BROWNIAN MOTION 
In 1827 the Scottish botanist Robert Brown (1773-1858) investigated the fertilization 


process in a newly discovered species of flower. Brown observed under the microscope that 
when the pollen grains from the flower were suspended in water, they performed a “rapid 
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u(x,t) 


Figure 18.1.7: The solution (realization) of the wave equation when forced by a Gaussian distribution 
and w = 2. In frame (a), there is no stochastic forcing X(t) = 0. Frame (b) shows one realization. 
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Figure 18.1.8: The mean and variance when the wave equation is forced by the stochastic forcing 
cos(wt)ó|r — X(t)], where w = 2 and X(t) is a Gaussian distribution. 


oscillation motion.” This motion, now known as Brownian motion, results from the random 
kinetic strikes on the pollen grain by water molecules. Brownian motion is an example of 
a random process known as random walk. This process has now been discovered in such 
diverse disciplines, from biology® to finance. In this section we examine its nature. 
Consider a particle that moves along a straight line in a series of steps of equal length. 
Each step is taken, either forwards or backwards, with equal probability i. After taking 
N steps the particle could be at any one (let us denote it m) of the following points: 


8 Codling, E. A., M. J. Plank, and S. Benhamou, 2008: Random walk models in biology. J. R. Soc. 
Interface, 5, 813-834. 
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Figure 18.2.1: Three realizations of a one-dimensional random walk where N — 50. 


N,—N -1,...,—-1,0, 1,..., N — 1 and N. Here m is a random variable. 
We can generate realizations of one-dimensional Brownian motion using the MATLAB 
code: 


clear 


NN = 50; % select the number of steps for the particle to take 
t = (0:1:NN); % create ‘‘time’’ as the particle moves 


^ create an array to give the position at each time step 
m = zeros(size(t)); 
m(1) = 0; 4 initialize the position of particle 


for N = 1:NN % now move the particle 
x = rand(1); % generate a random variable lying between (0, 1] 
if (x <= 0.5) step = 1; % if less then 0.5, make it a ‘ ‘head’? 
else step = -1; end % otherwise it is a ‘‘tail’’ 

^ move the particle one step to the right or left 
m(N+1) = m(N) + step; 


end 
^ plot the results 
hold on 


plot(t,m,’--ko’,’LineWidth’ ,2,’MarkerSize’ ,8) 
xlabel(’N’,’FontSize’,25); ylabel(’m’,’FontSize’ ,25) 
grid on % add a grid to axes 


Figure 18.2.1 illustrates three such realizations. 
A natural question would now be: What are the quantitative properties of random 
walk? In particular, what is the probability P(m, N) that the particle is at point m after N 


displacements? We begin by noting the probability of any given sequence of N steps is Gy 
The desired probability P(m, N) equals (ay times the number of distinct sequences of steps 
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that will lead to the point m after N steps. To reach m, we must take (N --m)/2 steps in the 
positive direction and (N—m)/2 in the negative direction since (N --m)/2— (N —m)/2 = m. 
(Note both m and N must be even or odd.) The number of these distinct sequences is 


N! 


[E(N + m)]! [SN — m)]t (18.2.1) 
Therefore, . 
N! 1 
D [$(N + m)]| [3(N — m)]! (5) (18.2.2) 


Comparing these results with Equation 16.6.14, we see that P(m, N) is simply a binomial 
distribution. For this reason, we immediately know E(m) = 0 and Var(m) = N. That 
is, the average position is the origin and the spread of the Brownian motion occurs as the 
square root of steps taken increases. 

The case of greatest interest arises when N is large and m < N. Then we can approx- 
imate P(m, N) by the Poisson distribution, 


P(m, N) = "E (5) . (18.2.3) 


Let us reexpress Equation 18.2.3 in terms of x and t where x = mAz and t = NAt. Using 
these definitions, our equation becomes 


P(z,t) (18.2.4) 


1 ( x? ) 
= ex ; 
2/zDt ^X ADI 
if we define D = (Ax)? /(2At). The attentive student will note that P(z,t) is the Green's 

function for the heat function, Example 15.5.1. 

An alternative approach to this problem would be to compute many random walks and 
then calculate the probability density function from these computations. We can construct 
a MATLAB code to do this. First we would realize many random walks (here 2000) and 
count the number of times that they end at position m: 


clear 


NN = 100; % set the end point of the random walks 

^ introduce intermediate positions along the random walk 
t = (0:1:NN) ; 

^ initialize array 
m = zeros(size(t)); 


‘‘m?? which gives the position at any time 


for icount = 1:2000 % now perform many random walks 
m(1) = 0; % initial position of particle in each walk 


for N - 1:NN 
x = rand(1); % create a random variable between [0,1] 
^ if ’’x’? less than 0.5, we have a ’’heads’’ 
if (x <= 0.5) step = 1; 
else step = -1; end % otherwise we have a ’’tail’’ 
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Figure 18.2.2: Numerical computation of P(m,100) using 2000 random walks. The black line gives 


Equation 18.2.3. 


m(N+1) = m(N) + step; % now take a step forward or backward 
end 


% set up array that tracks of the final position of the particle 
location(icount) = m(N+1); 


end 


xx = -40:1:40; 

^ now count the particles that ended somewhere 

% between -40 and 40 

[n,xout] = hist(location,xx) 

% for comparison, compute Equation 18.2.3 

w exact = sqrt(2/(pi*NN))*exp(-xout.*xout/(2*NN)) ; 

n =n / 2000; % now compute the mass probability function 


^ plot the results 

bar.h = bar(xout,n) 

bar_child = get(bar_h, ’Children’) 

set (bar_child, ’CData’ ,n) 

colormap (Autumn) 

hold on 

plot (xout ,w_exact, ’-k’,’LineWidth’ ,3) 

xlabel(C Mit m’,’FontSize’ ,25) 
ylabel(’Estimated Mit P(m,100)’,’FontSize’ ,25) 


Figure 18.2.2 illustrates the results of simulating random walk. 


e Example 18.2.1: On the probability of striking a barrier 


An important question in engineering is what is the probability that a given random sys- 
tem will exceed its design constraints. Here we ask a similar question about one-dimensional 
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Figure 18.2.3: Several random walks from the origin to point (£, N). All of these walks would be excluded 
from our calculations because they either cross or touch the line m = € before the final step. 


Brownian motion: What is the probability that after taking N steps the particle arrives at 
€ without ever having touched or crossed the line m = € at any earlier step? We will do it 
exactly and then confirm our results using MATLAB. 

'The arrival of the particle at £ after N steps implies that its position after N — 1 steps 
must have been either £ — 1 or £ + 1. However, a trajectory from (£+ 1, N — 1) to (€, N) is 
not allowed because it must have crossed the line m = € earlier. On the other hand, not all 
trajectories arriving at (£, N) from (£ — 1, N — 1) are acceptable because a certain number 
will have touched or crossed the line m = € earlier than its last step. See Figure 18.2.3. 
Thus the number of permitted ways of arriving at € for the first time after N steps equals 
all possible ways of arriving at € minus any arrivals from (€ — 1, N — 1) and any arrivals 
that crossed or touched the line m = £ earlier than the N — 1. 

From our previous work, the number of possible ways from the origin to (£, N) is 


MI (18.2.5) 
[30N ONEN - O]U > 
The number of possible ways from the origin to (£ + 1, N — 1) is 
N — 1)! 
) (18.2.6) 


EN EN - £- 2] 
Finally, the number of trajectories arriving at (€ — 1, N — 1) but having an earlier contact 
with, or a crossing of, the line m — £ is also 
(N — 1)! 
[BN + e Qv - £- 2] 


since it equals the number of trajectories that arrive at (£ + 1, N — 1). From Figure 18.2.3 
we see that, due to symmetry, the trajectory that leads to (£ + 1, N — 1) also leads to 


(18.2.7) 
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One of the great mathematicians of the twentieth century, Norbert Wiener (1894-1964) graduated 
from high school at the age of 11 and Tufts at 14. Obtaining a doctorate in mathematical logic 
at 18, he repeatedly traveled to Europe for further education. His work extends over an extremely 
wide range from stochastic processes to harmonic analysis to cybernetics. (Photo courtesy of the 
MIT Museum with permission.) 


(£ — 1, N — 1). Consequently the number of trajectories from the origin to (£, N) that have 
never touched or crossed m — £ is 


N! j (N — 1)! (Sd 
porog- "Borxerme -e- 3r 4 
or 
€ N! 
N TOES] - 9] n 
The probability P(£, N) that we are seeking is 
£ N! iN 
PE, N) = 5 ETCETSTIER SERE (5) (18.2.10) 


For large N, P(£, N) is approximately given by 


2 
P(E,N) = SN ES ew( A) : (18.2.11) 


We can also compute this probability using the MATLAB code given above. In this 
code we replace the counting process location(icount) = m(N+1); by 


b = sort(m); 


if ( (m(NN+1) == b(NN-1)) & (b(NN+1)>b(NN)) ) 
jcount = jcount + 1; 

location(jcount) = m(NN+1); 

end 


where we initialize jcount = 0 at the beginning. The idea behind this code is as follows: 
For each of the icount trajectories, we use the MATLAB routine sort to arrange them from 
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Estimated P(6,100) 


Figure 18.2.4: The probability P(£, N) that a particle will reach the point m = € without the particle 
ever crossing or touching the line m = £ earlier than N = 100. The solid line is the theoretical probability 
given by Equation 18.2.11. Here 50, 000 random walks were taken. 


the left-most to the right-most position. To be included in the count of particles reaching 
m = € at step N, the last position of the particle must be (£, N) and it may never have 
reached or crossed m = €. The if condition ensures that both conditions are met. If 
they are, that particular walk is accepted. Once again the various right-most positions are 
binned and the probability is computed. Figure 18.2.4 illustrates this process using 5000 
random walks and this result is compared with the probability given by Equation 18.2.11. 


e Example 18.2.2: Wiener process 


Consider the time interval (0,t] and let us subdivide it into subintervals of length At 
so that there are t/At subintervals. Suppose now that a particle, initially at x = 0, takes a 
step (in one space dimension) at the times At, 2At,... and that the size of the step is either 
Ax or —Az, with a probability of 5 that the step is to the left or right. The position of 
the particle X(t) at time t is a random walk, which has executed t/At steps. Because the 
position depends on the choice of At and Az, X(t) depends upon t, At and Az. 
Mathematically we can describe this random process by 


t/ At 
A= 2, (18.2.12) 
n=1 


where the Z;'s are independent and identically distributed with 


P(Z; = At) =P(Z,=—Ax) = 5, (18.2.13) 
and n = 1,2,.... For each Z;, 
E(Z;) — 0, and . Var(Z;) = E(Z2) = (Az)?. (18.2.14) 
From Equation 18.2.12, we see that 
A 2 
E[X()] 20, and Var[X(t)] = E(Z2) = A; Var(Z:) = A (18.2.15) 
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Presently we have said nothing about the relationship between At and Az except 
that both are small. However, we cannot have just any relationship between them because 
the variance would be either zero or infinite. The only reasonable choice is Ax = VAt, 
which makes Var[X (¢)] = t for all values of At. In the limit At > 0 the random variable 
X(t) converges into a random variable, hereafter denoted by B(t), with the properties that 
E|[BD(t) = 0 and Var[B(t)] = t. The collection of random variables {B(t),t > 0} isa 
continuous process in time and called a Wiener process. 


Our previous example shows that Brownian motion and the Wiener process are very 
closely linked. Because Brownian motion occurs in so many physical and biological processes 
we shall focus on that motion (and the corresponding Wiener process) exclusively from now 
on. We define the standard Brownian motion (or Wiener process) B(t) as a stochastic 
process that has the following properties: 


1. It starts at zero: B(0) = 0. 

. Noting that B(t) — B(s) ~ N(0,t—s), E([B(t) - B(s)?) =t—s and Var([B(t) — 
B(s)?) = 2(t — s. Replacing t with t + dt and s with t, we find that 
E([dB(t)]?) = dt. 

. It has stationary and independent increments. Stationary increments means 
that B(t + h) — B(n +h) = B(t) — B(n) for all h. An independent increment 
means B(t2) — B(t1),..., B(t4) — B(t, 1) are independent random variables. 


. Because increments of Brownian motion on adjacent intervals are independent 
regardless of the length of the interval, the derivative will oscillate wildly as 
Ax — 0 and never converge. Consequently, Brownian motion is nowhere differ- 
entiable. 


. It has continuous sample paths, i.e., “no jumps.” 


. The expectation values for the moments are given by 


(2n)!t” 
n!2n ^" 


E[B^"(t)] = 


and  — E[B^"-!(5] =0, 


where n > 0. See Problem 1 at the end of Section 18.4. 


Problems 


1. Show that F{sin[aB(t)]} = 0, where a is a real. 


2. Show that " 
E(cosaB(D]) = Y; 9" ear, 
n=0 


where a is a real. 
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3. Show that E(exp[aB(t)]) = exp(a?t/2), where a is a real. 
Project: Probabilistic Solutions of Laplace's Equation 


In Section 9.7 and Section 9.8 we showed that we could solve Laplace's equation using 
finite difference or finite element methods, respectively. During the 1940s, the apparently 
unrelated fields of random processes and potential theory were shown to be in some sense 
mathematically equivalent.? As a result, it is possible to use Brownian motion to solve La- 
place's equation, as you will discover in this project. The present numerical method is useful 
for the following reasons: (1) the entire region need not be solved in order to determine 
potentials at relatively few points, (2) computation time is not lengthened by complex ge- 
ometries, and (3) a probabilistic potential theory computation is more topologically efficient 
than matrix manipulations for problems in two and three spatial dimensions. 

To understand this technique,!? consider the following potential problem: 


auta o O0<a<l1, 0<y<l, (1) 
subject to the boundary conditions 
u(z,0) = 0, u(a,1) = z, 0<a<1, (2) 
and 
u(0, y) = u(1, y) = 0, O0<y<l. (3) 


If we introduce a uniform grid with Ax = Ay = As, then the finite difference method yields 
the difference equation: 


A4u(i, j) = u(i 4- 1, j) 4- u(i — i 3) + uli, j +1) + uli, j — 1), (4) 


with 7,7 = 1, N — 1 and As = I/N. 

Consider now a random process of the Markov type in which a large number NN; of non- 
interacting particles are released at some point (x1, y1) and subsequently perform Brownian 
motion in steps of length As each unit of time. At some later time, when a few arrive at 
point (x,y), we define a probability P(i, j) of any of them reaching the boundary y = 1 
with potential uz, at any subsequent time in the future. Whenever one of these particles 
does (later) arrive on y = 1, it is counted and removed from the system. Because P(i, j) is 
defined over an infinite time interval of the diffusion process, the probability of any parti- 
cles leaving (x,y) and arriving along some other boundary (where the potential equals 0) 
at some future time is 1 — P(i, j). Whenever a particle arrives along these boundaries it is 
also removed from the square. 

Having defined P(i, 7) for an arbitrary (x,y), we now compute it in terms of the proba- 
bilities of the neighboring points. Because the process is Markovian, where a particle jumps 
from a point to a neighbor with no memory of the past, 


9 See Hersh, R., and R. J. Griego, 1969: Brownian motion and potential theory. Sci. Amer., 220, 
67-74. 


10 For the general case, see Bevensee, R. M., 1973: Probabilistic potential theory applied to electrical 
engineering problems. Proc. IEEE, 61, 423-437. 
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Figure 18.2.5: Four Brownian motions within a square domain with Az = Ay. All of the random walks 
begin at grid point i = 4 and j = 6. 


where p(t + 1,j|i,j) is the conditional probability of jumping to (a + As, y), given that 
the particle is at (x,y). Equation (5) evaluates P(i,j) as the sum of the probabilities of 
reaching y — 1 at some future time by various routes through the four neighboring points 
around (x,y). The sum of all the p’s is exactly one because a particle at (x, y) must jump 
to a neighboring point during the next time interval. 

Let us now compare Equation (4) and Equation (5). The potential u(i, 7) in Equation 
(4) and P(i, j) becomes an identity if we take the conditional probabilities as 


pli + 1, jli, j) =pli- 1, jli, j) = pli, j + 1i, j) = p, j — 15.3) = 3> (6) 


and if we also force u(i, N) = P(i, N) = i, u(i,0) = P(i,0) = 0, u(0,7) = P(0,j) = 0, 
and u(N, j) = P(N,j)=0. Both the potential u and the probability P become continuous 
functions in the space as As — 0, and both are well behaved as (x, y) approaches a boundary 
point. A particle starting along y = 1, where the potential is ug, has a probability uj of 
arriving there; a particle starting on the remaining boundaries, where the potential is zero, 
is immediately removed with no chance of arriving along y — 1. From these considerations, 
we have 


V miss Ayn 
u(i, j) = P(ij)- lim x 23 kk, (7) 
where N is the number of particles starting at (x,y) and Nz equals the number of particles 
that eventually - after infinite time - arrive along the entire boundary at potential uj. This 
sum includes the boundary y — 1 and (trivially) the remaining boundaries. 


Step 1: Develop a MATLAB code to perform two-dimensional Brownian motion. Let U 
be a uniformly distributed random variable lying between 0 and 1. You can use rand. If 
0<U< + take one step to the right; + < U < 5, take one step to the left; if i «Ux 2, 

take one step downward; and if i «U < m pale one step upward. For the na bittaty pote 
i,j located on a grid of N x N points with 2 < 7,7 < N — 1, repeatedly take a random step 
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50 realizations 200 realizations 


800 realizations 3200 realizations 


Figure 18.2.6: Solution to Equation (1) through Equation (3) using the probabilistic solution method. 


until you reach one of the boundaries. Record the value of the potential at the boundary 
point. Let us call this result u k(1). Figure 18.2.5 illustrates four of these two-dimensional 
Brownian motions. 


Step 2: Once you have confidence in your two-dimensional Brownian motion code, generalize 
it to solve Equation (1) through Equation (3) using runs realizations at some interior grid 
point. Then the solution u(i,j) is given by 


uli, j) = —— M ux(n). (8) 


Step 3: Finally, plot your results. Figure 18.2.6 illustrates the potential field for different 
values of runs. What are the possible sources of error in using this method? 


18.3: ITÓ'S STOCHASTIC INTEGRAL 


In the previous section we noted that Brownian motion (the Wiener process) is nowhere 
differentiable. An obvious question is what is meant by the integral of a stochastic variable. 


Consider the interval [a, b], which we subdivide so that a = tg < tı < t2 < +- < tn — b. 
The earliest and simplest definition of the integral is 


f(t)dt = lim M f(x) Ati, (18.3.1) 
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where tj;_1 < 7; < t; and At; = t; — t;_,. In the case of the classic integral, the integration 
is with regards to the increment dt. 

Itó's integral is an integral where the infinitesimal increment involves Brownian motion 
dB(t), which is a random variable. Before we can define this integral, we must introduce 
two important concepts. The first one is nonanticipating processes: A process F(t) is a 
nonanticipating process if F(t) is independent of any future increment B(s) — B(t) for any 
s and t where s > t. Nonanticipating processes are important because Itó's integral applies 
only to them. 

'The second important concept is convergence in the mean square sense. It is defined 
by 
2 


b 
lim E4 |$,— T F(t)dB(t) = 0, (18.3.2) 
n—oo a 
where Sn is the partial sum 
Sn = 5 F(ti-1) [B(t;) — B(ti-1)]. (18.3.3) 


i=l 


We are now ready to define the Itó integral: It is the limit of the partial sum Spn: 


ms-lim Sn = / FA dB(t), (18.3.4) 


n— Co 
where we denoted the limit in the mean square sense by ms—lim. Combining Equation 
18.3.3 and Equation 18.3.4 together, we find that 
b n 
| fit, B(t)) dB(t) = Le flti-1, B(ti-1)] [B(t:) - B(ti-1)]; (18.3.5) 
where t; = iAt and At = (b — a)/N. As one might suspect, 


f dB(t) = B(b) — B(a). (18.3.6) 


Because F(t) and dB(t) are random variables, so is Itó's integral. 

'The results from Equation 18.3.6 would be misunderstood if we think about them as 
we do in conventional calculus. We cannot evaluate the right side of Equation 18.3.6 by 
looking up B(t) in some book entitled “Tables of Brownian Motion." This equation only 
holds true for a particular realization (sample path). 


e Example 18.3.1 


Let us use the definition of the Itó integral to evaluate Itó integral ru B(z)dB(x). In 
the present case, 


S, = M B(zi 1) [B(z;) - B(zi1)], (18.3.7) 


i=l 
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where x; = it/n. Because 2a(b — a) = b? — a? — (b — a)?, 


Be : ys B?(a,) — ; T B? (z1) — Y [B(z;) — B(z; if? (18.3.8) 
= iB*(t)- 23 [B(z;) — B(rii)]. (18.3.9) 
Therefore, 

ms—lim Sp = 1 Bt) — 3 ms—lim J [B(z;) — B(z; i) (18.3.10) 
= I1B*(t) - 5 B (18.3.11) 

As a consequence, 
P Bo) dB(n) = 4.B?(t) — :, (18.3.12) 
: B(t) dB(t) = 4[B?(b) — B?(a)] ^ = (18.3.13) 

Consider now the derivative of B?(t), 

d[B?(t)] = [B(t + dt) — B(t)? = 2B(t) dB(t) + dB(t) dB(t). (18.3.14) 


In order for Equation 18.3.12 and Equation 18.3.14 to be consistent, we arrive at the very 
important result that 


[dB(t)|*? = dt (18.3.15) 


in the mean square sense. We will repeatedly use this result in the remaining portions of 
the chapter. 


Because the Itó integral is a random variable, two important quantities are its mean 
and variance. Let us turn first to the computation of the expectation of Je f |t, B(t)| aB(t). 
From Equation 18.3.5 we find that 


Bf [ rien uno) = dE: fiis B6 olan. (18.3.16) 


Therefore 


ef [ ren aso) =0. (18.3.18) 
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To compute the variance, we begin by noting that 


-1 B(ti-i))( AB)? (18.3.20) 


\| 
der 
= 4B 
Hsc 

e 


p i-1; B(ti-1)] AB: f[tj 1, B(tj 1)])AB;. 


t=1 j—1 
izj 


Taking the expectation of both sides of Equation 18.3.20, we have that 


b 2 l n " E 
EJ Lf. ri popa) | = m D PUPH- Bc ODE BS (18.3.21) 
+2 dim 3 > E{f(ti-1, B(ti-1) |} E[ABi] E{ Ff (t;-1; B(tj-1)) E[AB;] 
“13 
= dim, 5, E(f^[t; 1, B(t; -1)] (tiga — ti). (18.3.22) 


The double summation vanishes because of the independence of Brownian motion. There- 
fore, the final result is 


sf fit, B(t)] 4AB(t e] = [eri Be )}} dt. (18.3.23) 


e Example 18.3.2 


Consider the random number X = f? vt sin|B(t)] dB(t). Let us find E(X) and E(X?). 
From Equation 18.3.18, we have that E(X) — 0. For that reason, var(X) — E(X?) 
and 


Var(X) = E(X?) -f Ef |V sin[B( t)]| *\ at = [ents wey} a (18.3.24) 


= [aimi à - cosa) a= f : | »» Rud dt — (18.3.25) 


EE 2n! 
n=0 
jv ue equam [uon 
EI MOD ent] ge = prt n+2) 18.3.26 
2. n! 2 2. (n + 2)n! ( a) (1505:20) 


The value of E(cos[2B(t)]) follows from Problem 2 at the end of the last section. 
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Table 18.3.1 gives a list of Itó stochastic integrals. Most of these results were not 
derived from the definition of the Itó stochastic integral but from Itó lemma, to which we 
now turn. 


Problems 


Consider the random variable X — f? fit, B(t) dB. Find E(X) and Var(X) for the follow- 
ing f[t, B(t)]: 


1. flt, B(t)] =t 2. flt, B(t)] = tB(t) 
3. flt, B(t)] = |B(t) 4. f[f, B(t)] = vt exp[B(t)] 


5. If X = f? f(t){sin[B(t)] + cos[B(t)]) dB(t), show that var(X) = f? f?(t) dt, if f(t) is a 
real function. 


Project: Numerical Integration of Itó's Integral 


Equation 18.3.5 is useful for numerically integrating the Itó integral 


n fle, B(x)]| dB(x). 


Write a MATLAB script to check Example 18.3.1 for various values of n when t — 1. How 
does the error vary with n? 


Project: Numerical Check of Equations 18.3.18 and 18.3.23 

Using the script from the previous project, develop MATLAB code to compute Equation 
18.3.18 and Equation 18.3.23. Using a million realizations (sample paths), compare your 
numerical results with the exact answer when a = 1, b = 1, and /[t, B(t)] = vt sin|B(t)]. 
18.4: ITÓ'S LEMMA 

Before we can solve stochastic differential equations, we must derive a key result in 
stochastic calculus: It6’s formula or lemma. This is stochastic calculus's version of the 
chain rule. 

Consider a function f(t) that is twice differentiable. Using Taylor's expansion, 


df(B) = f(B + dB) - f(B) = f'(B)dB + f" (B) (AB? +-->, (18.4.1) 


where B(t) denotes Brownian motion. Integrating Equation 18.4.1 from s to t, we find that 


J «m - fiei- rb - f ranapei[ rase. (18.4.2) 


because [dB(a)|? = dx. The first integral on the right side of Equation 18.4.2 is an Itó's 
stochastic integral while the second one can be interpreted as the Riemann integral of f" (B). 
Therefore, Itó's lemma or formula is 


fIB@)] - f[B(] = J f'(B)dB +2 | J"(B) de (18.4.3) 
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Table 18.3.1: A Table of Itó Stochastic Integrals with t > 0 and b» a0 


1. 
[as = B(b) — B(a) 
2: i 
f B(n) dB @)-t] 
3. , 
f [B? (y) — n] dB(n) = XB* (t) — tB(¢) 
4. 
[nae = O- f Ban 
5. , ; 
J aB - 45 - f Bon 
0 0 
6. : 
I e"? cos[AB(n)] dB(n) = xd sin[AB(t)] 
0 
7. : 
en? in = E — eu cos 
[ "ssa = x {i NO, 
8. : 
ri exp[-39n + AB(j)] dB() = +5 (exp[-3996 + AB()] - 1) 
P i B?(n)] dB(n) B?(b) ) 
fro t ep e 
10. : " i 
J taB - rome - | FB an 
11. i: 
1 
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for t > s. 
e Example 18.4.1 


Consider the case when f(t) = t? and s = 0. Then, Itó's formula yields 


ze B =2 f B(x) apo) - f dz. (18.4.4) 


Evaluating the second integral and noting that B(0) — 0, we again obtain Equation 18.3.12, 
that 


[x ) dB(x) = 4[B2(t) — t]. (18.4.5) 


e Example 18.4.2 


Consider the case when f(t) = e*’ and s = 0. Then, Itó's formula yields 
t a rt 
pret) 1 = af e*P9 dB(a) + a et Bl) ds. (18.4.6) 
0 0 
Computing the expectation of both sides, 
Ble «50 = :- 2 fe] iul (18.4.7) 


Solving this integral equation, we find that E[e^P(0] = e«*!/2 


in Problem 2, Section 18.2. 


, a result that we found earlier 


e Example 18.4.3 
If f(t) = sin(At), A > 0, then Itó's formula gives 
t t 
sin[AB(t)] = af cos[AB(n)] dB(n) — bof sin[AB (1)] dn. (18.4.8) 
0 0 
Taking the expectation of both sides of Equation 18.4.8, we find that 
t 
E(sin[AB(t)]) = - f Esin[AB (0)]) dq. (18.4.9) 
0 


Setting g(t) = E(sin|AB(t)]), then 


a(t) = -3x | g(n) dn. (18.4.10) 


The solution to this integral equation is g(t) = 0. Therefore, E'{sin[AB(t)]} = 0. 
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Educated at the Imperial University of Tokyo, Kiyoshi Itó (1915-2008) applied the techniques of 
differential and integral to stochastic processes. Much of Itó's original work from 1938 to 1945 was 
done while he worked for the Japanese National Statistical Bureau. After receiving his doctorate, 
Itó became a professor at the University of Kyoto from 1952 to 1979. (Author: Konrad Jacobs, 
Source: Archives of the Mathematisches Forschungsinstitut Oberwolfach.) 


'The second version of Itó's lemma begins with the second-order Taylor expansion of 
the function f(t, x): 


flt + dt, B(t + dt)] — fit, B(t)] = felt, B(Q)] dt + felt, B(t)] aB(t) 
+ ${fult, B(t)] (dt)? + fat, B(t)) dtdB(t) — (18.4.11) 
+ fss[t. B(t)] [AB(t)? ) +. 


Here we assume that f|t, B(t)] has continuous partial derivatives of at least second order. 
Neglecting higher-order terms in Equation 18.4.11, which include the terms with factors 
such as (dt)? and dtdB(t) but not [dB(t)|? because [dB(t)]? = dt, our second version of 
Itó's lemma is 


flt, B(t)] — f[s, B(s)] =| { filn, BM] + 3 fez, B(n)]} a+ f faln, B(n)] dB(n) 
i (18.4.12) 
ift >s. 
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e Example 18.4.4 
Consider the function f(t, £) = e*-'/?. Then, 
filt, £) = —4e®t/?, Lease, and uhr oe: v. (18.4.13) 


Therefore, from Itô’s lemma, we have that 


t 
ents VE) eaae if e n/2eB( dB(n). (18.4.14) 


S 


e Example 18.4.5: Integration by parts 
Consider the case when F(t, x) = f(t)g(x). The Itó formula gives 
d(f(t)g(x)) = Cf'(09[B(O] + SOBO) dt + FO [BE] 4B(0. (18.4.15) 


Integrating both sides of Equation 18.4.15, we find that 


b 
] (ose dB(t) — f(t) | - [row Bat- 3 f Os "[B(t)] dt, (18.4.16) 


which is the stochastic version of integration by parts. 
For example, let us choose f(t) = e** and g(x) = sin(x). Equation 18.4.16 yields 


t 


i e cos[B(n)] dB(n) = e*" sin[B(n)]| — à f e% sin[B(n)] dn + à | e% sin[B(n)] dn 


0 
(18.4.17) 
t 
= e™ sin[B(t)] — (a — 1) | e*" sin[B()] dn. (18.4.18) 
0 
In the special case of a = i, Equation 18.4.18 simplifies to 
t 
n e^" cos[B(n)] dB(n) = e'? sin[B(t)]. (18.4.19) 
0 


An important extension of Itó's lemma involves the function f[t, X(t)] where X(t) is no 
longer simply Brownian motion but is given by the first-order stochastic differential equation 


dX (t) = cX(t) dt + o X (t) dB(t), 18.4.20) 
where c and c are real. The second-order Taylor expansion of the function f [t, X (t)] becomes 


f [t + dt, X (t + dt)] — flt, XH] = filt, XH] dt + felt, X (t)] AX (t) 18.4.21) 
+2{ fault, X (0) (dt)? + for[t, X (0] dt aX (t) + feelt, X (t)] [EX (0) + 
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Next, we substitute for dX (t) using Equation 18.4.20, neglect terms involving (dt)? and 
dt dB(t), and substitute [dB(t)]? = dt. Consequently, 


df = flt + dt, X (t + dt)] — flt, X (t)] (18.4.22) 
= oX (t) falt, X (t) dB(t) + fs [t, X (£)] + eX (t) falt, X (£)] + 1o? X? (t) f... lt, xol} dt. 
(18.4.23) 


The present extension of Itó's lemma reads 

fi X €) = fis Xx] = | [utm X Xn fe X0] 30X0) feeb XI | an 
«f " oX(n) feln, X(n)] dBQ) (18.4.24) 
= f S fan XODI + texto xe s 
+ f tel. xoaxo, (18.4.25) 


where 
dX (n) = cX (n) dn + oX (n) dB (n) (18.4.26) 


and t > s. 

We can finally generalize Itó's formula to the case of several Itó processes with respect 
to the same Brownian motion. For example, let X(t) and Y (t) denote two Itó processes 
governed by 

dX(t) = AW) (t) dt + AC? (t) aB(t), (18.4.27) 


and 


dY (t) = AC? (t) dt + AP?) (t) dB(t). (18.4.28) 
For stochastic process f[t, X(t), Y (t)], the Taylor expansion is 
df[t, X (t),Y (0) = fil XO, Y (0] dt + felt, X (0. Y (0] aX E) 
+ fylt, X(t), Y (0] aY (t) (18.4.29) 
+ ACD (JAOD (t) dt + $ faylt, X(t), Y(QIAC? HAP (t) dt 
AP?) (1) AC-D (0) dt + 3 fuylt, X (t), Y OAC? (HAC (t) dt. 


LASIEXQuYG 
1fys]t, X(t), Y (t 


)] 
)] 


e Example 18.4.6: Product rule 


Consider the special case f(t, x,y) = zy. Then f; = 0, fe =y, fy =£, fos = fyy = 0, 
and fry = fys = 1. In this case, Equation 18.4.29 simplifies to 


d[X (£)Y (t)] = Y (t) dX (t) + X (t) dY (t) + AC? (t, X(t), Y OJA? [t, X(t), Y (2)] dt. 
(18.4.30) 
A very important case occurs when A-D ft, X (t), Y (£)| = 0 and X(t) = g(t) is purely 
deterministic. In this case, 


d[g(t)Y (t)] = Y (t) dg(t) + g(t) dY (t). (18.4.31) 
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This is exactly the product rule from calculus. 
Problems 


1. (a) Use Equation 18.4.3 and f(t) = t" to show that 


Bryan [ mo) ) dB(a 202 f gna: 


(b) Show that 
E[B"(t)] = DE | E[B"-2(2x)] dx 


(c) Because E[B(t)] = 0 and E[B?(t)| = t, show that 


QE, 


E[B?"()] 20, and | E[B?*()]- SEE 


2. Let f(t, x) = z?/3 — ta and use Itó's formula to show that 


J (B?(n) - n] dB(n) = 4B*(t) - t B0. 


3. If f(x) is any continuously differentiable function, use Equation 18.4.29 to show that 


| f(n) dB(n) = f()B() — n FB) dn 
0 0 


4. If f(t) =e", use the previous problem to show that 


[ e" dB(n) = e'B(t) — J e" B(y) dn. 


5. Let G(x) denote the antiderivative of g(x). Use Equation 18.4.3 to show that 


b 


b 
J J[B(©)] 4B(t) = GIB) 


6. (a) If g(x) = xe”, use Problem 5 to show that 


n B(n)e?™ dB(n) = [B(t) — 1]eP0 +1 — 3 fino + 1]e82 dn. 
x 0 
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(b) Use Equation 18.3.18 to show that 
t 
E|B()e? 9] = E [eO] -1+ 2! E[B(g)e29?] + Efe} an 
peer pne nga ee] 


=e? 1+ a [e + E|B(y)e?9»|Y dn. 
0 


(c) Setting g(t) = E[B(t)eP(?], use Laplace transforms to show that 


E [Be] = tet/?, 


7. (a) If g(x) = 1/(1 + z?), use Problem 5 to show that 


* dB() un * Bn) 
] gT tente | Ip 


(b) Use Equation 18.3.18 to show that 


"ue BD «d ase peat 
[fec gn] n Bren. 


(c) Because 


F = 
V --——— om BL Geren Sp 
16 ^ (132)? ^ 16 16 o [L+ B2(g)f? 16 
show that 
E < E [axctan|B(t)]) < a; 


8. If g(x) = x/(1 +27), use Problem 5 to show that 


Bln) ate if 2B» 
J, igi 900 = Lost 801-1 | igi o 


9. Use integration by parts with f(t) = e?* and g(x) = — cos(x) to show that 
t t 
| e?" sin[B(n)] dB(n) = 1 — e”* cos[B(t)] + (8 — 4) jJ e?" cos[B (n)] dn. 
0 0 
Then, take 6 = i and show that 


T e"/? sin[B()] dB(n) = 1 — e'? cos[B(t)]. 
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10. Redo Example 18.4.3 and show that E(cos[AB(t)]) = e-^ /2, A » 0. 


11. Use trigonometric double angle formulas to show that 


(a) E(sin[t + AB(t)]) = e^"? sin(t), 
and 

(b) E{cos|t + AB(t)]) = e^?" !? cos(t), 
when \ > 0. 


12. Following Example 18.4.4 with f(t, £) = + exp(+Ax — A?t/2), A > 0, show that 


J T [taso = 22 dB(n) = +5 {exp E = 2d = i} 


t 2 1 2t 
1 ew (^!) cos[AB(n)| dB(n) = = ex (7) sin[AB(t)]. 
^ 2 d 2 
14. Following Example 18.4.4 with f(t, x) = — exp(A?t/2) cos(Az), A > 0, show that 


[ Bis (2 sin[\B(n)] dB(n) = x E " ex (57*) spunto] 


15. Following Example 18.4.4 with f(t, £) = t- !/? exp[x?/(2t)], show that 


[ ses [ 20] 480. ias [9980]. ia MO] 


2t t3/2 2a 


16. The average of geometric Brownian motion on [0, t] is defined by 


T 
G(t) = Wu eP) dn. 


Use the product rule to find dG(t). Hint: Take the time derivative of tG(t) = Ih e80) dn. 
18.5: STOCHASTIC DIFFERENTIAL EQUATIONS 


We have reached the point where we can examine stochastic differential equations. Of 
all the possible stochastic differential equations, we will focus on Langevin's equation!! - a 


11 Langevin, P., 1908: Sur la théorie du mouvement brownien. C. R. Acad. Sci. Paris, 146, 530—530. 
English translation: Langevin, P., 1997: On the theory of Brownian motion. Am. J. Phys., 65, 1079-1081. 
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model of the velocity of Brownian particles. We will employ this model in a manner similar 
to that played by simple harmonic motion in the study of ordinary differential equations. 
It illustrates many of the aspects of stochastic differential equations without being overly 
complicated. 


e Example 18.5.1 


Before we consider the general stochastic differential equation, consider the following 
cases where we can make clever use of the product rule. For example, let us solve 


dX(t) = [t + B?(t)| dt + 2tB(t) dB(t), X(0) = Xo. (18.5.1) 
In the present case, we can find the solution by noting that 
dX(t) = B?(t) dt + t[2B(t) dB(t) + dt] = B?(t) dt +td[B?(t)] = q(.B?(t. — (18.5.2) 
Integrating both sides of Equation 18.5.2, we find that the solution to Equation 18.5.1 is 
X(t) = tB? (t) + Xo. (18.5.3) 
Similarly, let us solve the stochastic differential equation 


b — X(t) 


dX(t) = 


dt+dB(t), O<#t<]1, (18.5.4) 


with X(0) = Xo. 
We begin by writing Equation 18.5.4 as 


db- X(t)] b-— X(t) dB(t) 
dt = 18.5. 
Der Pape TE USD) 
Running the product rule backwards, 
b- X(t dB(t 
a| ( J = W (18.5.6) 
1-t 1-t 
Integrating both sides of Equation 18.5.6 from 0 to t, we find that 
b- X(t ‘dB 
(0 _4_ x(0) J (7) (18.5.7) 
1-t o l-7 
Solving for X (t), we obtain the final result that 
‘dB 
X(t) =b-[b-x@]Q-H)+(0 of l 2. (18.5.8) 
d: b= 


In the present case we cannot simplify the integral in Equation 18.5.8 and must apply 
numerical quadrature if we wish to have numerical values. 


In the introduction we showed that the solution to Langevin’s equation: 


dX(t) =cX(t)dt+odB(t),  X(0) = Xo, (18.5.9) 
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t t 
X(t) = Xo + cf X (n) dn + o f dB(n). (18.5.10) 
0 0 
An obvious difficulty in understanding this solution is the presence of X(s) in the first 
integral on the right side of Equation 18.5.21. 


Let us approach its solution by considering the function f(t,x) = e xz. Then, by Itó's 
lemma, Equation 18.4.16, 


flt, X (t)] — X(0) =| Un. X(n)] + eX (n) feln, XM] + 307 feel, X(m)]} an 


+ | o faln, X(n)] dB), (18.5.11) 
0 


because f[0, X (0)] = X (0). Direct substitution of f(t, x) into Equation 18.5.11 yields 
t 
e *X(t) - Xo = o f e ^ dB(n). (18.5.12) 
0 
Finally, solving for X (t), we obtain 
t 
X(t) = X(0)e + set f e "dB(n), (18.5.13) 
0 


an explicit expression for X(t). For the special case when Xo is constant, X(t) is known as 
an Ornstein- Uhlenbeck process.” 
An alternative derivation begins by multiplying Equation 18.5.9 by the integrating 
factor e~“ so that the equation now reads 
e * dX(t) — ce * X (t) dt = ce *' dB(t). (18.5.14) 
Running the product rule, Equation 18.4.23, backwards, we have that 
dle *X(t)| = ce^*' dB(t). (18.5.15) 
Integrating both sides of Equation 18.5.15, we obtain Equation 18.5.12. 


e Example 18.5.2: Exact stochastic differential equation 


Consider the stochastic differential equation 
t t 
X(t) = X(0) + al, X(s)ds + o f X(s)dB(s), (18.5.16) 
0 0 


with c,o > 0. 
If X(t) = f[t, B(t)], then by Itó's lemma, Equation 18.4.9, 


xi - xo« f { fils, B(s)] + $fea[s, B(s)]} as+ f fals, B(s)|dB(s). (18.5.17) 


1? Uhlenbeck and Ornstein, op. cit. 
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Figure 18.5.1: Ten realizations (sample paths) of geometric Brownian motion when c = 0.1, ø = 0.5, and 
X(0) = 1. The heavy line is the mean of X (t). 


Comparing Equation 18.5.16 and Equation 18.5.17, we find that 


oft a) = filt, £) futu, (18.5.18) 
and 
of(t,x) = fa(t, £). (18.5.19) 
From Equation 18.5.19, 
falt, £) =9 s es fua (18.5.20) 


Therefore, Equation 18.5.18 can be replaced by 
(c- 30°) ft, 2) = f(t, x). (18.5.21) 


Equation 18.5.19 and Equation 18.5.21 can be solved using separation of variables, which 
yields 
f(t,z) = f(0,0)exp|(c— $07) t+ ex], (18.5.22) 


X(t) = flt, B(t)) = X (0) exp| (c — 40°) t + oB(t)]. (18.5.23) 


Thus, a stochastic differential equation can sometimes be solved as the solution of a deter- 
ministic partial differential equation. In the present case, this solution is called geometric 
Brownian motion. For its solution numerically, see Example 18.6.1. See Figure 18.5.1. 


e Example 18.5.3: Homogeneous linear equation 
Consider the homogeneous linear stochastic differential equation 
dX(t) = e(t) X (t) dt + o1(t) X (t) aB(t). (18.5.24) 
Let us introduce f(t,x) = ln(x). Then by Itó's lemma, Equation 18.4.21, 


df = d{ln(X)] = [e1(t) — 401 (t)] dt + e(t) dB(t), (18.5.25) 
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because f; = 0, f, = 1/z and fry = —1/z?. Integrating both sides of Equation 18.5.25 and 
exponentiating the resulting expression, we obtain 


X() - x()exo| [ (een zeto] tv f ni) BG). (18.5.26) 


e Example 18.5.4: General case 
Consider the homogeneous linear stochastic differential equation 
dX(t) = [ex(t).X (t) + co(t)] dt + [o1 (t) X (t) + o2(t)] dB(t). (18.5.27) 
Our analysis begins by considering the homogeneous linear stochastic differential equa- 


tion 


dY(t) = e (HY t) dt oi HY t) dBA, X Y(0) — 1. (18.5.28) 


From the previous example, 


Y(t) = n [ei (0) — $o1(0)] dn + [aw aB(n)} (18.5.29) 


Next, let us introduce two random variables, X; = 1/Y and Xə = X. Using Itó lemma 
f (t, x) = 1/z, then 


dX; — df(t, Y) «d (=) = [of (t) ^ ci(t)] z - oie (18.5.30) 
= [of (t) — c1 (t)] Xı (t) dt — o1 (t)Xı (t) dB(t), (18.5.31) 


since f; = 0, fe = —1/z? and fre = 2/2?. 
Using Equation 18.4.30, where X, is governed by Equation 18.5.31 and Xə is governed 
by 18.5.27 because X5 = X, 
d( Xi X2) = [co(t) ver (t)o»(t)] Xi (t) dt + e»(t) X1 (t) dB(t). (18.5.32) 


Upon integrating both sides of Equation 18.5.32, we have 


X= X3) — f fal - mes wat fam pe. (18.5.33) 
Consequently, our final result is 
X(t) -vo [xo « | [c2 (m) — o1(n)o2(n)] ta «f oot) ee (18.5.34) 


where Y(t) is given by Equation 18.5.29. 
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e Example 18.5.5: Stochastic Verhulst equation 
'The stochastic Verhulst equation is 
dX(t) = aX(t)|M — X(t)] dt + 0X (t) dB(t), X(0) = Xo. (18.5.35) 


We begin its solution by introducing ®(t) = 1/X(t). Then by Itó's lemma, Equation 
18.4.21 with f(x) = l/z, 


dé(t) = —(t)((aM — b?) dt + bdB(t)] + a dt, $(0) = 1/ Xo. (18.5.36) 
To solve Equation 18.5.36, we use the results from Example 18.5.4 with cı (t) = b?-aM, 


c(t) = a, oi(t) = —b, and o2(t) = 0. Denoting e(t) = (aM — b?/2)t + bB(t), we can write 
Equation 18.5.34 as 


t 
&(t)e§ — 6(0) =a e$ dn, (18.5.37) 
0 
is E) t 
E 1 ik E(n) 
—_ —- — =a e dn. 18.5.38 
XG us eed Ups) 


Solving for X (t), we obtain the final result that 


xa- — Xewk 


= (18.5.39) 
1+aXo fo exp[£(m)] dn 


Problems 
1. Solve the stochastic differential equation 
dX(t) = 1e"? B(t) dt +e? dB(t), — X(0) = Xo, 
by running the product rule backwards. 
2. Solve the stochastic differential equation 
dX (t) = e” [1 + 2B?(t)| dt + 2e” B(t) dB(t, |^ X(0) = Xo, 


by running the product rule backwards. Hint: Rewrite the differential equation dX(t) = 
e?* [2 B(t) dB(t) + dt] + (2e?* dt) B?(t). 


3. Solve the stochastic differential equation 
dX(t) = [1 + B(t)] dt + [t + 2B(t)] dB(t), X(0) = Xo, 


by running the product rule backwards. Hint: Rewrite the differential equation dX(t) = 
2B(t) dB(t) + dt+ B(t) dt+tdB(t). 


4. Solve the stochastic differential equation 


dX (t) = [3t? + B(t)] dt + tdB(t), X(0) = Xo, 
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by running the product rule backwards. Hint: Rewrite the differential equation d.X(t) — 
3t? dt + [B(t) dt + tdB(t)]. 


5. Solve the stochastic differential equation 
dX(t) = B?(t) dt + 2tB(t) dB(t), X(0) = Xo, 


by running the product rule backwards. Hint: Rewrite the differential equation dX(t) = 
t[2B(t) dB(t) + dt] + B? (t) dt — t dt. 


6. Find the integrating factor and solution to the stochastic differential equation 
dX(t) = [1 + 2X (t)] dt + e” dB(t), | X(0) = Xo, 
where B(t) is Brownian motion. 


7. Find the integrating factor and solution to the stochastic differential equation 


Q(t) 
RC 


V(t) 


dQ(t) + == dt = 


a+ E apie), Q(0) = Qe 


where R and C are real, positive constants, and B(t) is Brownian motion. 
8. Find the integrating factor and solution to the stochastic differential equation!? 
dX (t) = 2tX(t) dt + e * dt + dB(t), t € [0,1], 
with X(0) = Xo, and B(t) is Brownian motion. 
9. Find the integration factor and solution to the stochastic differential equation 
dX(t) = [4X (t) — 1] dt + 2dB(t), X(0) = Xo, 
where B(t) is Brownian motion. 
10. Find the integration factor and solution to the stochastic differential equation 
dX(t) = [2 — X(t)| dt + e * B(t) dB(t), X(0) = Xo, 
where B(t) is Brownian motion. 
11. Find the integration factor and solution to the stochastic differential equation 
dX(t) = [1 + X(t)| dt + e! B(t) dB(t), AC (0) = Xo, 


where B(t) is Brownian motion. 


13 Khodabin, M., and M. Rostami, 2015: Mean square numerical solution of stochastic differential equa- 
tions by fourth order Runge-Kutta method and its applications in the electric circuits with noise. Adv. 
Diff. Eq., 2015, 62. 
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12. Find the integration factor and solution to the stochastic differential equation 
dX(t) = |3X(t) +1] dt+e'cos[B(t)|dB(t), — X(0) = Xo, 

where B(t) is Brownian motion. 

13. Find the integration factor and solution to the stochastic differential equation 
dX(t) = [t+ 3 X(t)] dt + é sin[B(t)] dB(t), X(0) = Xo, 

where B(t) is Brownian motion. 

14. Following Example 18.5.2, solve the exact stochastic differential equation: 


dX(t) = e [1+ B?(t)] dt + [1+ 2e’B(t)] aB(t), X(0) = Xo. 


Step 1: Show that f, +4 fee = e'(1-- 2?), and f; = 1+ 2e'z. 
Step 2: Show that f(t, £) = x + ea? + g(t). 
Step 3: Show that g(t) = Xo and X(t) = B(t) + e! B?(t) + Xo. 


15. Following Example 18.5.2, solve the exact stochastic differential equation: 


dX (t) = (2tB?(t) + 32 [L + B(t)]] at + [1 + 30^ B?(t)] dB, X(0) = Xp. 


Step 1: Show that f, + 3 fan = 2tz? + 3? (1 + x), and f, = 3t?z? +1. 
Step 2: Show that f(t, £) = t?z? + x + g(t). 

Step 3: Show that g'(t) = 3t?. 

Step 4: Show that X(t) = t?[B?(t) +t] + B) + Xo. 


Using Equation 18.5.26, solve the following stochastic differential equations: 


16. dX(t) = t? X (t) dt + tX (t) dB(t), X(0) = Xo 

17. dX(t) = cos(t) X(t) dt + sint) X(t) dB(t), ^ X(0) = Xo 

18. dX(t) = In(t + 1) X (t) dt + /In(t 4-1) X(t)dB(t, ^ X(0) = Xo 
19. dX(t) =In(t + 1)X(t) dt +tX(t)dB(t), — X(0) = Xo 


20. Following Example 18.5.5, solve the stochastic differential equation 
dX (t) = [a.X" (t) + 0X (t)] dt + cX (t) dB(t), X(0) = Xo, 


where n > 1. 


Step 1: Setting P(t) = X!^"(t), use Itó's lemma Equation 18.4.21 with f(x) = 1/z"^! to 
show that 
d®(t) = (1— n)&(t) [(b — Inc?) dt - cdB(t)] + (1 — n)a dt. 
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Step 2: Setting c(t) = (1 — n)b — n(1 — n)c?/2, c(t) = (1— n)a, ei(t) = (1— n)c, and 
d2(t) = 0, show that 


epla- De] — 1 ; 
Xe Tea 0 7 me | epli- DEd 


or 


exp(n-1() 1 f 
= + (1 —1 d 
Xn-1(t) AS ( n)a ù exp[(n )£()] n, 
where £(t) = (b — c?/2)t + cB(t). 
21. Following Example 18.5.5, solve the stochastic Ginzburg-Landau equation: 


dX(t) = [ae** +0] dt-cedB(), — X(0) = Xo. 


Step 1: Setting (t) = exp[-cX (t)], use Itó's lemma Equation 18.4.21 with f(x) = e ** to 
show that 
de (t) = — (bc — 3o?c?) (t) dt — oc®(t) dB(t) — ac dt. 


Step 2: Setting cı (t) = 07c?/2 — be, co(t) = —ac, o1(t) = —oc, and o2(t) = 0, show that 


X(t) = Xo + bt + o Bt) ;m[i oc f expleXo bet cB ach. 
0 


22. Following Example 18.5.5, solve the stochastic differential equation: 


dX(t) = {[1 + X(t)][1 + X7(¢)]] dt + [1 + X?(d] aB(t), X(0) = Xo. 


Step 1: Setting ®(t) = tan! [X(t)], use Itó's lemma Equation 18.4.21 with f(x) = tan! (x) 
to show that d®(t) = dt + dB(t). 


Step 2: Solving the stochastic differential equation in Step 1, show that 


X(t) = tan[tan ! (Xo) + t + B(t)]. 


18.6: NUMERICAL SOLUTION OF STOCHASTIC DIFFERENTIAL EQUATIONS 


In this section we construct numerical schemes for integrating the stochastic differential 
equation 
dX (t) = a[X (t), t] dt + b| X(t), t] dB(t) (18.6.1) 
on to € t € T with the initial-value X (to) = Xo. 
Our derivation begins by introducing the grid tg < t4 < t2 < -<tn <- «ty =T. 
For simplicity we assume that all of the time increments are the same and equal to 0 < 
At < 1 although our results can be easily generalized when this is not true. Now 


Xn =Xnt | Xondan | HXO naw. (18.6.2) 


n 
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The crudest approximation to the integrals in Equation 18.6.2 is 


f area DEO ae. (18.6.3) 


and 
n b[X (9), n] dB(n) © BLX (tn), t&] A Ba. (18.6.4) 


Substituting these approximations into Equation 18.6.2 yields the Euler-Marugama ap- 
proximation.'* For the Itó process X(t) = (X(t),to € t € T: 


Xn+1 = Xn + a(tn, Xn) (tn+1 = tn) + B(tn, Xn) (Binpi E Bi,,) (18.6.5) 


for n = 0,1,2,...,N — 1 with the initial value Xo. 

When 6 — 0, the stochastic iterative scheme reduces to the conventional Euler scheme 
for ordinary differential equations. See Section 1.7. When b Z 0, we have an extra term 
generated by the random increment AB, = B(t,,1) — B(t,) where n —0,1,2,..., N—1for 
Brownian motion (the Wiener process) B(t) = B(t),t > 0. Because these increments are 
independent Gaussian random variables, the mean equals E(AB,,) = 0 while the variance 
is E[(AB,,)?] = At. We can generate AB,, using the MATLAB function randn. 

An important consideration in the use of any numerical scheme is the rate of conver- 
gence. During the numerical simulation of a realization, at time t there will be a difference 
between the exact solution X(t) and the numerical approximation Y(t). This difference 
e(t) = X(t) — Y (t) will also be a random variable. A stochastic differential equation scheme 
converges strongly with order m, if for any time t, E(|e(t)|) = O[(At)”] for sufficiently small 
time step At. The strong order for the Euler-Marugama method can be proven to be 5. 

To construct a strong order 1 approximation to Equation 18.6.1, we return to Equation 
18.6.2. Using Equation 18.4.12, we have 


ini 


Xnai Xs = f Lats) n| + n (aa; + 3D9a,,) d£ + A bay aB()| dn 


n n 


" uu boc.) [ (ab, + $07 brx) ac f bb, Bi) dn (18.6.6) 


tn tn 
= a[X (tn), tnJAt + b[X (tn), tr) AB, + Rn, (18.6.7) 
where 
ini n 
R,= f if bb; B®) dB(n) + higher — /order terms. (18.6.8) 
t tn 


Dropping the higher-order terms, 
tn+1 n 
Rn & BIX (tn), tss [X (tn): ta] f f B®) dB(n). (18.6.9) 
t t 


Consider now the double integrals 


(AB,)? = QU BG) QU aB(n)) z [ [ aB) dB(n). (18.6.10) 


n n n n 


14 Maruyama, G., 1955: Continuous Markov processes and stochastic equations. Rend. Circ. Math. 
Palermo, Ser. 2,, 4, 48-90. 
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Now, 
M ma] mo [ne] mo [me] ms 
+ [wor (18.6.11) 
-2 [| fase] a+ f uB ase.) 
E Ju n Be) dB(n) + At, (18.6.13) 

because 


[wor = dn = At. (18.6.14) 


n 


Combining Equation 18.6.9, Equation 18.6.10, and Equation 18.6.13 yields 

Rn ~ b[X (tn), ts] bs [X (tn), tn] [((ABn)? — At]. (18.6.15) 
Finally, substituting Equation 18.6.15 into Equation 18.6.7 gives the final result, the Milstein 
method:!° 


Ob( X4, tn) 


Xn41 = Xn a(Xn, tn) Atn + O(Xn, tn) ABn + $0(Xn, tn) E» 


[(AB,)? — At]. 
(18.6.16) 


e Example 18.6.1 


Consider the Itó process X(t) defined by the linear stochastic differential equation 
dX (t) = aX (t) dt + bX (t) dB(t), (18.6.17) 


for t € [0, T]. If this Itó process has the drift a(z,t) = ax and the diffusion coefficient 
b(x,t) = bx, the exact solution (see Equation 18.5.16) is 


X(t) = Xo exp| (a- z) eene) (18.6.18) 


for t € [0, T]. Figure 18.6.1 compares the numerical solution of this stochastic differential 
equation using the Euler-Marugama and Milstein method against the exact solution. Note 
that each frame has a different solution because the Brownian forcing changes with each 
realization. 


Although a plot of various realizations can give an idea of how the stochastic processes 
affect the solution, two more useful parameters are the sample mean and standard deviation 
at time tn: 


X(tn) = =X Xj (tn), (18.6.19) 


j=1 


15 Milstein, G., 1974: Approximate integration of stochastic differential equations. Theory Prob. Applic., 
19, 557—562. 
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Figure 18.6.1: The numerical solution of the stochastic differential equation, Equation 18.6.17, using the 
Euler-Marugama (crosses) and the Milstein (circles) methods for various time steps h. The dashed line 
gives the exact solution. 


and 


1 č = 
(tn) = 5 » [X tn) -X tn), (18.6.20) 
where J are the number of realizations and X;(t,) is the value of the random variable at 
time tn of the jth realization. 

In many physical problems, “noise” is the origin of the stochastic process and we suspect 
that we have a normal distribution N(u, o?) where u and ø are the population mean and 
standard deviation, respectively. Then, using the sample statistics, Equations 18.6.20 and 
18.6.21, a two-sided confidence interval can be determined as 


— O(tn) = 0 (l5 
X (tn) — Tstudent d X (tn) + Tstudent T 


based on the student-7 distribution with J — 1 degrees of freedom. 
Project: RL Electrical Circuit with Noise 


An important component of contemporary modelling is the mixture of deterministic 
and stochastic aspects of a physical system. In this project you will see how this is done 
using a simple electrical system.!9 

Consider a simple electrical circuit consisting of a resistor with resistance R and an 
inductor with inductance L. If the circuit is driven by a voltage source v(t), the current 
I(t) at a given time t is given by the first-order ordinary differential equation 

dl 


L^,- RI-v(), —I(0)— 1L. (1) 


16 See Kolárová, E., 2005: Modeling RL electrical circuits by stochastic differential equations. Proc. Int. 
Conf. Computers as Tool, Belgrade (Serbia and Montenegro), IEEE R8, 1236-1238. 
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2 
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Figure 18.6.2: Eleven realizations as a function of the nondimensional time Rt/L of the numerical solution 
of Equation (4) using the Euler-Marugama method when h = 0.02, o/ L = 1, 8/L = 0, Io = 0, and v(t) = R. 
The mean and 95% confidence interval (here tstudent = 2.228) are given by the heavy solid and dashed 
lines, respectively. Finally, the crosses (--) give the deterministic solution. 


Step 1: Using classical methods, show that the deterministic solution to Equation (1) is 


I(t) = Ipe" + I n ew tr = J v() dr. (2) 


There are two possible ways that randomness can enter this problem. First, the power 
supply could introduce some randomness so that the right side of Equation (1) could read 
u(t)+a dB5(t)/dt. Second, some physical process within the resistor could cause randomness 
so that the resistance would now equal R + 6dB,(t)/dt. Here B1(t) and B2(t) denote two 
independent white noise processes and a, are nonnegative constants. In this case the 
governing differential equation would now read 


dl 1l1[,, dBi] 1 
ar^ EE 


oer ju + i I(0) = Ip. (3) 


Converting Equation (3) into the standard form of a stochastic ordinary differential equa- 
tion, we have that 


dT = To) — RI()] dt - 16) dB: (t) + Ë dB), 1(0) = I. (4) 


Step 2: Using MATLAB, create a script to numerically integrate Equation (4) for a given 
set of a, B, Io = 0, R, L, and v(t). Plot I(t) as a function of the nondimensional time Rt/L 
for many realizations (say 20). See Figure 18.6.2. 


Step 3: Although some idea of the effect of randomness is achieved by plotting several 
realizations, a better way would be to compute the mean and standard deviation at a given 
time. On the plot from the previous step, plot the mean and standard deviation of your 
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E[x(t)] 


0 100 200 300 400 500 600 700 800 900 1000 
time 


Figure 18.6.3: Plot of E[r(t)| versus time for the FitzHugh-Nagamo model for three values of s. The 
value of the parameters are a — 0.8, m — 1.2, and 7 — 100. The Euler method was used with a time step 
of 0.1. 


solution as a function of nondimensional time. How does it compare to the deterministic 
solution? 


Project: Relaxation Oscillator with Brownian Motion Forcing 


The FitzHugh-Nagamo!" model describes excitable systems such as a neuron. We will 
modify it so that the forcing is due to Brown motion. The governing equations are 


dz = —z(a? — a?) dt — y dt + o dBı (t), 


and 
dy = (x — my) dt/T + e dB»(t), 


where a, m, c, and T are parameters. 

Write a MATLAB script to numerically integrate this modified FitzHugh-Nagamo model 
for various values of c. Using many simulations, compute E[x(t)] as a function of time t. 
See Figure 18.6.3. What is the effect of the Brownian motion forcing? 


Project: Stochastic Damped Harmonic Oscillator 


The damped stochastic harmonic oscillator is governed by the stochastic differential 
equations: 


dv(t) = —yv(t) dt — k?z(t) dt — ax(t) dB(t), and  dz(t) = v(t) dt, (1) 
where k, a and y are real constants. This system of equations is of interest for two reasons: 


(1) The system is forced by Brownian motion. (2) The noise is multiplicative rather than 
additive because the forcing term is x(t) dB(t) rather than just dB(t). 


17 FitzHugh, R., 1961: Impulses and physiological states in theoretical models of nerve membrane. 
Biophys. J., 1, 445-466; Nagumo, J., S. Arimoto, and S. Yoshizawa, 1962: An active pulse transmission 
line simulating nerve axon. Proc. IRE, 50, 2061-2070. 
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time 


Figure 18.6.4: Plot of E[x(t)] versus time for the damped harmonic oscillator forced by Brownian motion. 
The value of the parameters are k = 1, y = 0.25, and alpha = At = 0.1. Five thousand realization were 
performed. 


We could solve both equations numerically using Euler's method.!? The purpose of 
this project is to introduce you to the Heun method. In the Heun method we first compute 
an estimate of the solution z* and v* by taking an Euler-like time step: 


xz" = ti + viA, and v* = vi — yv;At — k?zi At — omi A Bi, (2) 


where x; and v; denote the displacement and velocity at time t; = iAt, At is the time step, 
and i = 0,1,2,.... With these estimates we compute the value for x; and vi+ı using 


Lig = vj 1(vi - v*)At, and Uj41— vi — iq(vi - v*)At ik (xi Ez*)At— ox; AB;. 
(3) 

Qiang and Habib!? developed a leapfrog algorithm to solve this problem. Because the 
algorithm is rather complicated, the interested student is referred to their paper. 

Write a MATLAB script to use the Euler and Huen methods to numerically integrate the 
stochastic harmonic oscillator when 10a = 4y = k = 1 and x(0) = v(0) = 0. Using many 
simulations, compute E[x(t)] as a function of time t. See Figure 18.6.4. What happens to 
the accuracy of the solution for larger values of At? 


Project: Mean First Passage Time 


'The stochastic differential equation 


dX(t) = [X (t) — X?(t)] dt + dB(t) 


X?(t) 1 


describes the motion of a particle in a double-well potential V(x) = z*/4 — z?/2, subject 
to a spacially dependent random forcing when the acceleration X"(t) can be neglected. 


18 For further details, see Greiner, A., W. Strittmatter, and J. Honerkamp, 1988: Numerical integration 
of stochastic differential equations. J. Stat. Phys., 51, 95-108. 


19 Qiang, J., and S. Habib, 2000: Second-order stochastic leapfrog algorithm for multiplicative noise 
Brownian motion. Phys. Review, 62, 7430-7437. 
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Figure 18.6.5: The mean time that it takes a particle to travel from X(0) = —1 to X = 0 in the double- 
well potential stated in the text. Sixty thousand realizations were used with a time step h. Two differential 
numerical schemes were used: the Euler-Marugama (crosses) and the Milstein (circles) methods. The curves 
are linear least-squares fits through the results. 


An important question is what is the average (mean) time that it takes a particle initially 
located at a minimum X(0) = —1 to reach the local maximum X (t) = 0. 

Write MATLAB code that computes X(t) as a function of time t. Using this code 
and creating N realizations, compute the length of time that it takes the particle to reach 
X(t) = 0 in each realization. Then compute the mean from those times and plot the results 
as a function V/A, the square root of the time step. See Figure 18.6.5. We used vt rather 
than h following the suggestions of SeeBelberg and Petruccione.?? 


Project: Bankruptcy of a Company 
The stochastic differential equation?! 
dX(t) = [uX (t) — iD] dt + e X (t) aB(t), 0<t<T, 


with X(0) = Xo, describes the evolution with time t of the wealth X(t) of a firm. Here u 
and o denote the deterministic and stochastic evolution of the firm’s wealth, respectively, 
Xo is the initial wealth of the firm, and iD gives the amount of payment to a financier (bank) 
who initially loaned the firm the amount D at the interested rate i. Write a MATLAB code 
to simulate the wealth of a firm during its lifetime T given a known D, i and Xo with 
u = 1.001/year, and various values of c. 

During the simulation there is a chance that the firm goes bankrupt at time t = 7 < 
T. This occurs when the stochastic process hits the barrier X(T) = 0. If n denotes the 
number of times that bankruptcy occurs in N simulations, the probability of bankruptcy 
is P[X(r) = 0] = n/N. Using your code for simulating a firm's wealth, compute the 
probability of bankruptcy as a function of interest rate for a small (D = 20, Xo = 100), 


20 Seeßelberg, M., and F. Petruccione, 1993: An improved algorithm for the estimation of the mean first 
passage of ordinary stochastic differential equations. Comput. Phys. Commun., 74, 241—255. 


?! See Cerqueti, R., and A. G. Quaranta, 2012: The perspective of a bank in granting credit: An 
optimization model. Optim. Lett., 6, 867—882. 
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Figure 18.6.6: The probability of bankruptcy over a three-year period as a function of interest rate of a 
firm with initial wealth Xo = 500 and debt D = 100. Other parameters are h = 0.01 yr and u = 1.001/year. 
The units on ø is year- !/ 2. Five hundred thousand realizations were used to compute the probability. 


medium (D = 100, Xo = 500), and large (D = 200, Xo = 1000) firm. See Figure 18.6.6. 
How does the average value of 7 vary with interest rate? 


Further Readings 


Kloeden, P. E., and E. Platen, 1992: Numerical Solution of Stochastic Differential Equa- 
tions. Springer-Verlag, 632 pp. A solid book covering numerical schemes for solving stochas- 
tic differential equations. 


Mikosch, T., 1998: Elementary Stochastic Calculus with Finance in View. World Scientific, 
212 pp. Very well-crafted book on stochastic calculus. 


Answers 
To the Odd-Numbered Problems 


Section 1.1 

1. first-order, linear 3. first-order, nonlinear 5. second-order, linear 
7. third-order, nonlinear 9. second-order, nonlinear 11. first-order, nonlinear 
13. first-order, nonlinear 15. second-order, nonlinear 

Section 1.2 

1. y(x) = —In(C — a?/2) 3. y?(z) — In? (z) = 2C 

5. 2+y?(x) = C(1 + 2?) 7. y(x) = —In(C — e?) 


9 ay? (t) cub = e 3t 
^ ay —b 


Vo S e tU (RC) 


18. VO= S y PELO 


15. N(t) = N(0) exp{In[K/N(0)] (1 — 7") } 


PARC 
“Ty = Bh) Ah Ci) e - x) 
(rg) 
(B]e — [A]g) (Blo — (Clo) \ [Blo — [X] 
: in( | lo ) = kt 
([C]o EE [A]o) ([C]o pe [B]o) [Clo E [X] 
Section 1.3 


1. Injy|—2/y=C 3. |æ|(£? +342) 2C 5. y=a(n|2|+C) 7. sin(y/z) 2 In|r| = C 
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Section 1.4 
1. zy?— i335 =C 3. zy? — x + cos(y) = C 5. y/z -li((y) =C 
7. cos(zy) = C 9. ^y? -a?y Hy =C 11. zyln(y) + e” -e * = C 


Section 1.5 


1. y(x) = łe” + Ce^*, x € (—00, 00) 3. y(x) = ln(x)/x + Cr !|, x#0 
5. y(x) = 2x7 In(z) + Cz’, x € (—0o, oo) 
Ter yl C, nr+ọ < 2a < (n+1)r+ọ, where y is any real and n is any integer. 


x“) = 
9. y(x) = $+ He, x € (—00, 00) 11. y(x) = (x + C) esc(x) 


_ cos(z)y(0)  , c cos^ (a) * [sec(£) + tan(£)]" 
NO e tanta sec etant P | orr 5 
E DUE i ui a sin(2wz) — wcos(2wx) 
pe 8a? ` 8a?(a? + w?) 8u(a? + w?) 


17. (xz) 22(z— z?/*)/(2— k) if kz2; y(x)=aln(1/x) if k=2 
ki[A], 


19. [A](t) = [A] e", [B] (4) zi a [e ^t E eit 
e kat _ 9e fat 
[Cl() = [Alo (1 phe ) 
Sha Cre 23. y(x) = [Ca? + 4x? In(z)]^ 


25. y(x) = [Ca — zln(a)]"? 


Section 1.6 


5. The equilibrium points are x = 0, i. 


the equilibriums at x — 0 and 1 are stable. 


and 1. The equilibrium at z = E is unstable while 


7. The equilibrium point for this differential equation is x = 0, which is stable. 


Section 1.7 

1. z(t) 2 ef +t+1 3. a(t) = [1 - In(t + 1)! 

Section 2.0 

1. yo(x) = A/x 3. ya(z) = Art 5. yo(x) = A(x? — x +1) 

7. yo(z) = Asin(z)/ yz 9. y(z) = Cze? o 11. y(z) = (1+ Coe?*) /C1 


13. y(x) = -In|1 - z| 15. y(z) = Cy — 21n(z? + C2) 
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Section 2.1 


1. y(x) = Cie? + Coe 9? 3. y(x) = Cie” + Coxe” 
5. y(x) = Cie?” cos(2x) + Coe?" sin(2x) 7. y(x) = C1e t? + Coe 
9. y(x) = e74" [C1 cos(3x) + C» sin(3z)] 11. y(x) = C1e^** + Core™? 


13. y(x) = Cy + Cox + C3 cos(2x) + C4 sin(2x) 
15. y(x) = Che?” + Cpe~* cos(V3 x) + Cae? sin(V/3 x) 
17. y(t) = e 9? { Aexp[tV/1 = 2Ar/(27)] + Bexp[-tV1 — 247/(27)]] 


Section 2.2 


1. x(t) = 2/26 sin(5t + 1.7682) 3. x(t) = 2cos(rt — 1/3) 


5. a(t) = so cos(wt) + 2 sin(wt) and v(t) = vo cos(wt) — wso sin(wt), where w? = Mg/(mL). 
w 


Section 2.3 
1. x(t) = 4e7% — 2e7% 3. a(t) = e75/2 [4cos(6t) + 33 sin(6t)] 
5. The roots are equal when c = 4 when m = —2. 


Section 2.4 
1. y(x) = Ae ?* + Be * + ix- i 3. y(x) = e *[Acos(x) + Bsin(x)| +x? — z +2 
5. (zx) = A+ Be ?* + ix? 2r Ele 79 7. y(x) = (A+ Buje + (1x - Z)) e* 


9. y(x) = Acos(3z) + Bsin(3z) + 152? sin(3z) + $z cos(3z) 
11. y(2) = 2ar — 1 rigs Sead asin(2wa) — wcos(2wx) 
ach 8a? 8a? (a? + w?) 8w(a? + w?) 
Section 2.5 
FK t— 
Kaca Beate OU Accs) d-Baintut] o aet v) 
Jew? + (k — mw?)? 
Section 2.6 
1. y(x) = Ae” + Be?” + te~” 3. y(x) = Ae?* + Be ?* — (3x + 2)e*/9 
5. y(x) = (A+ Bz)e ?* + a2e 7?" /6 T. y(x) = Ae?* + Bre” + (fa? + 129) e? 
9. y(x) = Ae” + Bre” + zln(r)e* 
Section 2.7 
1. y(x) = Ciz + Cox! 3. y(x) = Ciz? + Co/x 


5. y(x) = Cy /a + Coln(z)/x 7. y(x) = Ciz cos[21n(z)] + Coz sin[In(a)] 
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9. y(x) = Cı cos[In(z)] + C2 sin[In(z)] 11. y(z) = Ciz? + Caxt + Cs/x 
Section 2.8 


1. The trajectories spiral outward from (0, 0). 
3. The equilibrium points are (2,0); they are unstable. 


5. The equilibrium points are v = 0 and |z| < 2; they are unstable. 


Section 3.1 
= 5\_ 
TM 
7 10 15 21 
3. sa-2B=( 7 i seA- 5 - (5 » 


T (4 3 T,pr [4 3 
s aesre(i3). aresre(i 3) 
[mua rp [f5 5 f4 6 r, [5 8 
t ase (3. 2), arse (5 5) nau (4 rE 


muc A TENE UNES 
9. BB -( 3 Been J 


3 _/65 100 
naoa- (SS 10) 
11 8 
13. yes @ x) 15. ys | 8 4 17. no 
2 5 
5 3 
10 10 
19. 5(2A) = | 10 20 |] =10A 
30 10 
4 0 
21. (4+B)+0=(5 2)-4« G0) 
9 -1 
23. Ago) - (5 3) = AB+ AC 
1 -2 Tı > 5 
(3 1)(2)-() 
0 1 2 3 T1 2 
3 0 -4 -4 ty) _ 5 
a 1 1 1 1 33] | -3 
2 —-3 1 -3 T4 7 
Section 3.2 


1.7 3.1 5. —24 bo 
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Section 3.3 
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1. t= 2, X429 = 3 3. Tı 0, T2 0, X3 2 
Section 3.4 
1. xı = 1, z2 = 2 3. 31 = 23 = Q, £2 = —Q 
5. x1 = — 1l, £2 = 2a,%3 = Q T. 24 = 1,22 = 2.6, £3 = 2.2 
Section 3.5 

2 1 
1. A= 4, x -a(1); A--—3 x -5( » 

—1 3 1 
3.A—1 xo;—-a«a| 0 |—-8[|1]; A=0, xo=y7{ 1 

1 0 1 

1 0 1 
5.A=1, xx=za|0]+8| 1 |; A=2, xo—y|1 

0 —1 0 

1 3 7 
T. A20, xop=afl];A=1, x=8|2];à=2, x =7] 3 

1 1 1 
Section 3.6 


lx=c, (J Jette Gee 


5. x—0 ( 4)e* +e ea 


" ( 2 cos(2t) — 3sin(2t) ) E 


9. 


* 


|. ( —3cos(2t) — 2sin(2t) Y , 
-«( irs Je T 


2 cos(t) 


cos(2t) 


3. X= (3) et + es e et 


sin(2t) 


— cos(2t) 4- pum CE (^ cos(2t) — e) ái 


sin(2t) 
0 0 2 
17.x=c, 11] +ce.{2]eé+e,] 1 e?t 
0 1 0 
0 
1 e^t 
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Section 3.7 


t tot 14244 t t 9425 
t t e te’ zte e 2te' 2t*e 
1. (o gie ) 3. 0 & té RENE  2te 
Í 0 0 e 0 0 ë 
7. 
y(t) = å (3e! — e) (0) + 3 (e * — e*) (0) + 3te* — 3sinh(t) + 2t 
x(t) = 4 (3e! — 3e *) xı (0) + § (3e * — e*) x9(0) + 3te + 2e * — 5sinh(t) + 4t — 2 
9. 
x1(t) = [cos(t) + 2 sin(t)]a1(0) — 5 sin(t)a2(0) + 2t cos(t) — tsin(t) — 3 sin(t) 
X(t) = sin(t)z4 (0) + [cos(t) — 2sin(t)]x2(0) + t cos(t) — sin(t) 
11. 
zi(t-i [c^ + 2e] xı (0) + i [e^ — e] x2(0) + i fe“ — e*] x3(0) 
UST 4t, 44 44 
6°. Ge "Or one 
v(t) = 4 [e — e] x, (0) + 4 [e*t + 2e*] x2(0) + i fe“ — e] zs (0) 
QUA 44,44 4, 
r wm Ad. 
a3(t) = 4 [e* — e'] a(0) + 1 [e** — e*] za(0) + i [e*t + 2e*] x3(0) 
4 Bie Au 4 
6 uq* vas car 
13. 
n(0 = (Ge e) (0) (fe - 37) 25(0) + 8 (e 6) 1 (et — et) 
x(t) = e~**a9(0) + 2 (e — e?) 
valt) = (6% — 67) a1(0) + (Be + Fert) es(0) + § (e! — e) +} (e — et) 
15. 
zı(t) = (1 — t)e?'z, (0) + te x2(0) + (2t + t?)Je”rs(0) + (t° + {t+ 4) e” — 3t- 1 
xolt) = —te” x1 (0) + (1 + t)e™z2(0) + (4t + t?)e**x3(0) 
(C49 pae 
z3(t) = e?*z3(0) +e” — et 
Section 4.1 
lLaxb--3i-19j--10k 3.axb-i-8j-4 7k S as bocicansesie 


9. Vf = ycos(yz)i + [x cos(yz) — xyz sin(yz)]j — zy? sin(yz)k 
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11. Vf = 2zy?(2z + 1)7i + 22?y (22 + 1)?j + 4a? y? (2z + 1)k 
13. Plane parallel to the zy plane at height of z = 3, n = k 


15. Paraboloid, 
2ri 2y j k 


+ 
V 14r? +4y? y1++4r?+4y2? 1+ 4x? + 4y? 
17. A plane, n = j/ V2 — k//2 


19. A parabola of infinite extent along the y-axis, 


n = —2ri/v 1 + 4r? + k/V1 + 4r? 
21. y =2/(x +1); z = exp[(y — 1)/y] 
23. y z; 2? =y/(3y — 2) 


Section 4.2 


1. V.F 22zz4 22, V xF = (ry -2yz)i + (z? -?)j, V(V - F) = 2zi + (22 + 2z)k 


3. V- F 22(r— y) — ze ?* +2079, V x F = 2rze™i-— ze?!3 + [2(x — y) — ye ?"] k, 
V(V-F) = (2 — e** + zye^*" +e”) it (a2e77*9 +2re% — 2)j 


5. V-F=0, VxF--z^i-cr(5y-9z?)-(2zz—5z)k V(V-F)=0 
7.V-F=e-%+4+27-3e7-7, Vx F = —2yzi - xe 4k, V(V-F) = —e^Vj + (22 + 3e ?)k 


9. V- F = yz + 2? ze? + rye’, 
V x F = (ze? — z?ye? — a?yze*)À + (xy — ye?)j + (3z?yze? — xz)k, 
V(V - F) = (32?ze? + ye?)i + (z + ze*)j + (y+ 33e? + x ze* + zye?)k, 
11. V. F =y? +22? — xy sin(z), 
V x F = [zcos(z) — 2zyz]i — y cos(z)j + (yz? — 2xy)k, 
V(V -F) = [2? — ysin(z)ji+ [2y — xsin(z)]j  [2zz — xy cos(z)]k 


13. V. F = y? + zz — xzysin(z), 
V x F = [xcos(z) — xyli — y cos(z)j + (yz — 2ry)k, 
V(V-F)- [z — ysin(z)]i + [2y — zsin(z)]j + [x — zy cos(z)]k 


Section 4.3 


1. 16/7 + 2/ (3n) 3. e? + 26/3 + e64 /2 — 13/6 5.—4m 7. 0 9. 27 


Section 4.4 


1. (x,y,z) = x?y + y?z + Az + constant 3. (x,y, z) = xyz + constant 
5. p(a,y, z) = a? sin(y) + ze’? + Az + constant 


7. p(x, yY, z) = re?” + y? + constant 9. p(z, Y, zZ) = zy + xz + constant 
Section 4.5 
1.1/2 3. 0 5. 27/2 7.5 


9. 0 11. 40/3 13. 86/3 15. 967 
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Section 4.6 
1. —5 3.1 5.0 7.0 9. —167 11. —2 
Section 4.7 
1. —10 3. 2 5. T 7. 45/2 
Section 4.8 
1.3 3. —16 5. 4m 7. 5/12 
Section 5.1 
1 2^. sin[((2m — 1)t] 
1. f= 
f 2 = 2m — 1 
je 1-32(-1)* . 
: t ————— $8 t 
3 ps T ian )+ a sin(nt), 
2 cos rm sin? (nr /4) sin(nz/2) . 
Deaf 2» cos(nt) + ar sin(nt) 


Qe n CE cos( 22t) 


m a2 L2 +4 n2 72 


n(—-1)”  , fmnt 
—2m sinh(aL) > SI. ag sin( L ) 
=1 


2  cos(2mt) 
= 4m? — 1 


TW 50-5 E 3 CENE, cos] Om= Ut) EE ex De a (®t) 


m=1 nat 


9. fü)- | ; sin(t) 


13. f(t) = ^ 1 TE 1 > E TOES 4a es Y mn 


n=1 m=1 


L f(a) = T : 5 cos|(2m — Dar fle) = : Y (—1)^*! sin(nx) 


2 lím Qm—1)? m n 
3 2 C? MTE 2 9 Tn — T 
o£) = 5 = m E E a ) F(z) = = Sno sin| = | 
m=1 m=1 
122 2(2m —1)rx 4 «& (—1)"*! sin[(2m — lyra 
hdc Eo S =e) Gee 
m=1 m=1 


Answers to the Odd-Numbered Problems 


a 4a « (—1)* sin[(2m — 1)7/6] (2m — 1)zx 
9. f(z)— st 25 (2m — 12 zi | 


f(a) = = Y (—1)™ sin(mz/3) sin( 222) = cU sin("™*) 


m2 


n. f(z) = : " : Y ele s, [Eme 
ugs : y 1 + cos(nz/2) — 2(—1)” sin( 277) 


n 


n=1 


3a 2a q^. cos(n/2) - 1 NTT 
13. f(z) = 8 | =oD 2 cos( E e 
n=1 


Section 5.4 


sin[(2n — 1)t] m ee 2^. cos[(2n — 1)t — 7/2] 
E 5+ 2 2n — 1 DOG up 2n — 1 


(i) = DD Los [nee Cay T]. £0) = 29> t sinfni + [1+ (02) 


3 
Il 
un 


Section 5.5 
T 2 o9 ci(2m-1)t i oo enit 
T. fit 3. f(t) 214-— 
to 7 5 z 2. (m-1» f(t) bo - 
nz 
1 i pe e2(2m- 1)it 
: t)h= 
s. f() 2 223 2m —1 
Section 5.6 


25 sin[( nt 
T (2n — 1) + (2n — 1)? 


1. y(t) = Acosh(t) + B sinh(t) 


— xcd "ACE: cos|(2n — 1)t] 
3. y(t) = Ae“ + Be + ET 2 [2 — 2n — 1)2]2 + 9(2n — 1)? 


2 [2 — (2n — 1)?] sin[(2n — 1)t] 
2 (2n — 1){[2 — (2n — 1)?]? + 9(2n — 1)?) 


oo 


oo i(2n— 
ci Qn 1)t 


5. wlt) = 3-- 2. (2n — 1)?[4 — (2n — 1)?] 


BO 2 
W Pn inwot 


7. q(t) = 
(t) >, (inwo)? + 2ianwo + w? 


n=— 
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Section 5.7 
1. f(t) = $ — cos(1z/2) — sin(n2/2) — $ cos(nx) 
Section 6.1 


1. An = (2n — 1?s?/(AL?), y, (x) = cos[(2n — 1)nz/(2L)] 

3. Ag = —1, yo(x) = e7? and A, = n°, y,(x) = sin(nz) — n cos(nx) 
5. An = —n*a*/L^, ys (x) = sin(nrz/L) 

T. Àn = k2, ux) = sin(knx) with kn = — tan(kn) 


9. Ag = —mZ, yo(x) = sinh(mox) — mo cosh(mox) with coth(mom) = mo; An = k2, 
Yn(X) = sin(k,z) — kn cos(k,z) with kn = — cot (knr) 
11. 

(a) An = n?z?, Yn(x) = sin[nz In(x)| 

(b) An = (2n —1)°x?/4, Yn(x) = sin[(2n — 1)x1n(z)/2] 

(c) Ao=0, yo(x) = 1; A, =n T, y. (x) = cos[nz ln(x)] 


13. An = n? +1, y, (z) = sin[nIn(z)]/x 
15. A = 0, yo(x) = 1; yn(x) = cosh(Anx) + cos(Anx) — tanh(A;)[sinh(A5z) + sin(Asz)], 
where n = 1,2,3,..., and A, is the nth root of tanh(A) = — tan(A). 


Section 6.3 


1. f(z) = : »» co sin(“**) 3. f(x) = E E S sin] - re] 


n=1 n=1 


Section 6.4 


1. f(x) = 4 Po() + ain) jg PG) 3 f(x) = 4 Po(a) + $ Pr) — is Pa(z) + 
5. f(z) = 5Pi(2) = g P(x) ig P5) 
Section 7.3 
Eu 1 (2m — 1):z] . [(2m — 1)mct 
1. u(x,t) on? Gm — ly sinf L | sinf T | 


l no "m zl (2n “| sinf (2n — pze] sif (2n — Dzet 


7. u(x,t) = 


2 » CUN sin] - a) T | (2n — Dzet 


Answers to the Odd-Numbered Problems 


PES T 
where k,, is the tb solution of e = 0. 
-I (2n — l)zz]| . |(2n — l)mct 
11. u(x,t) 2 gel 217 sin 237 
Section 7.4 
1. u(x,t) = sin(2x) cos(2ct) + cos(x) sin(ct)/c 
lc-az?400 e” sinh(ct) 
3. t) = 
UR (1 + x? + c2t2)? + Ax? 0? t c 
DTE cos( Z2) TA met l sinh(ax) sinh(act) 
2 2 ac 
Section 8.3 
_ 4A X sin[(2m — 1)a] ~a2(2m—1)t 
1. u(x,t) x 20 PETERS 
> [D —a?n?t 
3 t)=-2 duc 
u(z, t) > E sin(nx)e 
4 > yer : —a? (2m—1)?t 
5. u(z,t) = 7 2, Om- 1j sin[(2m — 1)a]e~* ( ) 
Tdi T 4 Y cos[(2m — 1)z] _42(2m—1)24 
DUTI £4 m- ^ 
a A4«-cos[2n-—1)z] _ 2(2n—1)%t 
. t = a TL 
drum cus = Qn—-12 * 
_ 2S ng (2n — 1)z] 4205,44» 
11. a? (2n—1)?t/4 
I T os E zi 2 Je 
ia ae ec sin[(2n — 1) )z/2].— a? (2n—1)?t/4 
T 2n— 1 
n=1 
15. u(x,t) = i i aee E (2n — Ia e72? Qn-1?t/A 
Í : = |2n-1 (2n—1)m 2 
Toy | 279 1 Lahn 
1 t 
7.u(x, t) - = p» F sin(nx)e 
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1 
23. u(x,t) = 3 t = D cos( (nza)e-* mt 

4 x (-1)n*! 2 
25. Be: = | _ 54—(2n-1) i 
5. u 0-5) Ge Lus z sin[(2n — 1)z] |1 — e 

A 2:: 2 
27. u(x,t) = oh 29 QUE 


h 
"ar » sin(,) (22) ( A) 
BALL + esimh()/AL] A LJ PN IB 


where Bn is the nth doof of 8 tan(B) = hL/k. 


sin(k, L) cos(k,x) 3,9 
2 t) 24 ` , s 
9. u(x,t) = 4uo 1 Dey L F sink, L) cos(k,a) exp[— (ki + a^ k2)t] , 


where kn denotes the nth root of ktan(kL) = ka/a? 


2 e (-1)* 
31. u(r,t) 2 — [DE sin(nzr)e ^ "^7: 
mp 
_ Abuo xc. sin(ks) — kn cos(ks) . ( kar | 22a? 
33. u(r,t) — Ll s [Ds = sin, )] sin > Je ; 


where kpn is the nth root of kcot(k) = 1 — A, nm < ky < (n + 1)m. 


Jo(k um e 9 rst 
35. u(r,t) — D» Ten Ji ULT 
where kn is the nth root F pom i 
G 2Gb? A Jo(k vn 24/2 
. ES p? 2 o( e "Fato 
37. u(r,t) = 3 (9 - 7) iu 2n iJ. 


where kpn is the nth root of ros = 0. 


Ty NS Edi (es D) — bi (knb)]Jo(knr) ona 
39. u(r, t) = ape x eie B Js D pue 
where kpn is the nth root of kJi(kL) = hJo(KL). 
Section 9.3 

_ 4 c^ sinh[((2m — 1)r(a — x)/b] sin[(2m — 1)ry/b] 
pue lis (2m — 1) sinh[(2m — 1)ra/b] 

2a nh(nzy/a) sin(nzz/a) 

iis T == n sinh(nzb/a) 

_4 z 544 Sinh[(2n — 1)ry/2a] cos[(2n — 1)rz/2a] 
ue 2 y" (2n — 1) sinh[(2n — 1)x6/2a] 
7. u(z,y) =1 

= 4 «X cosh[(2m — 1)ry/a] sin((2m — 1)rz/a] 
mue (2m — 1) cosh[(2m — 1)zb/a] 


Answers to the Odd-Numbered Problems 


11. u(z,y) 21 
13. u(z,y) = To + AT cos(2na/ A)e ?79/^ 


AyL x cosh[(2m — 1)zz/L) m 


Po UUE + (2m — 1)? sinh((2m — 1)2/1] sin( L 


Aa?z n 24a SY sinh(k,z) Jı (kna) Jo(k,r) 
b2 b 49  kicosh(knL)J§(knb) ' 
where kn is nth positive zero of J4 (kb) = v =0. 


17. u(r, z) = 


(z — h)ró h)rà sinh|k h)/a] Ji (ksro/a) Jo(knr/a) 
2r 
Y X a > cosh kaha) 72k.) 
where kpn is nth deis Zero Er EAR = 0. 


19. u(r, z) = 


21. Case (a): 

| 4 x fl [Qn — 1)rr/2] sin[(2n — 1)n(z — 1)/2] 
uper 2 Ihn — 12/2] 2n —1 
Case (b): 


Y cosh(k,z) Ji(k,r) 
= cosh(kn) KnJo(kn) 
where kn is the nth root of Jı (k) = 0. 


u(r, z) = 


23. u(r, z) = 5 (-1)" A s(n) 


sinh(knz)Jı (knr) 
2 J 2 
K D na) Jy (Kn) 


where kn is nth positive zero of kJo(k) = Jı (k). 


dodi = = 200) Ji (in) Jo e cosh[us (L — z)/a] 
1 Hn [Jo (un) + Ji(us)) cosh(unL/a) 
where pn Ji (fin) = Tlu) B — aK/D and n = 1,2,3,.... 


cosh(k an knd knr 
— M 
if |z| « d/2, and 
sinh{k,d/( = knlz] knr 
u(r, z) =- > REATO exp z Jo = 
if |z| > d/2. 


> 1— /1 F 4k? )/2]Jo(kn 
31. u(r,z) 2 2BV7 exp[z(1 — /1 + 4k2 )/2]Jo( 5 
aci (k2 + B?) Jo(kn) 
where kn is nth positive zero of kJ, (k) = BJo(k). 


35. u(r, 0) = 100{ 7. Pileos(0)] » f (2) Paleos(0) us (e Picosto)]) 


, 


37. u(r,8) = Siny es 1 [cos(a a)] - Prsaleos(a)]} (“)" Pa [cos(6)) 


n=0 
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Section 9.4 


O04 R A S (HI) 
boe cum 2,25 (2n — (2m — 1) 
cos[(2n — 1)rz/2a] cos[(2m — 1)ry/b] 
(2n — 1) 2m - 1) n — 1)2/a? + (2m — 132/87] 


x 


Section 10.1 
1.142% 3. —2/5 5.2--2i/3 7. 4e™ 9. 54/2e97i/4- 11. 2627/8 


Section 10.2 


1. V2, V2 E + x : +V/2 (- 


i 
$ PD 29 


1 EL 
MC. 


: | 


Ea (-V a? + D? pastos a? + b? +a) T. +(1 +i),  Xx2(1- i) 


Section 10.3 


1. u=2—y, v=2 3. u= z? — 3ry?, v = 3r?y — y’ 

5. f'(z) = 3z(1 + 22) 7. f'(z) 2 2(0 +4i)z -3 

9. f'z)— —3i(iz —1)-^ 11. 1/6 

13. v(z, y) = 2xy + constant 15. v(x, y) = zsin(x)e * + ye ¥ cos(x) + constant 


Section 10.4 

1.0 3. 2% 5. 14/15 — i/3 
Section 10.5 

1. (e? — e~*)/2 3. 1/2 

Section 10.6 

1. 71/32 3. vi/2 5. —2ri T. 2nd 9. 67 


Section 10.7 


1. Sha +1)z” 
n=0 


8 T 
— 10 9 a d m 1 eee 
i ie ERE RN EE mz 


We have an essential singularity and the residue equals —1/11! 


1 22 gt 


We have a removable singularity where the value of the residue equals zero. 
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7. f(z) = 


We have a simple pole and the residue equals —2. 


bod. 1 --2 
DENG) OL um 


We have a simple pole and the residue equals 1/2. 


Section 10.8 

1. —37i/4 3. —2mi. 5. 2ni T. 2ri 

Section 10.11 

3.2— C r—m-mni 5.z — 1/4 or r= etl? 

Tomum = cosh” !(r), 0 € S[cosh ! (7)) < « 

Section 11.3 

1. vell /la| 

Section 11.4 

1. -t/1 +)? 3. f(t) = 1e-' H(t) + 1e! H(-t) 

5. f(t) =e" H(t) — e™ H(t) + 4te™™? H(t) 7. f(t) = ie“ H(t) /2 — ie H(—t)/2 
9. f(t) = (1 — alt|)e~ tl / (4a) 11. f(t) = (-1)* 2» eat A(t) /(2n +1)! 
13. f(t) =e" H(-t) - e! H(t) 


Oo innt 


ie 1 e 
15. — 
Js ) R2 — e24 s 2Rt+2 P» nma + [In(R) _ a] i 


—at ee innt 


_ ze , H(t — 2) € 
1-0 — uc G3 y Dae in) di 


d f6) = -7 He) E 

m x (ycos[(2n — 1)8nt/2L] + sin(2n — 1)8nt/2L]) 
Section 11.6 
1. y(t) = [(t — 1)e* +e] A) 3. y(t) = e * H(t) + [ge — ite] H(t) 


Section 11.7 


1 oa) = Afr t(1=2) tan (2)] 8 wea) = 2 [Eta (E) 


960 Advanced Engineering Mathematics with MATLAB 


Ti 1- 1 Ti — Ti —1)? 2 
5. u(z,y) = — | tan? RE ) + tan“! a pln Bevo 
T y y 2T x2 + y? 


T- 1— 
jpmp e x [ian =) + tan”! (z 
y y 


Section 11.8 


b— b 
1. uet) = 3 ef ( =) i } ext( 7) 


3. u(x,t) = To arf( ==] ATs erf( e) 


Section 12.1 


1. F(s) = s/(s? — a?) 3. F(s) = 1/s + 2/8? + 2/5? 

5. F(s) = [1 — e-26-V] /(s — 1) 

7. F(s) = 2/(s? 4-1) — s/(s? + 4) + cos(3)/s — 1/5? 

9. f(t) = e7’ 11. f(t) = § sin(3t) 

13. f(t) = 2sin(t) — Bt? + 2e™* — 6cos(2t) 17. F(s) = 1/(2s) — sT?/[2(s?T? + 7?)] 
Section 12.2 


1. f(t) = (t — 2)H(t — 2) — (t- 2)H (t —3) 3. y" +3y' + 2y = H(t— 1) 
5. y" + Ay! + Ay = LH (t — 2) 7. y" — 3y + 2y = e ! H(t — 2) 
9. y" +y =sin(t)[1 — H(t — x)] 


Section 12.3 


1. F(s) = 2/(s? + 2s + 5) 3. F(s) 22e^5/s? 2e 5/s? + e^ 5/s 
5. F(s) = 1/(s — 1)? + 3/(s? — 2s + 10) + (s — 2)/(s? — 4s + 29) 

7. F(s) = 2/(s +1)? + 2/(s? — 2s + 5) + (s + 3)/(s? + 6s + 18) 

9. F(s) 22e 5/s? + 2e 5/s? + 3e */s + e ?5/s 

11. F(s) = (1+ e ?7)/(s? +1) 13. F(s) = 4(s + 3)/(s? + 6s + 13)? 
15. f(t) = ite% — ite” 17. f (t) = e™ cos(t) + 2e™ sin(t) 
19. f(t) = — 2te ?* + cos(t)e * + sin(t)e* 

21. f(t) = PES — 3) 

23. f(t) = eY [cos(t — 1) — sin(t — 1) H (t — 1) 

25. f(t) = cos[2(t — 1) H (t — 1) + 1(t — 3)3e?0-3) H (t — 3) 

27. f(t) = {cos[2(¢ — 1)] + 4 sin[2(t — 1)]} (t — 1) + (t — 3) H (t — 3) 

29. f(t) = t[H(t) - H(t- a); F(s) = 1/8? — e^?*/s? — ae-**/s 


31. F(s) = 1/s? — e^ 5/s? — e ?5/s 33. F(s) = e 3/s? — e ?5/s? — e 35/s 
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35. Y (s) = s/(s? +4) + 3e-^5/[s(s? +4] 37. Y(s) = e G-9/[(s — 1)(s + 1)(s + 2)] 


39. Y (s) = 5/[(s — 1)(s — 2)] + e^ */(s?(s — 1)(s — 2)] 
4367" [sf (s — 1s — D] + e */s(s — Ds - 2) 


41. Y (s) = 1/[s?(s + 2)(s + 1)] + ae~*/[(s + 1)?(s + 2)] 
—e 9$ f/[s?(s + 1)(s + 2)] — e %*/[s(s + 1) (s + 2)] 


43. f(0) — 1 45. f(0) — 0 47. Yes 49. No 51. No 


Section 12.4 


1. F(s) = coth(Z) fce +1) 3. F(s) = ee 


Section 12.5 


1. f(t) 2e t — et 3. f(t) = et — 2e?! — Let 


5. f(t) = e? cos(t + 32) 7. f(t) = 2.3584 cos(4t + 0.5586) 


9. f(t) = 4 + X2 cos(2t + &) 


Section 12.6 
11. f(t)=e'-t-1 


Section 12.7 


1. f(t) 2 14 2t 3. f(t) 2t - ?/2 

5. f(t) = t? + 15/20 7. F(t) = t — tt/3 

9. f(t) = 5e% — 4et — 2tet 11. f(t) = (1 — t)?e™ 
13. f(t) = a [1 — e™ erfe(/nt)] /2 15. f(t) = i? 


1T. x(t) = [e^ [1 + erf (evit)] ee. c- l= 2eV/t]n} Jc? 
19. f(t) — a -- et + Laerf(Vat) + à3terf(v/at) + a? vte tt / Va 


Section 12.8 


1. y(t) = et = ; + st 3. y(t) = e” — e” 

5. y(t) = $e? + Fe" cite! — T. y(t) Set + fet - fet 

9. y(t) = (t — 1) H(t — 1) 11. y(t) = e% —e + [1 + 102070 — e1] H(t- 1) 
13. y(t) = [1 — e-?*-» — 2(t — 2)e2*-2] H(t — 2) 

15. y(t) = [1e2*79 — Set? + 197072] H(t — 2) 

17. y(t) = 1— cos(t) — [1 — cos(t — T)] H(t — T) 

19. y(t) 2e * — ie?! — 3 4 t [e 479 — 1e-20-9) — $-i(t—a)|H(t— a) 


e-3t-9) + (t — a)e- 6-9 — I] H(t — a) 


+ 
Q 
Nile 
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21. y(t) = tet + 3(t — 2)e*-2 H(t — 2) 


23. y(t) = 3 [e~2¢-2) — e-30-2] A(t — 2) 
44 [e —3(t-5) _ g-2(t— 5)] H(t— 5) 


25. x(t) = cos(v2t)e?t A sin(V2t)e; y(t) = P sin(/2t)e? 
27. z(t) =t— 1-- e * cos(t), y(t) = t? — t + esin(t) 


(t) = 
29. x(t) = 3F1 — 2F; — F, cosh(t) + Foe! — 2F; cos(t) + F» cos(t) — F> sin(t) 
(t) = Fy — 2F; + Fie™ — F cos(t) + F; cos(t) + Fi sin(t) 


Section 12.9 


1. f(t) = (2 — t)e?! — 2e73t 3. (f) = (40? - 14. 1) e* - 1 
5. f(t) = ae -1)-i-cie?€77] H(t — 1) 


, sin[(2n — ERAS 
Poi GREG ab) 80h 9-1 4a?b? + (2n — 1)?x 
2n — 1)r cos|(2n — 1)«t/(2a)] 
ds Yen 4a?b? + (2n — 1)?z? 


Section 12.10 


l. u 


4 s og E sin|(2m — 1)rz]sin|(2m — 1)rt] 
2 (2m — 1? 


5. u(x,t) =c% f(t — x/c— 2nL/c)H(t — x/c— 2nL/c) 


n= a 


ee f(t - x/c - 2mL/c)H (t + x/c - 2mL/c) 


7. u(x,t) = xt — te~® + sinh(t)e^* + [1 2 e 7? +t- x — sinh(t — x)| H(t — x) 


gr  2gw? sin(A,2) cos(Ast) s 
9. u(r,t) = -2 T, 2 X (uA +u2/L 4- X3) sinis D^ where An is the nth root of 
A = w? cot(AL). 
u AE (2n — l)re (2n — 1)ent 
11. u(x,t) = 2 7 sin] z | cos] z , or 


z M cA i ESI nat IE 
n=0 n=0 


eng D 2 u sin| E - nze] sin| E - pert) 


2L 2L 


15. u(r,t) = CTS 2 mU ete Cs - 5) sin(Br) + costó) 
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-e* pn. where T = t — (r — a)/c. 


3 2 6 272 nèr’ 


2t x? 1 = t)  2si t 
17. u(z,t) = us 22 ” cos(n7a) E ) | aari 
NAT 


Section 12.11 


11. u(x,t) = 1 ,o7t—pt = ente( = o i) | É ene( si +o J] 
= 3 ptc 2t Pe 2t 
P eit Berto = Ti + vi) 


p-c 
ids A ax) | pio L) x(x- Ta 2L) 
TIN 
jd ec z E sin( Z2) exp( =. 
15. u(x,t) 


4q )" cos|(2n — 1)rz/2L] 
-*Y5 xw EN nai 12a? /AL?] 
x {1 — exp[agt — (2n — 1)?z?a?t/AL?]) 


17. u(r,t) = uo | = 2— y. J + [ h E 2— py t— n g(r) dr, 


where f(r,t) — at(7- 4) ew (57 + 5) erfe( 2 + —) 


4u Aw? = 3 Ne 
mL 2X4 (GENO + EAR + WAR + SEEN, Fut 


where A, is the nth root of A? + 244372 coth( £ Av) / (mov) 4 o? — 0. 


19.y(t) = 


sin(A4z) exp(—A?t) 
Zu Te 2 [Sa +3 + A2 /(3aJ] sin" 


where A, is the pet Bot “of Acot(A) = (3a + A?) /3a. 
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23. u(x,t) 21— NL e "Ip (2 bin ) dn 
0 


Jo(k a 2472 
2 o( e Ent/a 
E 2 oo K3 Ji(kn) Ji(K f 


where kn is the nth root of ^h = 0. 


25. u(r,t) — 


b — = Jo( Kn? 
2T; —k | 
Dunt) | = a?b > K3 Ji (knb) J 


where kn is the nth root of 55 (6b) = = 0. 


Section 12.13 


Section 13.1 


1. F(z) = 2z/(2z — 1) if |z| > 1/2 3. F(z) = (36 — 1)/(z6 — 2°) if |z| > 0 
5. F(z) = (a? +a — z)/[z(z — a)] if |z| > a. 


Section 13.2 


1. F(z) = zTe*t /(ze*? — 1)? 3. F(z) = z(z + a)/(z — a? 

5. F(z) = [z — cos(1)]/{z[z? — 2z cos(1) + 1]) 

7. F(z) = z[zsin(0) + sin(woT — 0)]/[z? — 2z cos(woT) + 1] 

9. F(z) 2 z/(z 1) 11. fn *¥9n =n4+1 13. fn * Gn = 2" /n! 


Section 13.3 


1. fo = 0.007143, fı = 0.08503, fo = 0.1626, f3 = 0.2328 
3. fo = 0.09836, fı = 0.3345, fo = 0.6099, f = 0.7935 
5. fn =8—8(5)" -6n(3) 7. fa = (1- a4) /(1 — a) 


9. f E E n—11 


DS Hy 10 + (3) Hau 11. fn = 3(6n — 4)(=1)” + 5 (3) 
13. fa — a^ /n! 


Section 13.4 


l. y, — 14d an(n 1)(2n — 1) 3. Yn = in(n— 1) 
5. Yn = 5 15" - (—1)"] T. Yn = Qn- 1) (3)" + (-3)" 


Answers to the Odd-Numbered Problems 
Section 13.5 
1. marginally stable 3. unstable 


Section 14.1 


Section 14.3 
1. z(t) = et 
Section 14.4 


NE TEE. 


3. a(t) = ür8y 


Section 15.2 


1. G(s) = 1/(s +k) g(t|0) = e-** 
g(t|r) = eŒ A(t — 7) a(t) = (1— e-*)) /k 
3. G(s) = 1/(s? + As + 3) g(t|0) = $(e* — e?!) 
g(t|r) = 5 [go n) = uae | H(t — 7) 
a(t) = e7% — jet + i 
5. G(s) = 1/[(s — 2)(s — 1)] g(t|0) = e% — ef 
g(t|r) = [er(t—-7) e'-7| H(t — 7) a(t) = $ + 4e” — et 
7. G(s) 2 1/(s — 9)? g(t|0) = 4 sinh(3t) 
g(t|r) = i sinh[3(t — 7) H (t — 7) a(t) — z [cosh(3t) — 1] 
9. G(s) = 1/[s(s — 1)] g(t|0) =e’ — 1 
g(t|r) = [e77 — 1] H(t- 7) a(t) - e —t—1 
11. i kei 
and 


g ~ @eL—-1' r = n? (n?n? + L?) 


where n(x) = sin(nzz/L) + nr cos(nzz/ L)/ L. 
13. 
sinh(kzz) sinh[|k(L — xs)| 


gale) = k sinh(kL) 
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and 
sin(nz&/L) sin(nzv/L) 
soe) = arc Sern) 
15. 
_ sinh(ka<){k cosh[k(as — L)] — sinh|k(z.. — L)]) 
g&l) = k sinh(kL) + k? cosh(kL) ; 
and 


(1 + k2) sin(k,£) sin(k,z) 
gre) D» [EF (1 + ALIQ +R)” 


where kn is the nth root of tan(kL) = —k. 


1T. 
[a sinh(kzz) — k cosh(kzz)] cosh[k(L — z>)] 


g(z|£) = kla cosh(kL) — k sinh(KL)] 


and 


a? + k2) cos[k, (£ — L)] eos[k, (x — L) 
g(z£) D» (a? + k2)L = al(hR +h?) — C 


where kn is the nth root of k tan(kL) = — 


Section 15.4 


z . (NTT NT E nrt i L . [nrt 
u(x,t) = 23 sin( L ips e cos( L ) "IP eae sin( L )} 


Section 15.5 


3. 


dfe -HS nf Ps (2n — 1) Cn UR a nf" we a| (2n — e e-nn 


x H(t — 7) 


Answers to the Odd-Numbered Problems 967 


5. 
oo 2,2 
n . (nma E n n't 
u(x,t) = 2p» PI sin( L ) le ew( I ) 
4 Y 1 . [(2m — 1)rz (2m — 1)?z?t 
s 2.3m-1 sin L exp B 
T. 


Section 15.6 


1. 
z nT . (nmE\ . (nme 
g(x, y|£, n) x L exp( 7 ly nl) sin( "5 sin(“™*) 
5. 
ee nr’ nr 
r,0|o,0") = — = roly Seg in( 27) sin( 27) 
g(r, Olp, 0") a 3 : 
T. 


g(r, zl Q) = sum Y » SEA RI. SIUS «(E n 


Section 16.2 

1. (a) S={HH, HT,TH,TT} (b) S = {ab, ac, ba, be, ca, cb} 
(c) S = (aa, ab, ac, ba, bb, bc, ca, cb, cc) (d) S = (bbb, bbg, bgb, bgg, ggb, ggg, gbb, gbg} 
(e) S = (bbb, bbg, bgb, bgg, ggb, ggg, gbb, gbg} 

3. 1/3 5. 1/3 7. 2/13 9. 1/720, 1/120 

11. 1/2 13. 1/2 15. 9/16 


Section 16.3 
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Section 16.4 


0, z«-—1, 
1. Fx(z) = { x à = 3. Fx (2) = a o 2-5 
1, LS: 
Section 16.5 
1. E(X) = 1, and Var(X) = 1 3. k = 3/4, E(X) = 1, and Var(X) = } 


5. óx (w) = (pe +q)”, ux = np, Var(X) = npg 
7. ġx(w) = p/(1 — qe**), ux = q/p, Var(X) = q/p? 
Section 16.6 


1. (a) 1/16, (b) 1/4, (c) 15/16, (d) 1/16 5. P(X > 0) = 0.01, and P(X > 1) = 9 x 107? 
7. P(T < 150) = į, and P(X = 3) = 0.1646 


Section 16.7 


1. 


OEN 
() 


where x; = 0,1,2,3,4,5, y; =0,1,2,3,4,5 and 0 € z; +y; <5. 


pxy [zi yj] E 


Section 17.1 


1. ux (t) ^ 0, and o2, (t) = cos(wt) 


3. For ty = ta, Rx (ti, t2) = Dp; for ty Æ ta, Rx (ti, t2) = p. For ty = ta, C x (t1, t2) = p(1—p); 
for ty Æ ta, C x (t1, t2) = 0. 


Section 17.4 


Ie so [2/3 1/3 
2/3 — (2/3)(1/4)^ eo e dl 255 s 
Section 18.3 


1. E(X)=0, Var(X) = E(X?) = (b — a5)/3 

3. E(X)=0, Var(X) = E(X?) = (b? — a2)/2 

Section 18.5 

1. X(t) = e'/? B(t) + Xo 3. X(t) = B?(t) + tB(t) + Xo. 5. X(t) = tB?(t) — t?/2 + Xo 
7. e ROQ(t) - Q(0) = $ fo POV (n) dn + $ fo PO a(n) dn 


Answers to the Odd-Numbered Problems 969 


9. X(t) = X(0)e** + 1 (1— e74) +2 fo et- dB(n) 
11. X(t) = Xo + e — 1 + ze'[B?(t) — t] 


( 
13. X(t) = e'/? Xy — 2t — 4 + 5et/? — et cos[B(t)] 
17. ; 
X(t) = Xo exp | sin(2t) + sin(f) — 4t + J sin(n) BG) 
0 


19. 
X(t) 2 Xoexp [e + 1)In(¢+1)—t- i8 «f sab) 
0 


Taylor & Francis 


Taylor & Francis Group 


http://taylorandfrancis.com 


Index 


abscissa of convergence, 560 expansion in, 271—285 
absolute value of a complex number, 442 function of the second kind, 273 
Adams-Bashforth method, 40 function, modified, 275 
addition recurrence formulas, 276-277 
of a complex numbers, 441 Bessel, Friedrich Wilhelm, 272 
of matrices, 99 Biot number, 352 
of vectors, 145 boundary condition 
age of the earth, 553-554 Cauchy, 300 
aliasing, 231—233 Dirichlet, 339 
amplitude Neumann, 339 
of a complex number, 442 Robin, 340 
spectrum, 511 boundary-value problems, 46 
analytic complex function, 449 branches 
derivative of, 450 of a complex function, 449 
analytic signals, 718—720 principal, 443 
Archimedes’ principle, 184—185 Bromwich contour, 613 
argument of a complex number, 442 Bromwich integral, 613 
autonomous ordinary differential Bromwich, Thomas John l'Anson, 614 
equation, 4, 48 Buffon's needle problem, 846-847 


auxiliary equation, 50 
carrier frequency, 524 


back substitution, 102, 113 Cauchy 
band-pass functions, 717 boundary condition, 300 
basis function, 795 data, 300 
Bayes’ rule, 812 integral formula, 464—466 
Bernoulli equation, 28—29 principal value, 485—488 
Bessel problem, 300 
equation of order n, 271—275 residue theorem, 473-476 
function of the first kind, 273 Cauchy, Augustin-Louis, 451 
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Cauchy-Goursat theorem, 460 
Cauchy-Riemann eqns, 451 
centered finite differences, 326 
central limit theorem, 827 
Chapman-Kolmogorov equation, 868-869 
characteristic 

polynomial, 124 

equation, 50 

value, 124, 240 

vector, 124 
characteristic function, 240, 830-831 
characteristics, 318 
chemical reaction, 12-13, 882-883 
circular frequency, 58 
circulation, 159 
closed 

contour integral, 157, 459 

surface integral, 163 
coefficient matrix, 112 
cofactor, 105 
column of a matrix, 98 
column vector, 98 
combinations, 811 
complementary error function, 565 
complementary solution of an 

ordinary differential equation, 66 
complex-valued 

function, 448—450 

conjugate, 441 

number, 441 

plane, 442 

variable, 441 
complex matrix, 98 
components of a vector, 145 
compound interest, 9, 694—695 
conformable 

for addition of matrices, 99 

for multiplication of matrices, 99 
conformal mapping, 491—507 
conservative field, 159 
consistency in finite differencing 

for the heat equation, 378 

for the wave equation, 329 
consistent system of linear eqns, 111 
contour integrals, 456—460 
convergence 

of a Fourier integral, 512 

of finite difference solution 

for heat equation, 379 


for wave equation, 330 
of Fourier series, 189 
convolution theorem 
for Fourier transforms, 544—547 
for Hilbert transforms, 715-717 
for Laplace transforms, 588—591 
for z-transforms, 678 
Coriolis force, 147 
Cramer's rule, 108 
Crank-Nicholson method, 382 
critical points, 33, 88 
stable, 33, 89 
stable node, 90 
unstable, 33, 89 
cross product, 146 
curl, 154 
curve, 
simply closed, 460 
space, 146 
cutoff frequency, 754 


d’Alembert’s formula, 320 
d'Alembert's solution, 318-324 
d'Alembert, Jean Le Rond, 319 
damped harmonic motion, 61, 899-900 
damping constant, 61 
de Moivre's theorem, 443 
deformation principle, 462 
degenerate eigenvalue problem, 247 
del operator, 148 
delay differential equation, 608-609 
delta function, 512-513, 568-570 
design of film projectors, 585—587 
design of wind vane, 64-65 
determinant, 104—108 
diagonal, principal, 98 
difference eqns, 667 
differential eqns 

nth order, 45-95 

first-order, 1-43 

nonlinear, 1 

order, 1 

ordinary, 1-97 

partial, 1, 297—336, 

331—384, 385-440 

stochastic, 929—932 

type, 1 
differentiation of a Fourier series, 199 
diffusivity, 338 


Index 


dimension of a vector space, 125 
direction fields, 31 
Dirichlet conditions, 189 
Dirichlet problem, 339 
Dirichlet, Peter Gustav Lejeune, 191 
dispersion, 306 
divergence 
of a vector, 153 
theorem, 179-185 
division of complex numbers, 441 
dot product, 146 
double Fourier series, 427 
dual Fourier-Bessel series, 404 
dual integral eqns, 401 
Duhamel’s theorem 
for ordinary differential equation, 741 
for the heat equation, 651—659 


eigenfunctions, 239-256 
expansion in, 251 
orthogonality of, 249 
eigenvalue(s) 
of a matrix, 124 
of a Sturm-Liouville problem, 239—247 
eigenvalue problem, 124-129 
for ordinary differential eqns, 239-247 
singular, 240 
eigenvectors, 124-129 
orthogonality of, 132 
electrical circuits, 24, 76, 604—609 
electrostatic potential, 392 
element of a matrix, 98 
elementary row operations, 111 
elliptic partial differential equation, 385 
entire complex function, 449 
equilibrium points, 33, 88 
equilibrium systems of linear eqns, 111 
error function, 565 
essential singularity, 469 
Euler's formula, 442 
Euler’s method, 34-36 
Euler-Cauchy equation, 83-86 
evaluation of partial sums 
using z-transform, 688 
exact ordinary differential equation, 17 
existence of ordinary differential eqns 
nth-order, 46 
first-order, 8 
explicit numerical methods 
for the heat equation, 377 
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for the wave equation, 326 
exponential order, 560 


fast Fourier transform (FFT), 231 
filter, 234 
final-value theorem 
for Laplace transforms, 575 
for z-transform, 676 
finite difference approximation 
to derivatives, 326 
finite element, 795 
finite Fourier series, 225—234 
first-order ordinary differential eqns, 1-43 
linear, 20-31 
first-passage problem, 901—903 
flux lines, 150 
folding frequency, 233 
forced harmonic motion, 71—76 
Fourier 
coefficients, 188 
cosine series, 194 
cosine transform, 555 
Joseph, 190 
number, 347 
series for a multivalued function, 217 
series for an even function, 194 
series for an odd function, 194 
series in amplitude/phase form, 211—213 
series on [—L, L], 187-198 
sine series, 194 
sine transform, 555 
Fourier coefficients, 253 
Fourier cosine series, 194 
Fourier transform, 509—549, 736-740 
basic properties of, 520—530 
convolution, 544—547 
inverse of, 510, 532-542 
method of solving the heat eqn, 551—556 
of a constant, 518 
of a derivative, 524 
of a multivariable function, 514 
of a sign function, 519 
of a step function, 520 
Fourier-Bessel 
coefficients, 279 
expansions, 277 
Fourier-Legendre 
coefficients, 264 
expansion, 263 
Fredholm integral eqn, 121 
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free umderdamped motion, 58 
frequency convolution, 546 
frequency modulation, 526 
frequency response, 735 
frequency spectrum, 511 

for a damped harmonic 

oscillator, 736-737 

for low-frequency filter, 738-139 
function 

even extension of, 206 

generalized, 570 

multiplied complex, 448 

odd extension of, 206 

single-valued complex, 448 

vector-valued, 148 
fundamental of a Fourier series, 188 


Galerkin method, 795-801 
gambler’s ruin problem, 858 
Gauss's divergence theorem, 179-185 
Gauss, Carl Friedrich, 180 
Gauss-Seidel method, 429 
Gaussian elimination, 114 
general solution to an 

ordinary differential equation, 4 
generalized Fourier series, 252 
generalized functions, 570 
generating function 

for Legendre polynomials, 260 
Gibbs phenomenon, 202-205, 267 
gradient, 148 
graphical stability analysis, 33 
Green's function, 725-742 

for a damped harmonic oscillator, 737 

for low-frequency filter, 738 
Green's lemma, 169-172 
grid point, 325 
groundwater flow, 388-392 


half-range expansions, 206-209 
Hankel transform, 399 
harmonic functions, 386, 455 
complex, 455 
harmonics of a Fourier series, 188 
heat conduction 
in a rotating satellite, 220—224 
within a metallic sphere, 409-414 
heat dissipation in disc brakes, 640-642 
heat equation, 337-383, 551-556, 637-662 


for a semi-infinite bar, 551—553 
for an infinite cylinder, 357—360 
nonhomogeneous, 339 
one-dimensional, 340-343 
within a solid sphere, 355-357 
Heaviside 
expansion theorem, 581—587 
step function, 563-565 
Heaviside, Oliver, 564 
Hilbert pair, 704 
Hilbert transform, 703-723 
and convolution, 715-716 
and derivatives, 714—715 
and shifting, 714 
and time scaling, 714 
discrete, 711—712 
linearity of, 713 
product theorem, 716-717 
Hilbert, David, 705 
holomorphic complex function, 449 
homogeneous 
ordinary differential eqns, 16-17, 45 
solution to ordinary differential eqn, 66 
system of linear eqns, 101 
hydraulic potential, 338 
hydrostatic equation, 8 
hyperbolic partial differential equation, 297 


ideal Hilbert transformer, 703 
ideal sampler, 668 
imaginary part of a complex number, 441 
importance sampling, 833 
impulse function 
see (Dirac) delta function. 
impulse response, 732 
inconsistent system of linear eqns, 111 
indicial admittance for ordinary 
differential eqns, 733-734 
of heat equation, 654 
inertia supercharging, 208 
initial 
-value problem, 45, 597—610 
conditions, 300 
initial-boundary-value problem, 339 
initial-value theorem 
for Laplace transforms, 574 
for z-transforms, 676 
inner product, 99 
integral curves, 87 


Index 


integral equation, 731 
of convolution type, 592-594 
integrals 
complex contour, 456—460 
Fourier type, evaluation of, 536-537 
line, 156-159 
real, evaluation of, 477—482 
integrating factor, 19 
integration of a Fourier series, 200-201 
interest rate, 9, 694 
inverse 
discrete Fourier transform, 226 
Fourier transform, 510, 532-542 
Hilbert transform, 704 
Laplace transform, 581—588, 613-617 
z-transform, 681—689 
inversion formula 
for the Fourier transform, 510 
for the Hilbert transform, 704 
for the Laplace transform, 613-617 
for the z-transform, 681—689 
inversion of Fourier transform 
by contour integration, 533-542 
by direct integration, 532 
by partial fraction, 532—533 
inversion of Laplace transform 
by contour integration, 614-617 
by convolution, 588 
by partial fractions, 581—583 
in amplitude/phase form, 583-587 
inversion of z-transform 
by contour integration, 685—689 
by partial fractions, 683-685 
by power series, 681-683 
by recursion, 682-683 
irrotational, 154 
isoclines, 31 
isolated singularities, 452 
iterative methods 
Gauss-Seidel, 429 
successive over-relaxation, 432 
iterative solution of the radiative 
transfer equation, 267—269 
Itó process, 896 
Itó's integral, 916-921 
Itó's lemma, 920-928 
Itó, Kiyhosi, 923 


joint transform method, 752 
Jordan curve, 460 
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Jordan’s lemma, 533 


Kirchhoff's law, 24 
Klein-Gordon equation, 307 
Kramers-Kronig relationship, 721—223 


Lagrange’s trigonometric identities, 445 
Laguerre polynomial, 596 
Laplace integral, 559 
Laplace transform, 559-619 
basic properties of, 571—571 
convolution for, 588—591 
definition of, 559 
derivative of, 573 
in solving 
delay differential equation, 608—609 
heat equation, 637-644 
integral eqns, 592-594 
Laplace equation, 662-664 
wave equation, 619-629 
integration of, 574 
inverse of, 581—587, 613-617 
of derivatives, 562 
of periodic functions, 579—581 
of the delta function, 568-570 
of the step function, 563-565 
Schouten-van der Pol theorem for, 619 
solving of ordinary 
differential eqns, 597-610 
Laplace's eqn, 385-433, 550—551, 662-664 
finite element solution of, 433 
in cylindrical coordinates, 386 
in spherical coordinates, 387 
numerical solution of, 428—433 
solution by Laplace transforms, 662-664 
solution by separation 
of variables, 388-415 
solution on the half-plane, 549—551 
Laplace's expansion in cofactors, 105 
Laplace, Pierre-Simon, 387 
Laplacian, 153 
Laurent expansion, 469 
Lax-Wendroff scheme, 334 
Legendre polynomial, 259 
expansion in, 263 
generating function for, 260—261 
orthogonality of, 263 
recurrence formulas, 261 
Legendre's differential equation, 257 
Legendre, Adrien-Marie, 257 
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length of a vector, 145 unit, 98 
line integral, 156-159, 456-460 upper triangular, 102 
line spectrum, 215 zero, 98 
linear dependence matrix exponential, 139 
of eigenvectors, 124 maximum principle, 386 
of functions, 53 Maxwell's field eqns, 156 
linear transformation, 103 mean, 828-830 
linearity mechanical filter, 587 
of Fourier transform, 520 meromorphic function, 452 
of Hilbert transform, 713 method of partial fractions 
of Laplace transform, 561 for Fourier transform, 532-533 
of z-transform, 674 for Laplace transform, 581—587 
lines of force, 150 for z-transform, 683-685 
Liouville, Joseph, 241 method of undetermined coefficients, 67-70 
logistic equation, 12 minor, 105 
low-frequency filter, 738-739 mixed boundary-value problems, 400—405 
low-pass filter, 900—902 modified Bessel function, 
LU decomposition, 122 first kind, 275 
second kind, 275 
magnitude of a vector, 145 Euler method, 34-36 
Markov chain modified Euler method, 34 
state, 867 modulation, 524—527 
state transition, 867 modulus of a complex number, 442 
time homogeneous, 868 Monte Carlo integration, 833 
matrices multiplication 
addition of, 99 of complex numbers, 441 
equal, 98 of matrices, 99 
multiplication, 99 multivalued complex function, 448 
matrix, 97 
algebra, 97 nabla operator, 148 
amplification, 129 natural vibrations, 306 
augmented, 112 Neumann problem, 339 
banded, 102 Neumann's Bessel function of order n, 274 
coefficient, 112 Newton's law of cooling, 351 
complex, 98 nonanticipating process, 917 
diagonalization, 138 nondivergent, 153 
exponential, 139 nonhomogeneous 
identity, 98 heat equation, 339 
inverse, 100 ordinary differential equation, 45 
invertible, 100 system of linear eqns, 101 
method of stability norm of a vector, 98, 145 
of a numerical scheme, 128 normal differential equation, 45 
nonsingular, 100 normal mode, 306 
null, 98 normal to a surface, 148 
orthogonal, 123 not simply connected, 460 
real, 98 numerical solution 
rectangular, 98 of heat equation, 377—383 
square, 98 of Laplace's equation, 428-433 
symmetric, 98 of stochastic differential eqn, 936-943 


tridiagonal, 102 of the wave equation, 326-334 


Index 


Nyquist frequency, 233 
Nyquist sampling criteria, 231 


one-sided finite difference, 326 
order 

of a matrix, 98 

of a pole, 470 
orthogonal matrix, 123 
orthogonality, 249 

of eigenfunctions, 249-251 

of eigenvectors, 132 
orthonormal eigenfunction, 251 


overdamped ordinary differential eqn, 62 
overdetermined system of linear eqns, 116 


parabolic partial differential eqn, 338 
Parseval's equality, 201—202 
Parseval's identity 
for Fourier series, 201 
for Fourier transform, 527—529 
for z-transform, 688 
partial fraction expansion 
for Fourier transform, 532 
for Laplace transform, 581—587 
for z-transform, 683-685 
particular solution to ordinary 
differential equation, 3, 66 
path 
in complex integrals, 457 
in line integrals, 157 
path independence 
in complex integrals, 462 
in line integrals, 159 
permutation, 811 
phase, 442 
angle in Fourier series, 211-213 
diagram, 87 
line, 33 
path, 87 
spectrum, 511 
phase of the complex number, 442 
phasor amplitude, 721 
pivot, 112 
pivotal row, 112 
Poisson process, 886-891 
arrival time, 889 
Poisson’s equation, 425-427 
integral formula 
for a circular disk, 414—415 
for upper half-plane, 551 


summation formula, 529—532 
Poisson's summation formula, 529 
Poisson, Siméon-Denis, 426 
polar form of a complex number, 442 
pole of order n, 470 
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population growth and decay, 11, 874—883 


position vector, 145 

positive oriented curve, 463 
potential flow theory, 155 
potential function, 161—162 
power content, 201 

power spectrum, 528, 864-867 
principal branch, 443 
principal diagonal, 98 


principle of linear superposition, 50, 303 


probability 
Bernoulli distribution, 834 
Bernoulli trials, 820 
binomial distribution, 836 
characteristic function, 830 
combinations, 811 
conditional, 812 


continuous joint distribution, 844 


correlation, 850 
covariance, 848 
cumulative distribution, 824 
distribution function, 824 
event, 806 

elementary, 806 

simple, 806 
expectation, 828 
experiment, 805 
exponential distribution, 839 
Gaussian distribution, 840 
geometric distribution, 835 
independent events, 814 


joint probability mass function, 842 


law of total probability, 813 


marginal probability functions, 842 


mean, 828 

normal distribution, 840 
permutation, 811 

Poisson distribution, 837 
probability integral, 841 
probability mass function, 818 
random variable, 817—818 
sample point, 805 

sample space, 805 


standard normal distribution, 841 


uniform distribution, 838 
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probability (continued) 
variance, 828 


QR decomposition, 123 
quadrature phase shifting, 703 
quieting snow tires, 195-198 


radiation condition, 300, 351 
radius of convergence, 467 
random differential equation, 897-898 
random process, 855-891 
autocorrelation function, 860 
Bernoulli process, 856 
Brownian motion, 905-913 
chemical kinetics, 879 
counting process, 857 
mean, 858 
power spectrum, 864 
realization, 855 
sample function, 855 
sample path, 855 
state, 855 
state space, 855 
variance, 858 
wide-sense stationary processes, 862 
Wiener process, 912-913 
random variable, 817 
discrete, 818 
domain, 817 
range, 817 
rank of a matrix, 114 
real definite integrals 
evaluation of, 477—482 
real matrix, 98 
real part of a complex number, 441 
rectangular matrix, 98 
recurrence relation 
for Bessel functions, 276-277 
for Legendre polynomial, 261—263 
in finite differencing, 92 
reduction in order, 47 
regular complex function, 449 
regular Sturm-Liouville problem, 240 
relaxation methods, 429—433 
removable singularity, 470 
residue, 469 
residue theorem, 472-476 
resonance, 75, 219, 600 
rest points, 33 
Riemann, Georg Friedrich Bernhard, 452 


Robin problem, 340 

Rodrigues’ formula, 260 

root locus method, 737 

roots of a complex number, 445—447 
row echelon form, 113 

row vector, 98 

rows of a matrix, 98 

Runge, Carl, 38 

Runge-Kutta method, 37—40, 93-97 


scalar, 145 
Schouten-Van der Pol theorem, 619 
Schwarz-Christoffel transformation, 496—507 
Schwarz’ integral formula, 551 
second shifting theorem, 565 
secular term, 219 
separation of variables 
for heat equation, 340—366 
for Laplace's equation, 388-415 
for ordinary differential eqns, 4-14 
for Poisson's equation, 425—427 
for wave equation, 300—314 
set, 804 
complement, 804 
disjoint, 804 
element, 804 
empty, 804 
intersection, 804 
null, 804 
subset, 804 
union, 804 
universal, 804 
shifting 
in the w variable, 525 
in the s variable, 571 
in the t variable, 521, 572 
sifting property, 513 
simple 
eigenvalue, 242 
pole, 470 
simple harmonic motion, 58, 600 
simple harmonic oscillator, 57—61 
simply closed curve, 460 
sinc function, 511 
single side-band signal, 720 
single-valued complex function, 448 
singular 
solutions to ordinary differential 
eqns, 6 
Sturm-Liouville problem, 240 


Index 


singular Sturm-Liouville problem, 240 
singular value decomposition, 132 
singularity 
essential, 469 
isolated, 469 
pole of order n, 470 
removable, 470 
slope field, 31 
solenoidal, 153 
solution curve, 31 
solution of ordinary differential eqns 
by Fourier series, 217—224 
by Fourier transform, 547—549, 135-140 
space curve, 146 
spectral radius, 124 
spectrum of a matrix, 124 
square matrix, 98 
stability of numerical methods 
by Fourier method for heat eqn, 379 
by Fourier method for wave eqn, 329 
by matrix method 
for wave equation, 128 
steady-state heat equation, 10, 347 
steady-state output, 33 
steady-state solution to ordinary 
differential eqns, 73 
steady-state transfer function, 735 
step function, 563—565 
step response, 733 
stochastic calculus, 895-941 
Brownian motion, 906—913 
damped harmonic motion, 899—900 
derivative, 895 
differential eqns, 928-932 
first-passage problem, 901—903 
integrating factor, 930 
Itó process, 896 
Itó's integral, 916-921 
Itó's lemma, 920—928 
low-pass filter, 900—902 
nonlinear oscillator, 941 
numerical solution, 936—939 
Euler-Marugama method, 937 
Milstein method, 938 
product rule, 926, 929 
random differential eqns, 897-898 
RL electrical circuit with noise, 939 
wave motion due to random 
forcing, 904—906 
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Wiener process, 913 
stochastic process, 855 
Stokes’ theorem, 173-178 
Stokes, Sir George Gabriel, 174 
streamlines, 150 
Sturm, Charles-Frangois, 240 
Sturm-Liouville 

equation, 240 

problem, 239-247 
subtraction 

of complex numbers, 441 

of matrices, 99 

of vectors, 145 
successive over-relaxation, 429 
superposition integral, 727 

for ordinary differential eqns, 741 

of heat equation, 651—659 
superposition principle, 303 
surface conductance, 351 
surface integral, 162-166 
system of linear 

differential eqns, 133-137 

homogeneous eqns, 101 

nonhomogeneous eqns, 101 


tangent vector, 146 
Taylor expansion, 467 
telegraph equation, 309, 620-629 
telegraph signal, 856, 863 
terminal velocity, 9, 27 
thermal conductivity, 338 
threadline equation, 299—300 
time shifting, 521, 571 
trajectories, 87 
transfer function, 732 
transform 

Fourier, 509-549, 736-740 

Hilbert, 703—723 

Laplace, 559-619 

z-, 667—702 
transient solution to ordinary 

differential eqns, 73 

transmission line, 620—629 
transmission probability matrix, 869 
transpose of a matrix, 101 
tridiagonal matrix, solution of, 101-102 


underdamped, 62 
underdetermined system of linear eqns, 113 
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uniformitarism, 554 
uniqueness of ordinary differential eqns 
nth-order, 46 
first-order, 3 
unit 
normal, 149 
step function, 563—565 
vector, 145 


Vandermonde's determinant, 108 
variance, 828-830 

variation of parameters, 78-83 
vector, 98, 145 

vector element of area, 165 
vector space, 125 

Venn diagram, 804 

vibrating string, 297—299 
vibrating threadline, 299—300 
vibration of floating body, 60 
Volterra equation of the second kind, 592 
volume integral, 179-185 


wave equation, 297—334, 619-635 
damped, 308-311 
for a circular membrane, 312-314 
for an infinite domain, 318-324 


one-dimensional, 299 
wave motion due 

to random forcing, 904—906 
weight function, 249 
Wiener process, 860, 913 
Wiener, Norbert, 911 
Wronskian, 54 


z-transform, 667—702 

basic properties of, 674—680 

convolution for, 678 

final-value theorem for, 676-671 

for solving difference eqns, 691—697 

initial-value theorem for, 676 

inverse of, 681—689 

linearity of, 674 

multiplication by n, 677 

of a sequence multiplied by an 
exponential sequence, 674 

of a shifted sequence, 674-676 

of periodic sequences, 671—678 

their use in determining 
stability, 697—702 


zero vector, 145 


